Video surveillance image enhancement via a convolutional neural network and stacked denoising autoencoder

Che Aminudin, Muhamad Faris; Suandi, Shahrel Azmin

doi:10.1007/s00521-021-06551-0

Video surveillance image enhancement via a convolutional neural network and stacked denoising autoencoder

Original Article
Published: 08 October 2021

Volume 34, pages 3079–3095, (2022)
Cite this article

Neural Computing and Applications Aims and scope Submit manuscript

822 Accesses
7 Citations
2 Altmetric
Explore all metrics

Abstract

In an extensive-scale surveillance system, the quality of the surveillance camera installed varies. This variation of surveillance camera produces different image quality in terms of resolution, illumination, and noise. The quality of the captured image depends on the surveillance camera hardware, placement and orientation, and the surrounding light. A pixelated, low illumination and noisy image produced by a low-quality surveillance camera causes critical issues for video surveillance face recognition systems. To address these issues, a deep learning image enhancement (DLIE) model is proposed. By utilizing a deep learning architecture such as a convolutional neural network (CNN) and a denoising autoencoder, the image quality can be enhanced. The DLIE model is able to improve image resolution and illumination and reduce noise in an image. There are two deep learning blocks (DLB) in the DLIE model, which are DLB1 and DLB2. Both DLBs are arranged in parallel so that all the stated problems can be addressed simultaneously. DLB1 is proposed to address the occurrence of pixelated images by reconstructing a low-resolution image into a high-resolution image using a CNN. DLB2 used the capability of a denoising autoencoder to reconstruct the corrupted image into a clean image by enhancing the dark and noisy images. The output of each DLB is fused using image fusion to obtain the optimum image quality. The image is evaluated using the peak to signal noise ratio (PSNR) and structural similarity index (SSIM). The enhanced image from the DLIE model exhibits superior quality compared to the original image ranging from 13.3625 to 22.7728 for PSNR and 0.6207 to 0.8155 for SSIM.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 2

Effectiveness of State-of-the-Art Super Resolution Algorithms in Surveillance Environment

Enhancing Image Resolution and Denoising Using Autoencoder

On the Efficacy of Deep Image Denoising for Computer Vision Applications

References

Amolins K, Zhang Y, Dare P (2007) Wavelet based image fusion techniques-an introduction, review and comparison. ISPRS J Photogram Remote Sens 62(4):249–263. https://doi.org/10.1016/j.isprsjprs.2007.05.009
Article Google Scholar
Bonetto R, Latzko V (2020) Chapter 8-machine learning. In: Fitzek FH, Granelli F, Seeling P (eds) Computing in communication networks. Academic Press, London
Google Scholar
Bovik AC, Acton ST (2009) Chapter 10—basic linear filtering with application to image enhancement. In: Bovik A (ed) The essential guide to image processing. Academic Press, Boston, pp 225–239. https://doi.org/10.1016/B978-0-12-374457-9.00010-X
Chapter Google Scholar
Buber E, Diri B (2018) Performance analysis and cpu vs gpu comparison for deep learning. In: 2018 6th International conference on control engineering information technology (CEIT), pp 1–6
Cao G, Huang L, Tian H, Huang X, Wang Y, Zhi R (2018) Contrast enhancement of brightness-distorted images by improved adaptive gamma correction. Comput Electr Eng 66:569–582. https://doi.org/10.1016/j.compeleceng.2017.09.012
Article Google Scholar
Cermeño E, Pérez A, Sigüenza JA (2018) Intelligent video surveillance beyond robust background modeling. Expert Syst Appl 91:138–149. https://doi.org/10.1016/j.eswa.2017.08.052
Article Google Scholar
Chen CY, Chen CH, Chen CH, Lin KP (2016) An automatic filtering convergence method for iterative impulse noise filters based on psnr checking and filtered pixels detection. Expert Syst Appl 63:198–207. https://doi.org/10.1016/j.eswa.2016.07.003
Article Google Scholar
Dong C, Loy CC, He K, Tang X (2016a) Image super-resolution using deep convolutional networks. IEEE Trans Pattern Anal Mach Intell 38(2):295–307. https://doi.org/10.1109/TPAMI.2015.2439281
Article Google Scholar
Dong C, Loy CC, Tang X (2016b) Accelerating the super-resolution convolutional neural network. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 9906 LNCS:391–407, 1608.00367
Fan Z, Hu X, Chen C, Wang X, Peng S (2020) Facial image super-resolution guided by adaptive geometric features. Eurasip J Wirel Commun Netw 2020(1). https://doi.org/10.1186/s13638-020-01760-y
Article Google Scholar
Gai S, Bao Z (2019) New image denoising algorithm via improved deep convolutional neural network with perceptive loss. Expert Syst Appl 138:112815. https://doi.org/10.1016/j.eswa.2019.07.032
Article Google Scholar
Gharbi M, Chaurasia G, Paris S, Durand F (2016) Deep joint demosaicking and denoising. ACM Trans Graph 35(6), ACM SIGGRAPH Asia Conference, Macao, 2016. https://doi.org/10.1145/2980179.2982399
Grgic M, Delac K, Grgic S (2011) SCface—surveillance cameras face database. Multimed Tools Appl 51(3):863–879. https://doi.org/10.1007/s11042-009-0417-2
Article Google Scholar
He H, Siu WC (2011) Single image super-resolution using gaussian process regression. CVPR 2011:449–456. https://doi.org/10.1109/CVPR.2011.5995713
Article Google Scholar
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: 2016 IEEE conference on computer vision and pattern recognition (CVPR), pp 770–778
Hore A, Ziou D (2010) Image quality metrics: Psnr vs. ssim. In: 2010 20th International conference on pattern recognition, pp 2366–2369. https://doi.org/10.1109/ICPR.2010.579
Huynh-Thu Q, Ghanbari M (2008) Scope of validity of psnr in image/video quality assessment. Electron Lett 44(13):800–801
Article Google Scholar
Iizuka S, Simo-Serra E, Ishikawa H (2016) Let there be color! joint end-to-end learning of global and local image priors for automatic image colorization with simultaneous classification. ACM Trans Graph 10(1145/2897824):2925974
Google Scholar
Jain V, Seung S (2009) Natural image denoising with convolutional networks. In: Koller D, Schuurmans D, Bengio Y, Bottou L (eds) Advances in neural information processing systems 21. Curran Associates Inc, London, pp 769–776
Google Scholar
Jiang Y, Gong X, Liu D, Cheng Y, Fang C, Shen X, Yang J, Zhou P, Wang Z (2021) Enlightengan: deep light enhancement without paired supervision. IEEE Trans Image Process 30:2340–2349. https://doi.org/10.1109/TIP.2021.3051462
Article Google Scholar
Jianji W, Chen P, Zheng N, Chen B, Principe JC, Wang FY (2021) Associations between mse and ssim as cost functions in linear decomposition with application to bit allocation for sparse coding. Neurocomputing 422:139–149. https://doi.org/10.1016/j.neucom.2020.10.018
Article Google Scholar
Kim J, Lee JK, Lee KM (2015) Accurate image super-resolution using very deep convolutional networks. IEEE Trans Pattern Anal Mach Intell 38(2):295–307. https://doi.org/10.1109/TPAMI.2015.2439281
Article Google Scholar
Ko KE, Sim KB (2018) Deep convolutional framework for abnormal behavior detection in a smart surveillance system. Eng Appl Artif Intell 67:226–234. https://doi.org/10.1016/j.engappai.2017.10.001
Article Google Scholar
Kwasniewska A, Ruminski J, Szankin M, Kaczmarek M (2020) Super-resolved thermal imagery for high-accuracy facial areas detection and analysis. Eng Appl Artif Intell 87:103263. https://doi.org/10.1016/j.engappai.2019.103263
Article Google Scholar
Lore KG, Akintayo A, Sarkar S (2017) LLNet: A deep autoencoder approach to natural low-light image enhancement. Pattern Recogn 61(Supplement C):650–662(Supplement C):650–652. https://doi.org/10.1016/j.patcog.2016.06.008
Article Google Scholar
Łoza A, Bull DR, Hill PR, Achim AM (2013) Automatic contrast enhancement of low-light images based on local statistics of wavelet coefficients. Digital Signal Process 23(6):1856–1866. https://doi.org/10.1016/j.dsp.2013.06.002
Article Google Scholar
Mafi M, Martin H, Cabrerizo M, Andrian J, Barreto A, Adjouadi M (2019) A comprehensive survey on impulse and gaussian denoising filters for digital images. Signal Process 157:236–260. https://doi.org/10.1016/j.sigpro.2018.12.006
Article Google Scholar
Meena SK, Potnis A, Mishra M, Dwivedy P, Soofi S (2017) Review and application of different contrast enhancement technique on various images. In: 2017 1st International conference on electronics, materials engineering and nano-technology (IEMENTech), pp 1–6. https://doi.org/10.1109/IEMENTECH.2017.8076993
Nixon M, Aguado AS (2012) Feature extraction and image processing for computer vision, 3rd edn. Academic Press, London
Google Scholar
Omar N, Sengur A, Al-Ali SGS (2020) Cascaded deep learning-based efficient approach for license plate detection and recognition. Expert Syst Appl. https://doi.org/10.1016/j.eswa.2020.113280
Article Google Scholar
Pang S, [del Coz] JJ, Yu Z, Luaces O, Díez J, (2017) Deep learning to frame objects for visual target tracking. Eng Appl Artif Intell 65:406–420. https://doi.org/10.1016/j.engappai.2017.08.010
Article Google Scholar
Shen X, Tao X, Gao H, Zhou C, Jia J (2016). Deep automatic portrait matting. In: Leibe B, Matas J, Sebe, N, Welling M (eds) Computer vision-ECCV 2016, PT I, Lecture Notes in Computer Science, vol 9905, pp 92–107, 10.1007/978-3-319-46448-0\_6, 14th European conference on computer vision (ECCV), Amsterdam, 2016
Sonali Sahu S, Singh AK, Ghrera S, Elhoseny M (2019) An approach for de-noising and contrast enhancement of retinal fundus image using clahe. Optics Laser Technol 110:87–98. https://doi.org/10.1016/j.optlastec.2018.06.061
Article Google Scholar
Tao L, Zhu C, Xiang G, Li Y, Jia H, Xie X (2017) Llcnn: a convolutional neural network for low-light image enhancement. In: 2017 IEEE visual communications and image processing (VCIP), pp 1–4
Vanmali AV, Kataria T, Kelkar SG, Gadre VM (2020) Ringing artifacts in wavelet based image fusion: analysis, measurement and remedies. Inf Fusion 56:39–69. https://doi.org/10.1016/j.inffus.2019.10.003
Article Google Scholar
Vincent P, Larochelle H, Bengio Y, Manzagol PA (2008) Extracting and composing robust features with denoising autoencoders. In: Proceedings of the 25th international conference on machine learning-ICML ’08, pp 1096–1103. https://doi.org/10.1145/1390156.1390294, http://portal.acm.org/citation.cfm?doid=1390156.1390294, arXiv:1412.6550v4
Wang X, Tao Q, Wang L, Li D, Zhang M (2015) Deep convolutional architecture for natural image denoising. In: 2015 International conference on wireless communications and signal processing (WCSP), pp 1–4. https://doi.org/10.1109/WCSP.2015.7341021, http://ieeexplore.ieee.org/document/7341021/
Wu S, Zhong S, Liu Y (2017) Deep residual learning for image steganalysis. Multimed Tools Appl. https://doi.org/10.1007/s11042-017-4440-4
Article Google Scholar
Xiao B, Tang H, Jiang Y, Li W, Wang G (2018) Brightness and contrast controllable image enhancement based on histogram specification. Neurocomputing 275:2798–2809. https://doi.org/10.1016/j.neucom.2017.11.057
Article Google Scholar
Zhang K, Zuo W, Chen Y, Meng D, Zhang L (2017) Beyond a gaussian denoiser: residual learning of deep cnn for image denoising. IEEE Trans Image Process 26(7):3142–3155
Article MathSciNet MATH Google Scholar
Zhou W, Bovik AC, Sheikh HR, Simoncelli EP (2004) Image quality assessment: from error visibility to structural similarity. IEEE Trans Image Process 13(4):600–612. https://doi.org/10.1109/TIP.2003.819861
Article Google Scholar

Download references

Acknowledgements

This research is fully supported by Universiti Sains Malaysia Bridging Grant (Bridging Grant) No. 304/PELECT/6316115 and Universiti Sains Malaysia Research University Individual (RUI) Research Grant No. 1001/PELECT/8014056.

Author information

Authors and Affiliations

Intelligent Biometric Group, School of Electrical and Electronic Engineering, USM Engineering Campus, Universiti Sains Malaysia, 14300, Nibong Tebal, Pulau Pinang, Malaysia
Muhamad Faris Che Aminudin & Shahrel Azmin Suandi

Authors

Muhamad Faris Che Aminudin
View author publications
You can also search for this author in PubMed Google Scholar
Shahrel Azmin Suandi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Shahrel Azmin Suandi.

Ethics declarations

Conflict of interest

The authors declare that there is no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This research is fully supported by Universiti Sains Malaysia Bridging Grant (Bridging Grant) No. 304/PELECT/6316115 and Universiti Sains Malaysia Research University Individual (RUI) Research Grant No. 1001/PELECT/8014056.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Che Aminudin, M.F., Suandi, S.A. Video surveillance image enhancement via a convolutional neural network and stacked denoising autoencoder. Neural Comput & Applic 34, 3079–3095 (2022). https://doi.org/10.1007/s00521-021-06551-0

Download citation

Received: 21 March 2021
Accepted: 14 September 2021
Published: 08 October 2021
Issue Date: February 2022
DOI: https://doi.org/10.1007/s00521-021-06551-0

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Video surveillance image enhancement via a convolutional neural network and stacked denoising autoencoder

Abstract

Access this article

Similar content being viewed by others

Effectiveness of State-of-the-Art Super Resolution Algorithms in Surveillance Environment

Enhancing Image Resolution and Denoising Using Autoencoder

On the Efficacy of Deep Image Denoising for Computer Vision Applications

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Video surveillance image enhancement via a convolutional neural network and stacked denoising autoencoder

Abstract

Access this article

Similar content being viewed by others

Effectiveness of State-of-the-Art Super Resolution Algorithms in Surveillance Environment

Enhancing Image Resolution and Denoising Using Autoencoder

On the Efficacy of Deep Image Denoising for Computer Vision Applications

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation