Enhanced detection and recognition system for vehicles and drivers using multi-scale retinex guided filter and machine learning

Mahmood, Zahid; Khan, Khurram; Shahzad, Mohsin; Fayyaz, Ahmad; Khan, Uzair

doi:10.1007/s11042-023-16140-z

Enhanced detection and recognition system for vehicles and drivers using multi-scale retinex guided filter and machine learning

Published: 24 July 2023

Volume 83, pages 15785–15824, (2024)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

Zahid Mahmood ORCID: orcid.org/0000-0003-4121-3558¹,
Khurram Khan^2,3,
Mohsin Shahzad¹,
Ahmad Fayyaz¹ &
…
Uzair Khan¹

171 Accesses
1 Citation
Explore all metrics

Abstract

Accurate vehicle detection plays a vital role in intelligent transportation systems. Various day conditions, for instance, dawn, morning, noon, or non-uniform illuminations put restrictions on camera’s visibility. Such scenarios impact the performance of detection and recognition algorithms that are used in surveillance systems and autonomous driving. This paper aims to solve the aforementioned issues using machine learning methods, such as face detection and recognition. The core theme of this paper is the development of a vehicle detection and driver recognition system, which also focuses the situation where an input image is degraded by non-uniform illuminations. The proposed system is composed of four main processing modules: (i) image acquisition, (ii) image enhancement, (iii) object detection that locates vehicles’ and drivers’ faces, and (iv) the Pool of Face Recognition Algorithms (PoFRA), which uses four face recognition algorithms to conclude the driver’s identity. We implement suitable algorithms for each of the above-described modules to appraise its practicability. The system can be adjusted to work in different types of extreme weather conditions, such as strong or dim light. Experimental results demonstrate that the proposed system has significant potential to take the research on automated car parking systems to the next level.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Factex: A Practical Approach to Crime Detection

Face detection for rail transit passengers based on single shot detector and active learning

Article 30 August 2022

Improved Vehicle Detection and Tracking Using YOLO and CSRT

Data availability

Some data can be shared on request.

Notes

References

Afifi M, Brown M (2019) “What else can fool deep learning? Addressing color constancy errors on deep neural network performance,” In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 243–252
Afifi M, Derpanis K, Ommer B, Brown M (2021) “Learning multi-scale photo exposure correction,” In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 9157–9167
Ahonen T, Hadid A, Pietikainen M (2006) Face description with local binary patterns, application to face recognition. IEEE Trans Pattern Anal Mach Intell 28(12):2037–2041
Google Scholar
Ameen M, Eleyan A, Eleyan G (2017) Wavelet transform based face recognition using SURF descriptors. Int J Electron Electric Eng 5(1):94–98
Google Scholar
Arash A, Ali A (2018) Predicting the future location of cars on urban street network by chaining spatial web services. IET Intell Transp Syst 12(8):793–800
Google Scholar
Asadianfam S, Shamsi M, Kenari A (2021) TVD-MRDL: traffic violation detection system using MapReduce-based deep learning for large-scale data. Multimed Tools Appl 80(2):2489–2516
Google Scholar
Badii C, Nesi P, Paoli I (2018) Predicting available parking slots on critical and regular services by exploiting a range of open data. IEEE Access 6:44059–44071
Google Scholar
Bernardo B, Cirillo S, Cuofano M, Desiato D (2022) Enhancing spatial perception through sound: mapping human movements into MIDI. Multimed Tools Appl:1–22. https://doi.org/10.1007/s11042-021-11077-7
Burgstahler D, Knapp F, Zöller S (2014) “Where is that car parked? A wireless sensor network-based approach to detect car positions,” 9^th International Workshop on Practical Issues in Building Sensor Network Applications, Edmonton, Canada, pp. 514–522
Caruccio L, Cirillo S, Deufemia V, Polese G (2021) “Efficient Validation of Functional Dependencies during Incremental Discovery,” In Proceedings of the 29^th Italian Symposium on Advanced Database Systems, Pizzo Calabro, Italy, pp. 5–9
Chao D, Loy C, He K, Tang X (2014) Learning a deep convolutional network for image super-resolution. Eur Conf Comput Vis:184–199
Chen G, Wang F, Qu S, Chen K, Yu J (2020) Pseudo-image and sparse points: vehicle detection with 2D LiDAR revisited by deep learning-based methods. IEEE Trans Intell Transp Syst 22(12):7699–7711
Google Scholar
Chil G, Musolino G, Polimeni A et al (2016) Transport models and intelligent transportation system to support urban evacuation planning process. IET Intell Transp Syst 10(4):279–286
Google Scholar
Christopher R, Fabio G, James W et al (2018) City-wide emissions modelling using fleet probe vehicles. IET Intell Transp Syst 12(9):1181–1188
Google Scholar
Chu W, Yao L, Cai D (2018) Multi-task vehicle detection with region-of-interest voting. IEEE Trans Image Process 27(1):432–441
MathSciNet Google Scholar
Daniel C, David S, Jose M et al (2018) Distribution of road hazard warning messages to distant vehicles in intelligent transport systems. IEEE Trans Intell Transp Syst 9(4):1152–1165
Google Scholar
Deng J, Guo J, Xue N, Zafeiriou S (2019) “Arcface: additive angular margin loss for deep face recognition,” In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 4690–4699
Ding C, Tao D (2015) Robust face recognition via multimodal deep face representation. IEEE Trans Multimed 17(11):2049–2058
Google Scholar
Farid A, Hussain F, Khan K, Shahzad M, Khan U, Mahmood Z (2023) A fast and accurate real-time vehicle detection method using deep learning for unconstrained environments. Appl Sci 30(5):1–30
Google Scholar
Ghimire D, Lee J (2011) Nonlinear transfer function-based local approach for color image enhancement. IEEE Trans Consum Electron 57(2):858–865
Google Scholar
Guo C, Li C, Guo J, Loy C, Hou J, Kwong S, Cong R (2020) “Zero-reference deep curve estimation for low-light image enhancement,” In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 1780–1789
Haitham C, You C, Philip A et al (2017) Blockchain-based dynamic key management for heterogeneous intelligent transportation systems. IEEE Internet Things J 4(6):1832–1843
Google Scholar
Han W, Chang S, Liu D, Yu M, Witbrock M, Huang TS (2018)“Image super-resolution via dual-state recurrent networks,” In Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 1654–1663
Haq M, Shahzad A, Mahmood Z, Shah A, Muhammad N, Akram T (2019) Boosting the face recognition performance of ensemble based LDA for pose, non-uniform illuminations, and low-resolution images. KSII Trans Int Inf Syst 13(6):3144–3164
Google Scholar
Hassaballah M, Mahmoud M, Kenk KM, Minaee S (2020) Vehicle detection and tracking in adverse weather using a deep learning framework. IEEE Trans Intell Transp Syst 22(7):4230–4242
Google Scholar
Hu Y, He H, Xu C, Wang B, Lin S (2018) Exposure: a white-box photo post-processing framework. ACM Trans Graph 37(2):1–17
Google Scholar
Hussain F, Afifi M, Moussa G (2018) A comprehensive study of the effect of spatial resolution and color of digital images on vehicle classification. IEEE Trans Intell Transp Syst 20(3):1181–1190
Google Scholar
Jamil K, Joao B, Lucio F et al (2018) A2CPS: a vehicle-centric safety conceptual framework for autonomous transport systems. IEEE Trans Intell Transp Syst 19(6):1925–1939
Google Scholar
Jiao Z, Xu B (2009) “An image enhancement approach using retinex and YIQ,” in International Conference on Information Technology and Computer Science (ITCS), pp. 476–479
Kafai M, Bhanu B (2012) Dynamic bayesian networks for vehicle classification in video. IEEE Trans Industrial Inf 8(1):100–109
Google Scholar
Lai W, Huang J, Ahuja N, Yang M (2017) Deep laplacian pyramid networks for fast and accurate super-resolution. In Proceedings of the IEEE conference on computer vision and pattern recognition:624–632
Leitloff J, Rosenbaum D, Kurz F (2014) An operational system for estimating road traffic information from aerial images. Remote Sens 6(11):11 315–11 111341
Google Scholar
Liu K, Mattyus G (2015) Fast multiclass vehicle detection on aerial images. IEEE Geosci Remote Sens Lett 12(9):1938–1942
Google Scholar
Liu L, Jiabao W, Zheng Z, Su H (2021) An improved approach for mining association rules in parallel using spark streaming. Int J Circ Theory Appl 49(4):1028–1039
Google Scholar
Lu J, Plataniotis K, Venetsanopoulos A, Li S (2006) Ensemble-based discriminant learning with boosting for face recognition. IEEE Trans Neural Netw 17(1):166–178
Google Scholar
Lv T, Zhang Y, Luo L, Gao X (2022) MAFFNet: real-time multi-level attention feature fusion network with RGB-D semantic segmentation for autonomous driving. Appl Opt 61(9):2219–2229
Google Scholar
Maeda S (2020) “Unpaired image super-resolution using pseudo-supervision,” In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 291–300
Mahmood Z, Ali T, Khattak S, Khan SU (2014) “A comparative study of baseline algorithms of face recognition,” 12^th International Conference on Frontier of Information Technology (FIT), pp. 263–268
Mahmood Z, Khan M, Jawad M, Khan SU, Yang LT (2015) “A parallel framework for object detection and recognition for secure vehicle parking,” 17^th International Conference on High Performance Computing and Communications (HPCC), pp. 892–895
Mahmood Z, Ali T, Khattak S, Hassan L, Khan SU (2015) Automatic player detection and identification for sports entertainment applications. Pattern Anal Applic 18(4):971–982
MathSciNet Google Scholar
Mahmood Z, Ali T, Khan SU (2016) Effects of pose and image resolution on automatic face recognition. IET Biometrics 5(2):111–119
Google Scholar
Mahmood Z, Ali T, Muhammad N, Bibi N, Shahzad I, Azmat S (2017) EAR: enhanced augmented reality system for sports entertainment applications. KSII Trans Int Inf Syst 11(12):6069–6091
Google Scholar
Mahmood Z, Muhammad N, Bibi N, Ali T (2017) A review on state-of-the-art face recognition approaches. Fractals 25(2):1750025-1–1750025-19
Google Scholar
Mahmood Z, Muhammad N, Bibi N, Malik Y, Ahmad N (2018) Visual enhancement of human observatory system using multi-scale retinex. Inf Med Unlocked 13:9–20
Google Scholar
Mahmood Z, Haneef O, Muhammad N, Khattak S (2018) Towards a fully automated car parking system. IET Intell Transp Syst 13(2):293–302
Google Scholar
Mahmood Z, Bibi N, Usman M, Khan U, Muhammad N (2019) Mobile cloud based framework for sports applications. Multidim Syst Sign Process 30(4):1991–2019
Google Scholar
Mahmood Z, Khan K, Khan U, Adil SH, Ali SSA, Shahzad M (2022) Towards automatic license plate detection. Sensors 22(3):1–19
Google Scholar
Masood S, Syed MI, Khan K, Fayyaz A, Sultan F, Mahmood Z (2020) “A Robust Method to Enhance Degraded Color Images Under Diverse Condition,” 14^th International Conference on Open Source Systems and Technologies (ICOSST), pp. 1–6
Meng J, Cao T, Peng J, Wang Z, Wang S (2022) Polarized image near-natural color fusion algorithm for target detection. Appl Opt 61(6):1323–1330
Google Scholar
Min K, Lee G, Lee S (2021) “ACNet: Mask-Aware Attention with Dynamic Context Enhancement for Robust Acne Detection,” In International Conference on Systems, Man, and Cybernetics (SMC), pp. 2724–2729
Moranduzzo T, Melgani (2014) Automatic car counting method for unmanned aerial vehicle images. IEEE Trans Geosci Remote Sens 52(3):1635–1647
Google Scholar
Reilly V, Idrees H, Shah M (2010) “Detection and tracking of large number of targets in wide area surveillance,” In Proceedings of the European Conference on Computer Vision, pp. 186–199
Richardos D, Bellotti F, Bakas I et al (2018) Gamified flexible transportation service for on-demand public transport. IEEE Trans Intell Transp Syst 19(3):921–933
Google Scholar
Sadia H, Azeem F, Ullah H, Mahmood Z, Khattak S, Khan G. Z (2018) “Color image enhancement using multi-scale retinex with guided filter,” 16^th International Conference on Frontiers of Information Technology (FIT), pp. 82–87
Silla A, Pirkko R, Lars L et al (2017) Quantifying the effectiveness of ITS in improving safety of VRUs. IET Intell Transp Syst 11(3):164–172
Google Scholar
Sultan F, Khan K, Shah YA, Shahzad M, Khan U, Mahmood Z (2023) Towards automatic license plate recognition in challenging conditions. Appl Sci 13(6):1–30
Google Scholar
Sun B, Tao W, Chen W (2008) “Luminance based MSR for color image enhancement,” in International Congress on Image and Signal Processing, pp. 358–362
Szegedy C, Toshev A, Erhan D (2013) “Deep neural network for object detection,” in Adv Neural Inf Proces Syst, pp. 2553–2561
Tuermer S, Kurz F, Reinartz P, Stilla U Airborne vehicle detection in dense urban areas using HoG features and disparity maps. IEEE J Sel Topics Appl Earth Observ Remote Sens 6(6, 2–13):2327–2337
Viola P, Michael J (2004) Robust real-time face detection. Int J Comput Vis 57(2):137–154
Google Scholar
Wang Q, Guo G (2020) LS-CNN: characterizing local patches at multiple scales for face recognition. IEEE Trans Inf Forensics Secur 15:1640–1653
Google Scholar
Wang Q, Guo G (2021) DSA-face: diverse and sparse attentions for face recognition robust to pose variation and occlusion. IEEE Trans Inf Forensics Secur 16:4534–4543
Google Scholar
Wang W, Xi J, Liu C et al (2017) Human-centered feed-forward control of a vehicle steering system based on a driver's path-following characteristics. IEEE Trans Intell Transp Syst 18(6):1440–1453
Google Scholar
Wang Z, Chen J, Hoi S (2020) Deep learning for image super-resolution: a survey. IEEE Trans Pattern Anal Mach Intell 43(10):3365–3387
Google Scholar
Wang Q, Wu T, Zheng H, Guo G (2020) “Hierarchical pyramid diverse attention networks for face recognition,” In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 8326–8335
Wen X, Zhao H, Wang N, Yuan H (2006) A rear-vehicle detection system for static images based on monocular vision. In: Proc. 9th Int. Conf. Control, autom., robot. Vis, pp 2421–2424
Google Scholar
Yang J, Zhang D, Frangi A (2004) Two-dimensional PCA: a new approach to appearance-based face representation and recognition. IEEE Trans Pattern Anal Mach Intell 26(1):131–137
Google Scholar
Zaki P, William M, Soliman B, Alexsan K, Khalil K, Moursy M (2020) “Traffic signs detection and recognition system using deep learning. arXiv 2020, arXiv:2003.03256
Zhan X, Qian X, Satish V (2016) A graph-based approach to measuring the efficiency of an urban taxi service system. IEEE Trans Intell Transp Syst 17(9):2479–2489
Google Scholar
Zhang Z, Wang Z, Lin Z, Qi H (2019) “Image super-resolution by neural texture transfer,” In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 7982–7991
Zhang Z, Lu X, Cao G, Yang Y, Jiao L, Liu F (2021) ViT-YOLO: transformer-based YOLO for object detection. In Proceedings of the IEEE/CVF International Conference on Computer Vision:2799–2808
Zhang J, Liu T, Yin X, Wang X, Zhang K, Xu J, Wang D (2021) An improved parking space recognition algorithm based on panoramic vision. Multimed Tools Appl 80(12):18181–18209
Google Scholar
Zhao J, Hao S, Dai C, Zhang H, Zhao L (2022) Improved vision-based vehicle detection and classification by optimized YOLOv4. IEEE Access 10:8590–8603
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Electrical and Computer Engineering, COMSATS University Islamabad, Abbottabad Campus, Pakistan
Zahid Mahmood, Mohsin Shahzad, Ahmad Fayyaz & Uzair Khan
Department of Avionics Engineering, Air University Islamabad, Islamabad, Pakistan
Khurram Khan
Faculty of Computer Science and Engineering, GIK Institute of Engineering Sciences and Technology, Topi, KPK, Pakistan
Khurram Khan

Authors

Zahid Mahmood
View author publications
You can also search for this author in PubMed Google Scholar
Khurram Khan
View author publications
You can also search for this author in PubMed Google Scholar
Mohsin Shahzad
View author publications
You can also search for this author in PubMed Google Scholar
Ahmad Fayyaz
View author publications
You can also search for this author in PubMed Google Scholar
Uzair Khan
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

All Authors have equivalent contributions.

Corresponding authors

Correspondence to Zahid Mahmood or Khurram Khan.

Ethics declarations

Competing interests /competing interests

The authors declare that they have no conflict of interest.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Mahmood, Z., Khan, K., Shahzad, M. et al. Enhanced detection and recognition system for vehicles and drivers using multi-scale retinex guided filter and machine learning. Multimed Tools Appl 83, 15785–15824 (2024). https://doi.org/10.1007/s11042-023-16140-z

Download citation

Received: 21 May 2021
Revised: 12 May 2023
Accepted: 03 July 2023
Published: 24 July 2023
Issue Date: February 2024
DOI: https://doi.org/10.1007/s11042-023-16140-z

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Enhanced detection and recognition system for vehicles and drivers using multi-scale retinex guided filter and machine learning

Abstract

Access this article

Similar content being viewed by others

Factex: A Practical Approach to Crime Detection

Face detection for rail transit passengers based on single shot detector and active learning

Improved Vehicle Detection and Tracking Using YOLO and CSRT

Data availability

Notes

References

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests /competing interests

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Enhanced detection and recognition system for vehicles and drivers using multi-scale retinex guided filter and machine learning

Abstract

Access this article

Similar content being viewed by others

Factex: A Practical Approach to Crime Detection

Face detection for rail transit passengers based on single shot detector and active learning

Improved Vehicle Detection and Tracking Using YOLO and CSRT

Data availability

Notes

References

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests /competing interests

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation