Deep Learning Feature Extraction Architectures for Real-Time Face Detection

B, Ravi Teja; D, Mythili; Duvva, Laxmiprasanna; Bethu, Srikanth; Garapati, Yugandhar

doi:10.1007/s42979-023-02023-5

Deep Learning Feature Extraction Architectures for Real-Time Face Detection

Original Research
Published: 28 August 2023

Volume 4, article number 645, (2023)
Cite this article

SN Computer Science Aims and scope Submit manuscript

Ravi Teja B¹,
Mythili D²,
Laxmiprasanna Duvva²,
Srikanth Bethu ORCID: orcid.org/0000-0002-1091-4901³ &
…
Yugandhar Garapati¹

200 Accesses
2 Citations
Explore all metrics

Abstract

A Video Surveillance system can be used for a variety of purposes, including protection, secure data, crowd flux analytics and congestion analysis, individual recognition, anomalous activity detection, and so on. Video Surveillance systems play the key role in the human detection using the face features extraction. It helps in many applications like terrorists attack, thief identifying by detecting the face of the person but mostly failed in real-time aspect. In this context, we propose a method that significantly aids in the extraction and learning of features. To reduce the face recognition error, we use a bounding box regression model. To train the features, we utilized a CNN-based feature learning model with log-likelihood ratio calculations between inter- and intra-features. To increase the quality of video frames, we used a histogram redistribution image enhancement technique. Finally, a Background Subtracted Faster RCNN for video-based face recognition (BSF-RCNN-VFR) is used to discriminate the groups of detected faces. A comprehensive experiment is carried out on their datasets to demonstrate that the proposed solution performs better, and we compared the existing models with proposed models. We achieved 94.2 accuracy percentage. In this paper, the CNN models like AlexNet, ResNet and datasets like UADFV, Celeb-DF, FF++, DFDC, etc., accuracies are compared.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Deep Embedding for Face Recognition in Public Video Surveillance

A deep learning approach to building an intelligent video surveillance system

Article Open access 07 October 2020

Automation of surveillance systems using deep learning and facial recognition

Article 06 January 2023

Data availability

Data is provided on request.

References

Zhang S, Chi C, Lei Z, Li S Z. Refineface: refinement neural network for high performance face detection. IEEE Trans Pattern Anal Mach Intell. 2020
Li Y, Sun B, Wu T, Wang Y. Face detection with end-to-end integration of a convnet and a 3d model. In: European Conference on computer vision. Cham: Springer; Leibe et al. (Eds.): ECCV 2016, October, pp. 420–36.
Zhang K, Zhang Z, Li Z, Qiao Y. Joint face detection and alignment using multitask cascaded convolutional networks. IEEE Signal Process Lett. 2016;23(10):1499–503.
Article Google Scholar
Tao QQ, Zhan S, Li XH, Kurihara T. Robust face detection using local CNN and SVM based on kernel combination. Neurocomputing. 2016;211:98–105.
Article Google Scholar
Pham HX, Pavlovic V, Cai J, Cham TJ. Robust real- time performance-driven 3D face tracking. In: 2016 23rd International Conference on Pattern Recognition (ICPR). IEEE. 2016, December, pp. 1851–56.
Ranganatha S, Gowramma YP. A novel fused algorithm for human face tracking in video sequences. In 2016 International Conference on Computation System and Information Technology for Sustainable Solutions (CSITSS). IEEE. 2016, October, pp. 1–6.
Soldić M, Marčetić D, Maračić M, Mihalić D, Ribarić S. Real-time face tracking under long-term full occlusions. In: Proceedings of the 10th International Symposium on Image and Signal Processing and Analysis. IEEE, 2017, September; pp. 147–152.
Hu X, Chen L, Tang B, Cao D, He H. Dynamic path planning for autonomous driving on various roads with avoidance of static and moving obstacles. Mech Syst Signal Process. 2018;100:482–500.
Article Google Scholar
Maleš L, Marčetić D, Ribarić S. A multi-agent dynamic system for robust multi-face tracking. Expert Syst Appl. 2019;126:246–64.
Article Google Scholar
Yuan, S., Yu, X., Majid, A. Robust face tracking using Siamese VGG with pre-training and fine-tuning. In: 2019 4th International Conference on Control and Robotics Engineering (ICCRE). IEEE, 2019, April; pp. 170-74.
Wu B, Hu BG, Ji Q. A coupled hidden Markov random field model for simultaneous face clustering and tracking in videos. Pattern Recogn. 2017;64:361–73.
Article Google Scholar
Congcong Z, Zhenhua Y, Suping W, Hao L. Dual-cycle deep reinforcement learning for stabilizing face tracking. In 2019 IEEE International Conference on Multimedia Expo Workshops (ICMEW). IEEE, 2019, July; pp. 543–48.
Ding C, Tao D. Robust face recognition via multimodal deep face representation. IEEE Trans Multimed. 2015;17(11):2049–58.
Article Google Scholar
Sun Y, Liang D, Wang X, Tang X. Deepid3: face recognition with very deep neural networks. 2015. arXiv preprint arXiv:1502.00873.
Rejeesh MR. Interest point based face recognition using adaptive neuro fuzzy inference system. Multimed Tools Appl. 2019;78(16):22691–710.
Article Google Scholar
Ding C, Choi J, Tao D, Davis LS. Multi-directional multi-level dual-cross patterns for robust face recognition. IEEE Trans Pattern Anal Mach Intell. 2015;38(3):518–31.
Article Google Scholar
Ng CJ, Teoh ABJ. DCTNet: A simple learning-free approach for face recognition. In: 2015 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA). IEEE, 2015, December; pp. 761-68.
Gao S, Zhang Y, Jia K, Lu J, Zhang Y. Single sample face recognition via learning deep supervised autoencoders. IEEE Trans Inf Forensics Secur. 2015;10(10):2108–18.
Article Google Scholar
Lei Z, Zhang X, Yang S, Ren Z, Akindipe OF. RFR-DLVT: a hybrid method for real-time face recognition using deep learning and visual tracking. Enterp Inform Syst. 2020;14(9–10):1339–79.
Google Scholar
Li Y, Xie Y, Lu X. Multi-face recognition and dynamic tracking based on reinforcement learning algorithm. In: MATEC Web of Conferences (Vol. 336, p. 06006). EDP Sciences. 2021.
Ren G, Lu X, Li Y. A cross-camera multi-face tracking system based on double triplet networks. IEEE Access. 2021;9:43759–74.
Article Google Scholar
Pujol FA, Pujol M, Jimeno-Morenilla A, Pujol MJ. Face detection based on skin color segmentation using fuzzy entropy. Entropy. 2017;19(1):26.
Article Google Scholar
Wu X, Zhao J, Wang H. Face segmentation based on leve set and deep learning prior shape. In: 2017 10th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI). IEEE, 2017; pp. 1-5.
Lin K, Zhao H, Lv J, Zhan J, Liu X, Chen R, Huang Z. Face detection and segmentation with generalized intersection over union based on mask R- CNN. In: International Conference on brain inspired cognitive systems. Springer, Cham, 2019; pp. 106-116.
He K, Zhang X, Ren S, Sun J. Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on computer vision and pattern recognition, 2016; pp. 770–78.
Ortiz EG, Becker BC. Face recognition for web-scale datasets. Comput Vis Image Underst. 2014;118:153–70.
Article Google Scholar
Savaliya R, Kalaria V. A Video Surveillance system for traffic application. SIJ Trans Comput Sci En. Appl (CSEA). 2014;2(8).pp 1–5.
https://www.kaggle.com/code/blurredmachine/alexnet-architecture-a-complete-guide. Accessed 20 Sept 2021
https://www.geeksforgeeks.org/vgg-16-cnn-model/. Accessed 12 Jan 2023
https://builtin.com/machine-learning/relu-activation-function. Accessed 7 Feb 2023
Huang J, Wang X, Du B, Du P, Xu C. DeepFake MNIST+: a DeepFake facial animation dataset. In: 2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW), Montreal, BC, Canada, 2021; pp. 1973-1982. https://doi.org/10.1109/ICCVW54120.2021.00224.
https://paperswithcode.com/datasets. Accessed 12 Feb 2023
He Y, et al. ForgeryNet: a versatile benchmark for comprehensive forgery analysis. In: 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN, USA, 2021; pp. 4358–4367. https://doi.org/10.1109/CVPR46437.2021.00434.
https://ngrok.com/. Accessed 20 Dec 2022
Ayache F, Alti A. Performance evaluation of machine learning for recognizing human facial emotions. Revue d’Intell Artif. 2020;34(3):267–75. https://doi.org/10.18280/ria.340304.
Article Google Scholar
Ayeche F, Alti A. HDG and HDGG: an extensible feature extraction descriptor for effective face and facial expressions recognition. Pattern Anal Appl. 2021;24:1095–110. https://doi.org/10.1007/s10044-021-00972-2.
Article Google Scholar
Ayeche F, Alti A. Local directional gradients extension for recognising face and facial expressions. Int J Intell Syst Technol Appl. 2022;20(6):487–509. https://doi.org/10.1504/ijista.2022.128525
Ayeche F, Alti A. Novel descriptors for effective recognition of face and facial expressions. Revue d’Intell Artif. 2020;34(5):521–30. https://doi.org/10.18280/ria.340501.
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science and Engineering, GITAM School of Technology, GITAM Deemed-to-be University, Hyderabad, Telangana, 502329, India
Ravi Teja B & Yugandhar Garapati
Department of Computer Science and Engineering, Vasavi College of Engineering, Hyderabad, Telangana, 500089, India
Mythili D & Laxmiprasanna Duvva
Department of Computer Science and Engineering, CVR College of Engineering, Hyderabad, Telangana, 501510, India
Srikanth Bethu

Authors

Ravi Teja B
View author publications
You can also search for this author in PubMed Google Scholar
Mythili D
View author publications
You can also search for this author in PubMed Google Scholar
Laxmiprasanna Duvva
View author publications
You can also search for this author in PubMed Google Scholar
Srikanth Bethu
View author publications
You can also search for this author in PubMed Google Scholar
Yugandhar Garapati
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Srikanth Bethu.

Ethics declarations

Funding

No funding available.

Conflict of Interest

No conflict of interest.

Ethical Approval

Yes.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This article is part of the topical collection “Enabling Innovative Computational Intelligence Technologies for IOT” guest edited by Omer Rana, Rajiv Misra, Alexander Pfeiffer, Luigi Troiano and Nishtha Kesswani.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

B, R.T., D, M., Duvva, L. et al. Deep Learning Feature Extraction Architectures for Real-Time Face Detection. SN COMPUT. SCI. 4, 645 (2023). https://doi.org/10.1007/s42979-023-02023-5

Download citation

Received: 29 December 2022
Accepted: 28 April 2023
Published: 28 August 2023
DOI: https://doi.org/10.1007/s42979-023-02023-5

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Deep Learning Feature Extraction Architectures for Real-Time Face Detection

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Deep Embedding for Face Recognition in Public Video Surveillance

A deep learning approach to building an intelligent video surveillance system

Automation of surveillance systems using deep learning and facial recognition

Data availability

References

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Funding

Conflict of Interest

Ethical Approval

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Subscribe and save

Buy Now

Navigation

Deep Learning Feature Extraction Architectures for Real-Time Face Detection

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Deep Embedding for Face Recognition in Public Video Surveillance

A deep learning approach to building an intelligent video surveillance system

Automation of surveillance systems using deep learning and facial recognition

Data availability

References

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Funding

Conflict of Interest

Ethical Approval

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now

Search

Navigation