Demographic attribute estimation in face videos combining local information and quality assessment

Becerra-Riera, Fabiola; Morales-González, Annette; Méndez-Vázquez, Heydi; Dugelay, Jean-Luc

doi:10.1007/s00138-021-01269-4

Demographic attribute estimation in face videos combining local information and quality assessment

Special Issue Paper
Published: 03 February 2022

Volume 33, article number 26, (2022)
Cite this article

Machine Vision and Applications Aims and scope Submit manuscript

176 Accesses
1 Citation
1 Altmetric
Explore all metrics

Abstract

Nowadays, video analysis applications are gaining popularity given the rise of CCTV systems and the availability of video cameras to the general public, such as cameras in mobile devices. Many image analysis and processing tasks have evolved toward video domain, with the advantage of redundant information obtained from several frames, which can help disambiguating many recognition outputs. In this context, there are also particular video problems to deal with, such as uncontrolled scenarios and poor image quality. Most existing works regarding facial demographic estimation are focused on still image datasets; therefore, we propose to address gender and age estimation in video scenarios. In order to handle known video problems such as low-quality image capture, occlusions and pose variations, we propose a threefold strategy to adapt current image-based attribute recognition algorithms. First, we employ a quality assessment step based on 12 metrics to select relevant good quality frames from a face video sequence. Second, we propose a component-based approach to determine the most discriminant local regions of the face for each specific attribute, under these varying conditions. Third, we evaluate different frame combination strategies to produce the final video prediction. In our experimental validation, conducted in 3 datasets (EURECOM Augmented, UvA-Nemo Smile and YouTube Faces datasets), we show the advantages of our proposed strategy for improving video-based demographic attribute classification.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A new YOLO-based method for real-time crowd detection from video and performance analysis of YOLO models

Article 30 January 2023

Mehmet Şirin Gündüz & Gültekin Işık

A review on face recognition systems: recent approaches and challenges

Article 30 July 2020

Muhtahir O. Oloyede, Gerhard P. Hancke & Hermanus C. Myburgh

Real-Time Human Pose Detection and Recognition Using MediaPipe

Availability of data and material

All the data employed for experimentation are under public domain.

Code availability

We cannot disclose the code.

Notes

References

Becerra-Riera, F., Morales-González, A., Méndez-Vázquez, H.: A survey on facial soft biometrics for video surveillance and forensic applications. Artif. Intell. Rev. 52(2), 1155–1187 (2019). https://doi.org/10.1007/s10462-019-09689-5
Article Google Scholar
Park, U., Jain, A.K.: Face matching and retrieval using soft biometrics. Trans. Inf. Forens. Secur. 5(3), 406–415 (2010). https://doi.org/10.1109/TIFS.2010.2049842
Article Google Scholar
Tome, P., Fierrez, J., Vera-Rodriguez, R., Nixon, M.S.: Soft biometrics and their application in person recognition at a distance. Trans. Inf. Forens. Secur. 9(3), 464–475 (2014). https://doi.org/10.1109/TIFS.2014.2299975
Article Google Scholar
Ng, C.-B., Lo, W.-H.: Effect of image distortion on facial age and gender classification performance of convolutional neural networks”. IOP Conference Series: Materials Science and Engineering 495, 012029 (2019)
Greco, A., Saggese, A., Vento, M., Vigilante, V.: Gender recognition in the wild: a robustness evaluation over corrupted images. J. Ambient Intell. Hum. Comput. (2020). https://doi.org/10.1007/s12652-020-02750-0
Article Google Scholar
Becerra-Riera, F., Morales-González, A., Mendez-Vazquez, H., Dugelay, J.-L.: Attribute-based quality assessment for demographic estimation in face videos. In: ICPR 2020, 25th International Conference on Pattern Recognition, 10-15 January 2021, Milan, Italy (Virtual Conference), IEEE, Ed., Milan (2021)
Becerra-Riera, F., Morales-González, A., Méndez-Vázquez, H.: Exploring local deep representations for facial gender classification in videos. In: International Workshop on Artificial Intelligence and Pattern Recognition (IWAIPR), ser. Lecture Notes in Computer Science, vol. 11047. Springer, pp. 104–112 (2018)
Zhang, Y., Shang, K., Wang, J., Li, N., Zhang, M.M.: Patch strategy for deep face recognition. IET Image Process. 12(5), 819–825 (2018)
Article Google Scholar
Méndez-Vázquez, H., Becerra-Riera, F., Morales-González, A., López-Avila, L., Tistarelli, M.: Local deep features for composite face sketch recognition. In: 7th International Workshop on Biometrics and Forensics, IWBF: Cancun, Mexico, May 2–3, 2019. IEEE, vol. 2019, pp. 1–6 (2019)
Carcagnì, P., Coco, M.D., Cazzato, D., Leo, M., Distante, C.: A study on different experimental configurations for age, race, and gender estimation problems. EURASIP J. Image Video Process. 2015, 1–22 (2015)
Article Google Scholar
Gupta, R., Kumar, S., Yadav, P., Shrivastava, S.: Identification of age, gender and race smt (scare, marks, tattoos) from unconstrained facial images using statistical techniques. In: 2018 International Conference on Smart Computing and Electronic Enterprise (ICSCEE), pp. 1–8 (2018)
Antipov, G., Baccouche, M., Berrani, S.-A., Dugelay, J.-L.: Effective training of convolutional neural networks for face-based gender and age prediction. Pattern Recognit. 72(C), 15–26 (2017)
Article Google Scholar
Afifi, M., Abdelhamed, A.: Afif4: Deep gender classification based on adaboost-based fusion of isolated facial features and foggy faces. J. Vis. Commun. Image Represent. 62, 77–86 (2019)
Article Google Scholar
Danişman, T.: Bagging ensemble for deep learning based gender recognition using test-time augmentation on large-scale datasets. Turkish J. Electr. Eng. Comput. Sci. 29(4), 2084–2100 (2021)
Article Google Scholar
Yang, T.-Y., Huang, Y.-H., Lin, Y.-Y., Hsiu, P.-C., Chuang, Y.-Y.: Ssr-net: A compact soft stagewise regression network for age estimation. In: Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, IJCAI-18, 7, pp. 1078–1084 (2018). https://doi.org/10.24963/ijcai.2018/150
Terhörst, P., Huber, M., Kolf, J. N., Zelch, I., Damer, N., Kirchbuchner, F., Kuijper, A.: Reliable age and gender estimation from face images: Stating the confidence of model predictions. In: 10th International Conference on BTAS. IEEE, 10 (2019)
Althnian, A., Aloboud, N., Alkharashi, N., Alduwaish, F., Alrshoud, M., Kurdi, H.: Face gender recognition in the wild: an extensive performance comparison of deep-learned, hand-crafted, and fused features with deep and traditional models. Appl. Sci. 11(1), 89 (2021)
Article Google Scholar
Roxo, T., Proença, H.: Faces in the wild: efficient gender recognition in surveillance conditions (2021)
Demirkus, M., Garg, K., Guler, S.: Automated person categorization for video surveillance using soft biometrics. In: Biometric Technology for Human Identification VII, vol. 7667. SPIE, pp. 236 – 247 (2010). https://doi.org/10.1117/12.851424
Hadid, A., Pietikäinen, M.: Demographic classification from face videos using manifold learning. Neurocomputing 100, 197–205 (2013). (special issue: Behaviours in video)
Article Google Scholar
Wang, W.-C., Hsu, R.-Y., Huang, C.-R., Syu, L.-Y.: Video gender recognition using temporal coherent face descriptor. In: 16th IEEE/ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/ Distributed Computing, SNPD. IEE, pp. 113–118 (2015)
Demirkus, M., Precup, D., Clark, J.J., Arbel, T.: Hierarchical spatio-temporal probabilistic graphical model with multiple feature fusion for binary facial attribute classification in real-world face videos. IEEE TPAMI 38, 1185–1203 (2016)
Article Google Scholar
Azzopardi, G., Greco, A., Saggese, A., Vento, M.: Fast gender recognition in videos using a novel descriptor based on the gradient magnitudes of facial landmarks. In: Advanced Video and Signal Based Surveillance, pp. 1–6 (2017)
Dibeklioğlu, H., Gevers, T., Salah, A.A., Valenti, R.: A smile can reveal your age: enabling facial dynamics in age estimation. In: Proceedings of the 20th ACM International Conference on Multimedia, ser. MM’12. New York, NY, USA: Association for Computing Machinery, pp. 209–218. (2012). https://doi.org/10.1145/2393347.2393382
Dantcheva, A., Brémond, F.: Gender estimation based on smile-dynamics. IEEE Trans. Inf. Forens. Secur. 12(3), 719–729 (2017). https://doi.org/10.1109/TIFS.2016.2632070
Article Google Scholar
Kharchevnikova, A., Savchenko, A.V.: Neural networks in video-based age and gender recognition on mobile platforms. Opt. Mem. Neural Netw. 27(4), 246–259 (2018)
Article Google Scholar
Oleksii, G., Olena, P.: Video stream gender classification using shallow cnn. Int. J. Pattern Recognit. Artif. Intell. (2020). https://doi.org/10.1142/S0218001421550016
Article Google Scholar
Roxo, T., Proença, H.: Is gender “in-the-wild” inference really a solved problem? IEEE Transactions on Biometrics, Behavior, and Identity Science, pp. 1–1 (2021)
Levi, G., Hassner, T.: Age and gender classification using convolutional neural networks. In: Conference on Computer Vision and Pattern Recognition (CVPR) Workshops. IEEE, pp. 34–42 (2015)
Best-Rowden, L., Jain, A.K.: Learning face image quality from human assessments. IEEE Trans. Inf. Forens. Secur. 13(12), 3064–3077 (2018)
Article Google Scholar
Hernandez-Ortega, J., Galbally, J., Fierrez, J., Haraksim, R., Beslay, L.: Faceqnet: Quality assessment for face recognition based on deep learning. In: 2019 International Conference on Biometrics (ICB). IEEE, pp. 1–8 (2019)
Xiang, X.: Beyond deep feature averaging: Sampling videos towards practical facial pain recognition. In: CVPR Workshops, pp. 37–42 (2019)
Kharchevnikova, A., Savchenko, A.V.: Efficient video face recognition based on frame selection and quality assessment. PeerJ Comput. Sci. 7, e391 (2021)
Article Google Scholar
Selim, M., Sundararajan, S., Pagani, A., Stricker, D.: Image quality-aware deep networks ensemble for efficient gender recognition in the wild. In: Proceedings of the 13th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2018), vol. 5, pp. 351–358 (2018)
Wang, X., Guo, R., Kambhamettu, C.: Deeply-learned feature for age estimation. Winter Conference on Applications of Computer Vision (WACV), pp. 534–541 (2015)
Zhong, Y., Sullivan, J., Li, H.: Face attribute prediction using off-the-shelf deep learning networks. In: International Conference on Biometrics (ICB). IEEE (2016)
Rodríguez, P., Cucurull, G., Gonfaus, J.M., Roca, F.X., González, J.: Age and gender recognition in the wild with deep attention. Pattern Recognit. 72(C), 563–571 (2017)
Article Google Scholar
Lee, B., Gilani, S.Z., Hassan, G.M., Mian, A., Facial gender classification-analysis using convolutional neural networks. In: Digital Image Computing: Techniques and Applications (DICTA). IEEE, vol. 2019, pp. 1–8 (2019)
Min, R., Kose, N., Dugelay, J.-L.: KinectFaceDB: A Kinect database for face recognition. IEEE Trans. Syst. Man Cybern. Syst. 07 (2014). http://www.eurecom.fr/publication/4393
Demirkus, M., Precup, D., Clark, J.J., Arbel, T.: Hierarchical temporal graphical model for head pose estimation and subsequent attribute classification in real-world videos. Comput. Vis. Image Underst. 136, 128–145 (2015)
Article Google Scholar
Breiman, L.: Random forests. Mach. Learn. 45(1), 5–32 (2001). https://doi.org/10.1023/A:1010933404324
Article MATH Google Scholar
Méndez-Vázquez, H., Chang, L., Rizo-Rodríguez, D., Morales-González, A.: Evaluación de la calidad de las imágenes de rostros utilizadas para la identificación de las personas. Computación y Sistemas 16, 147–165, 06 (2012)
Face Recognition Format Data Interchange, Version 2.0. InterNational Committee for Information Technology Standards (INCITS) Secretariat. Information Technology Industry Council, Standard (2006)
Wolf, L., Hassner, T., Maoz, I.: Face recognition in unconstrained videos with matched background similarity. In: CVPR. IEEE Computer Society, pp. 529–534 (2011)
Huang, G.B., Ramesh, M., Berg, T., Learned-Miller, E.: Labeled faces in the wild: A database for studying face recognition in unconstrained environments. University of Massachusetts, Amherst, Tech. Rep. 07-49, October 2007
Kumar, N., Berg, A., Belhumeur, P.N., Nayar, S.: Describable visual attributes for face verification and image search. IEEE TPAMI 33(10), 1962–1977 (2011)
Article Google Scholar
Howard, A.G., Zhu, M., Chen, B., Kalenichenko, D., Wang, W., Weyand, T., Andreetto, M., Adam, H.: Mobilenets: Efficient convolutional neural networks for mobile vision applications. CoRR (2017). arXiv:1704.04861
Rothe, R., Timofte, R., Gool, L.V.: Deep expectation of real and apparent age from a single image without facial landmarks. Int. J. Comput. Vis. (2016)
Zhang, S.Y., Zhifei, Qi, H.: Age progression/regression by conditional adversarial autoencoder. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR). IEEE (2017)
Openvino age and gender recognition retail 0013 (2019). https://docs.openvinotoolkit.org/2019_R1/_age_gender_recognition_retail_0013_description_age_gender_recognition_retail_0013.html
Bilinski, P.T., Dantcheva, A., Brémond, F.: Can a smile reveal your gender? In: International Conference of the BIOSIG, ser. LNI, vol. P-260. GI/IEEE, vol. 2016, pp. 27–38 (2016)

Download references

Funding

Not applicable.

Author information

Authors and Affiliations

Advanced Technologies Application Center (CENATAV), 7A #21406 Siboney, Playa, P.C.12200, Havana, Cuba
Fabiola Becerra-Riera, Annette Morales-González & Heydi Méndez-Vázquez
Digital Security Department, EURECOM, Campus Sophia Tech, 450 route des Chappes, Biot Sophia Antipolis, 06410, France
Jean-Luc Dugelay

Authors

Fabiola Becerra-Riera
View author publications
You can also search for this author in PubMed Google Scholar
Annette Morales-González
View author publications
You can also search for this author in PubMed Google Scholar
Heydi Méndez-Vázquez
View author publications
You can also search for this author in PubMed Google Scholar
Jean-Luc Dugelay
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Fabiola Becerra-Riera.

Ethics declarations

Conflict of interest

The authors declare no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Becerra-Riera, F., Morales-González, A., Méndez-Vázquez, H. et al. Demographic attribute estimation in face videos combining local information and quality assessment. Machine Vision and Applications 33, 26 (2022). https://doi.org/10.1007/s00138-021-01269-4

Download citation

Received: 14 April 2021
Revised: 29 August 2021
Accepted: 19 November 2021
Published: 03 February 2022
DOI: https://doi.org/10.1007/s00138-021-01269-4

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Demographic attribute estimation in face videos combining local information and quality assessment

Abstract

Access this article

Similar content being viewed by others

A new YOLO-based method for real-time crowd detection from video and performance analysis of YOLO models

A review on face recognition systems: recent approaches and challenges

Real-Time Human Pose Detection and Recognition Using MediaPipe

Availability of data and material

Code availability

Notes

References

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Demographic attribute estimation in face videos combining local information and quality assessment

Abstract

Access this article

Similar content being viewed by others

A new YOLO-based method for real-time crowd detection from video and performance analysis of YOLO models

A review on face recognition systems: recent approaches and challenges

Real-Time Human Pose Detection and Recognition Using MediaPipe

Availability of data and material

Code availability

Notes

References

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation