Face-Voice Authentication Based on 3D Face Models

Chetty, Girija; Wagner, Michael

doi:10.1007/11612032_57

Girija Chetty¹⁹ &
Michael Wagner¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 3851))

Included in the following conference series:

Asian Conference on Computer Vision

Abstract

In this paper we propose fusion of shape and texture information from 3D face models of persons with the acoustic features extracted from spoken utterances, to improve the performance against imposter and replay attacks. Experiments conducted on two multimodal speaking face corpora, VidTIMIT and AVOZES, allowed less than 2 % EERs to be achieved for imposter attacks, and less than 1% for type-1 replay attacks for multimodal feature fusion of acoustic, shape and texture features. For type-2 replay attacks, more difficult type of spoof attacks, less than 7% EER was achieved.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Multimodal Presentation Attack Detection Based on Mouth Motion and Speech Recognition

Automated Multimodal Biometric System with Ear and Side Profile Face for Human Identification

Texture analysis of edge mapped audio spectrogram for spoofing attack detection

Article 26 May 2023

References

Chetty, G., Wagner, M.: ’Liveness’ Verification in Audio-Video Authentication. In: Proc. Int. Conf. on Spoken Language Processing ICSLP 2004, Jeju, Korea, pp. 2509–2512 (2004)
Google Scholar
Poh, N., Korczak, J.: Hybrid biometric person authentication using face and voice features. In: Proc. of Int. Conf. on Audio and Video-Based Biometric Person Authentification, Halmstad, Sweden, June 2001, pp. 348–353 (2001)
Google Scholar
Sanderson, C., Paliwal, K.K.: Identity verification using speech and face information. Digital Signal Processing 14(5), 397–507 (2004)
Article Google Scholar
Hsu, R.L., Jain, A.K.: Face Modeling for Recognition. In: Proceedings Int’l Conf. Image Processing, ICIP, Greece, October 7-10 (2001)
Google Scholar
Yehia, H., Rubin, P., Vatikiotic-Bateson, E.: Quantitative association of vocal track and facial behavior. Jorunal of Speech Communication 26(1-2), 23–43 (1998)
Article Google Scholar
Yehia, H., Kuratate, T., Vatikiotic-Bateson, E.: Linking Facial Animation, Head Motion and Speech Acoustics. Journal of Phonetics 30(3) (2002)
Google Scholar
Sanderson, C., Paliwal, K.K.: Fast features for face authentication under illumination direction changes. Pattern Recognition Letters 24, 2409–2419 (2003)
Article Google Scholar
Goecke, R., Millar, J.B.: The Audio-Video Australian English Speech Data Corpus AVOZES. In: Proceedings of the 8th International Conference on Spoken Language Processing INTERSPEECH 2004 - ICSLP, Jeju, Korea, 4 - 8 October, vol. III, pp. 2525–2528 (2004)
Google Scholar
Gordon, G.: Face Recognition from Frontal and Profile Views. In: Proceedings Int’l Workshop on Face and Gesture Gesture Recognition, Zurich, pp. 47–52 (1995)
Google Scholar
Chetty, G., Wagner, M.: Automated lip feature extraction for liveness verification in audio-video authentication. In: Proc. Image and Vision Computing 2004, New Zealand, pp. 17–22 (2004)
Google Scholar

Download references

Author information

Authors and Affiliations

HCC Laboratory, School of ISE, University of Canberra,
Girija Chetty & Michael Wagner

Authors

Girija Chetty
View author publications
You can also search for this author in PubMed Google Scholar
Michael Wagner
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

International Institute of Information Technology, Center for Visual Information Technology, Hyderabad, India
P. J. Narayanan
Department of Computer Science, Columbia University, 500 West 120th Street, 10027, New York, NY, USA
Shree K. Nayar
Microsoft Research Asia, Beijing, P.R. China
Heung-Yeung Shum

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Chetty, G., Wagner, M. (2006). Face-Voice Authentication Based on 3D Face Models. In: Narayanan, P.J., Nayar, S.K., Shum, HY. (eds) Computer Vision – ACCV 2006. ACCV 2006. Lecture Notes in Computer Science, vol 3851. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11612032_57

Download citation

DOI: https://doi.org/10.1007/11612032_57
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-31219-2
Online ISBN: 978-3-540-32433-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics