Abstract
In this paper we propose fusion of shape and texture information from 3D face models of persons with the acoustic features extracted from spoken utterances, to improve the performance against imposter and replay attacks. Experiments conducted on two multimodal speaking face corpora, VidTIMIT and AVOZES, allowed less than 2 % EERs to be achieved for imposter attacks, and less than 1% for type-1 replay attacks for multimodal feature fusion of acoustic, shape and texture features. For type-2 replay attacks, more difficult type of spoof attacks, less than 7% EER was achieved.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Chetty, G., Wagner, M.: ’Liveness’ Verification in Audio-Video Authentication. In: Proc. Int. Conf. on Spoken Language Processing ICSLP 2004, Jeju, Korea, pp. 2509–2512 (2004)
Poh, N., Korczak, J.: Hybrid biometric person authentication using face and voice features. In: Proc. of Int. Conf. on Audio and Video-Based Biometric Person Authentification, Halmstad, Sweden, June 2001, pp. 348–353 (2001)
Sanderson, C., Paliwal, K.K.: Identity verification using speech and face information. Digital Signal Processing 14(5), 397–507 (2004)
Hsu, R.L., Jain, A.K.: Face Modeling for Recognition. In: Proceedings Int’l Conf. Image Processing, ICIP, Greece, October 7-10 (2001)
Yehia, H., Rubin, P., Vatikiotic-Bateson, E.: Quantitative association of vocal track and facial behavior. Jorunal of Speech Communication 26(1-2), 23–43 (1998)
Yehia, H., Kuratate, T., Vatikiotic-Bateson, E.: Linking Facial Animation, Head Motion and Speech Acoustics. Journal of Phonetics 30(3) (2002)
Sanderson, C., Paliwal, K.K.: Fast features for face authentication under illumination direction changes. Pattern Recognition Letters 24, 2409–2419 (2003)
Goecke, R., Millar, J.B.: The Audio-Video Australian English Speech Data Corpus AVOZES. In: Proceedings of the 8th International Conference on Spoken Language Processing INTERSPEECH 2004 - ICSLP, Jeju, Korea, 4 - 8 October, vol. III, pp. 2525–2528 (2004)
Gordon, G.: Face Recognition from Frontal and Profile Views. In: Proceedings Int’l Workshop on Face and Gesture Gesture Recognition, Zurich, pp. 47–52 (1995)
Chetty, G., Wagner, M.: Automated lip feature extraction for liveness verification in audio-video authentication. In: Proc. Image and Vision Computing 2004, New Zealand, pp. 17–22 (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Chetty, G., Wagner, M. (2006). Face-Voice Authentication Based on 3D Face Models. In: Narayanan, P.J., Nayar, S.K., Shum, HY. (eds) Computer Vision – ACCV 2006. ACCV 2006. Lecture Notes in Computer Science, vol 3851. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11612032_57
Download citation
DOI: https://doi.org/10.1007/11612032_57
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-31219-2
Online ISBN: 978-3-540-32433-1
eBook Packages: Computer ScienceComputer Science (R0)