Abstract
This paper discusses a new modality for speaker recognition, conversational biometrics, as a high-security voice-based authentication method for e-commerce applications. By combining several conversational technologies that operate simultaneously, highly accurate and transparent speaker recognition becomes possible even under channel or environment mismatch. For speaker identification over very large populations, we combine dialogs, which reduce the set of confusable speakers, with text-independent speaker identification, which pinpoints the actual speaker. Similarly, dialogs with personal random or predefined questions are used to perform knowledge-based and acoustic-based verification of the user simultaneously. Appropriate dialog design allows the ROC curves to be tailored to the needs of most applications. We demonstrate the conceptual advantages using our telephony prototype. Users familiar with the system can log in with 0.8% or 1.3% false rejection and approximately 5 × 10⁻¹²% or 2 × 10⁻⁶% false acceptance in about 40 s or 20 s, respectively, which compares favorably with purely voice-print-based authentication.
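The abstract's claim that dialog design can tailor the ROC curves can be illustrated with a minimal sketch. It is not the authors' method: it assumes that every dialog turn is an independent check and that the user must pass all turns, so false acceptance rates multiply while false rejections accumulate. The function name `combined_rates` and the per-turn operating points are hypothetical, chosen only for illustration, and are not taken from the paper.

```python
from math import prod


def combined_rates(turns):
    """Combine per-turn error rates for a multi-turn verification dialog.

    turns: list of (false_accept, false_reject) probabilities, one per turn.
    Assumption (not from the paper): turns are statistically independent
    and the claimant is accepted only if every turn is passed.
    """
    # An impostor must be wrongly accepted on every turn.
    false_accept = prod(fa for fa, _ in turns)
    # A genuine user is rejected if any single turn rejects them.
    false_reject = 1.0 - prod(1.0 - fr for _, fr in turns)
    return false_accept, false_reject


# Hypothetical operating point per turn: 1% false accept, 0.2% false reject.
turns = [(0.01, 0.002)] * 3
fa, fr = combined_rates(turns)
print(f"false accept: {fa:.3e}, false reject: {fr:.3e}")
```

Under these assumptions, each added turn lowers false acceptance sharply and raises false rejection only slightly, at the cost of a longer dialog. This is the same kind of trade-off the abstract reports between its 40 s and 20 s configurations.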
Copyright information
© 2001 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Maes, S.H., Navrátil, J., Chaudhari, U.V. (2001). Conversational Speech Biometrics. In: Liu, J., Ye, Y. (eds) E-Commerce Agents. Lecture Notes in Computer Science, vol 2033. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45370-9_10
DOI: https://doi.org/10.1007/3-540-45370-9_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-41934-1
Online ISBN: 978-3-540-45370-3
eBook Packages: Springer Book Archive