Conversational Speech Biometrics

Maes, Stéphane H.; Navrátil, Jiří; Chaudhari, Upendra V.

doi:10.1007/3-540-45370-9_10

Conversational Speech Biometrics

Stéphane H. Maes⁵,
Jiří Navrátil⁵ &
Upendra V. Chaudhari⁵

Chapter
First Online: 01 January 2001

630 Accesses
3 Citations

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2033))

Abstract

This paper discusses a new modality for speaker recognition - conversational biometrics - as a high security voice-based authentication method for E-commerce applications. By combining diverse simultaneous conversational technologies, high accuracy transparent speaker recognition becomes possible even in channel or environment mismatches. For speaker identification over very large populations, we combine dialogs to reduce the set of confusable speakers and text-independent speaker identification to pin-point the actual speaker. Similarly, dialogs with personal random or predefined questions are used to perform simultaneously knowledge-based and acoustic-based verifications of the user. Adequate design of the dialog allows to tailor the ROC curves to the needs of most applications. We demonstrate the conceptual advantages using our telephony prototype. Users familiar with the system can log into the system with 0.8% or 1.3% false rejection and ca. 5 • 10^−12% or 2 • 10^−6% false acceptance rates in about 40 sec or 20 sec respectively which is an impressive result as compared to purely voice-print based authentication.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Atal B. S.: Automatic recognition of speakers from their voices. Proc. IEEE, 64:pp. 460–475 (1976).
Article Google Scholar
Beigi H. S., Maes S. H., Sorensen J. S., and Chaudhari U. V.: A hierarchichal approach to large-scale speaker recognition. In Proc. Eurospeech (1999).
Google Scholar
Beigi H. S. M., Maes S., and Sorensen J.: A frame-based statistical method for speaker recognition. In Proc. RLA2C, Avigon, France, (1998).
Google Scholar
Campbell J.: Automatic speech and speaker recognition, advanced topics. In Lee et al. [14].
Google Scholar
Chaudhari U. V., Beigi H. S., and Maes S. H.: Multi-environment speaker verification. In Proc. AutoID, (1999).
Google Scholar
Chaudhari U.V., Navr-atil J., and Maes S.H.: Multi-grained data modeling for speaker recognition with sparse training and test data. In Proc. of the International Conference on Spoken Language Processing (ICSLP), Beijing, (2000). submitted.
Google Scholar
Davies K. and al.: The IBM conversational telephony system for financial applications. In Proc. Eurospeech, (1999).
Google Scholar
Doddington G. R.: Speaker recognition-identifying people by their voices. Proc. IEEE, 76(11):pp. 1651–1664, (1985).
Article Google Scholar
Farell K.R., Mammone R.J., and Assaleh K.T.: Speaker recognition using neural networks and conventional classifiers. IEEE Trans. on Acoustics, Speech, and Signal Processing, 2(1):194–205, (1994).
Google Scholar
Furui S.: Automatic speech and speaker recognition, advanced topics. In Lee et al. [14].
Google Scholar
Furui S.: Recent advances in speaker recognition. In Bigun J., Chollet G., and Borgefors G., editors, Proc. Audio-and Video-based biometric person authentication, pages 237–252. Springer-Verlag, (1997).
Google Scholar
Furui S. and Sondhi M., editors: Advances in speech signal processing. Marcel Dekker, New York, NY, (1991).
Google Scholar
Kimball O., Schmidt M., Gish H., and Waterman J.: Speaker verification with limited enrollment data. In Proc. Eurospeech, volume 2, pages 967–970, (1997).
Google Scholar
Lee C.-H., Soong F. K., and Paliwal K. K., editors: Automatic speech and speaker recognition, advanced topics. Kluwer Academic Publishers, Norwell, MA, (1996).
Google Scholar
Li Q., Juang B.-H., Zhou Q., and Lee C.-H.: Verbal information verification. In Proc. Eurospeech, volume 2, pages 839–842, (1997).
Google Scholar
Maes S. H. and Beigi H. S.: Open Sesame! Speech password or key to secure your door. In Proc. ACCV, (1998). invited paper.
Google Scholar
Maes S.H.: Conversational biometrics In Proc. of the European Conference on Speech Communication and Technology (EUROSPEECH), Budapest, Hungary, (1999).
Google Scholar
Navrátil J., Kleindienst J., and Maes S.H.: An instantiable speech biometrics module with natural language interface: Implementation in the telephony environment. In Proc. of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Istanbul, Turkey, (2000). IEEE.
Google Scholar
O’Shaughnessy D.: Speaker recognition. IEEE ASSP Magazine, 3(4):pp. 4–17, (1986).
Article Google Scholar
Papineni K. A., Roukos S., and Ward R. T.: Free-flow dialog management using forms. In Proc. Eurospeech, (1999).
Google Scholar
Ramaswamy G. and Gopalakrishnan P.: Compression of acoustic features for speech recognition in network environments. In Proc. ICASSP, volume 2, pages 977–980, (1998).
Google Scholar
Rosenberg A. E. and Parthasarathy S.: Speaker identi-cation with user-selected password phrases. In Proc. Eurospeech, volume 3, pages 1371–1374, (1997).
Google Scholar
Zviran M. and Haga W.J.: User authentication by cognitive passwords: An empirical assessment. IEEE, (1990).
Google Scholar

Download references

Author information

Authors and Affiliations

IBM T.J. Watson Research Center, Rt. 134, Yorktown Heights, NY, USA
Stéphane H. Maes, Jiří Navrátil & Upendra V. Chaudhari

Authors

Stéphane H. Maes
View author publications
You can also search for this author in PubMed Google Scholar
Jiří Navrátil
View author publications
You can also search for this author in PubMed Google Scholar
Upendra V. Chaudhari
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science, Hong Kong Baptist University, Kowloon Tong, Hong Kong, China
Jiming Liu
IBM T.J.Watson Research Center, 30 Saw Mill River Road (Route 9A), Hawthorne, NY, 10532, USA
Yiming Ye

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Maes, S.H., Navrátil, J., Chaudhari, U.V. (2001). Conversational Speech Biometrics. In: Liu, J., Ye, Y. (eds) E-Commerce Agents. Lecture Notes in Computer Science, vol 2033. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45370-9_10

Download citation

DOI: https://doi.org/10.1007/3-540-45370-9_10
Published: 25 April 2001
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-41934-1
Online ISBN: 978-3-540-45370-3
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics