Skip to main content

Conversational Speech Biometrics

  • Chapter
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2033))

Abstract

This paper discusses a new modality for speaker recognition - conversational biometrics - as a high security voice-based authentication method for E-commerce applications. By combining diverse simultaneous conversational technologies, high accuracy transparent speaker recognition becomes possible even in channel or environment mismatches. For speaker identification over very large populations, we combine dialogs to reduce the set of confusable speakers and text-independent speaker identification to pin-point the actual speaker. Similarly, dialogs with personal random or predefined questions are used to perform simultaneously knowledge-based and acoustic-based verifications of the user. Adequate design of the dialog allows to tailor the ROC curves to the needs of most applications. We demonstrate the conceptual advantages using our telephony prototype. Users familiar with the system can log into the system with 0.8% or 1.3% false rejection and ca. 5 • 10−12% or 2 • 10−6% false acceptance rates in about 40 sec or 20 sec respectively which is an impressive result as compared to purely voice-print based authentication.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Atal B. S.: Automatic recognition of speakers from their voices. Proc. IEEE, 64:pp. 460–475 (1976).

    Article  Google Scholar 

  2. Beigi H. S., Maes S. H., Sorensen J. S., and Chaudhari U. V.: A hierarchichal approach to large-scale speaker recognition. In Proc. Eurospeech (1999).

    Google Scholar 

  3. Beigi H. S. M., Maes S., and Sorensen J.: A frame-based statistical method for speaker recognition. In Proc. RLA2C, Avigon, France, (1998).

    Google Scholar 

  4. Campbell J.: Automatic speech and speaker recognition, advanced topics. In Lee et al. [14].

    Google Scholar 

  5. Chaudhari U. V., Beigi H. S., and Maes S. H.: Multi-environment speaker verification. In Proc. AutoID, (1999).

    Google Scholar 

  6. Chaudhari U.V., Navr-atil J., and Maes S.H.: Multi-grained data modeling for speaker recognition with sparse training and test data. In Proc. of the International Conference on Spoken Language Processing (ICSLP), Beijing, (2000). submitted.

    Google Scholar 

  7. Davies K. and al.: The IBM conversational telephony system for financial applications. In Proc. Eurospeech, (1999).

    Google Scholar 

  8. Doddington G. R.: Speaker recognition-identifying people by their voices. Proc. IEEE, 76(11):pp. 1651–1664, (1985).

    Article  Google Scholar 

  9. Farell K.R., Mammone R.J., and Assaleh K.T.: Speaker recognition using neural networks and conventional classifiers. IEEE Trans. on Acoustics, Speech, and Signal Processing, 2(1):194–205, (1994).

    Google Scholar 

  10. Furui S.: Automatic speech and speaker recognition, advanced topics. In Lee et al. [14].

    Google Scholar 

  11. Furui S.: Recent advances in speaker recognition. In Bigun J., Chollet G., and Borgefors G., editors, Proc. Audio-and Video-based biometric person authentication, pages 237–252. Springer-Verlag, (1997).

    Google Scholar 

  12. Furui S. and Sondhi M., editors: Advances in speech signal processing. Marcel Dekker, New York, NY, (1991).

    Google Scholar 

  13. Kimball O., Schmidt M., Gish H., and Waterman J.: Speaker verification with limited enrollment data. In Proc. Eurospeech, volume 2, pages 967–970, (1997).

    Google Scholar 

  14. Lee C.-H., Soong F. K., and Paliwal K. K., editors: Automatic speech and speaker recognition, advanced topics. Kluwer Academic Publishers, Norwell, MA, (1996).

    Google Scholar 

  15. Li Q., Juang B.-H., Zhou Q., and Lee C.-H.: Verbal information verification. In Proc. Eurospeech, volume 2, pages 839–842, (1997).

    Google Scholar 

  16. Maes S. H. and Beigi H. S.: Open Sesame! Speech password or key to secure your door. In Proc. ACCV, (1998). invited paper.

    Google Scholar 

  17. Maes S.H.: Conversational biometrics In Proc. of the European Conference on Speech Communication and Technology (EUROSPEECH), Budapest, Hungary, (1999).

    Google Scholar 

  18. Navrátil J., Kleindienst J., and Maes S.H.: An instantiable speech biometrics module with natural language interface: Implementation in the telephony environment. In Proc. of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Istanbul, Turkey, (2000). IEEE.

    Google Scholar 

  19. O’Shaughnessy D.: Speaker recognition. IEEE ASSP Magazine, 3(4):pp. 4–17, (1986).

    Article  Google Scholar 

  20. Papineni K. A., Roukos S., and Ward R. T.: Free-flow dialog management using forms. In Proc. Eurospeech, (1999).

    Google Scholar 

  21. Ramaswamy G. and Gopalakrishnan P.: Compression of acoustic features for speech recognition in network environments. In Proc. ICASSP, volume 2, pages 977–980, (1998).

    Google Scholar 

  22. Rosenberg A. E. and Parthasarathy S.: Speaker identi-cation with user-selected password phrases. In Proc. Eurospeech, volume 3, pages 1371–1374, (1997).

    Google Scholar 

  23. Zviran M. and Haga W.J.: User authentication by cognitive passwords: An empirical assessment. IEEE, (1990).

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2001 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Maes, S.H., Navrátil, J., Chaudhari, U.V. (2001). Conversational Speech Biometrics. In: Liu, J., Ye, Y. (eds) E-Commerce Agents. Lecture Notes in Computer Science, vol 2033. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45370-9_10

Download citation

  • DOI: https://doi.org/10.1007/3-540-45370-9_10

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-41934-1

  • Online ISBN: 978-3-540-45370-3

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics