Speaker Databases and Evaluation

Introduction

Expanding interest in the use of biometrics for security purposes has brought increasing attention to the use of speech as a biometric. Speech fits naturally into the list of likely biometric modalities. It is an activity engaged in by essentially everyone, and is one of the primary means by which people identify those whom they know.

But speaker recognition has not heretofore been seen as among the most useful biometrics for general security applications. Much more developmental effort has gone into the use of face, fingerprint, and iris. Recognition of speakers by voice has been seen as more of a niche application, largely because of the special difficulties associated with the collection of quality speech input, and perhaps because of a particular advantage it may offer.

This introduction briefly discusses some key issues related to speaker recognition as a biometric. In the following section some of the main databases that have been used for speaker recognition...

Copyright information

© 2009 Springer Science+Business Media, LLC

About this entry

Cite this entry

Martin, A.F. (2009). Speaker Databases and Evaluation. In: Li, S.Z., Jain, A. (eds) Encyclopedia of Biometrics. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-73003-5_204
