Speaker Databases and Evaluation

Martin, Alvin F.

doi:10.1007/978-0-387-73003-5_204

Speaker Databases and Evaluation

Alvin F. Martin³

Reference work entry

74 Accesses
1 Citations

Introduction

Expanding interest in the use of biometrics for security purposes has brought increasing attention to the use of speech as a biometric. Speech fits naturally into the list of likely biometric modalities. It is an activity engaged in by essentially everyone, and is one of the primary means by which people identify those whom they know.

But speaker recognition has not heretofore been seen as among the most useful biometrics for general security applications. There has been a much more developmental effort on the use of face, fingerprint, and iris. Recognition of speakers by voice has been seen as more of a niche application, largely because of the special difficulties associated with the collection of quality speech input, and perhaps because of a particular advantage may offer.

This introduction briefly discusses some key issues related to speaker recognition as a biometric. In the following section some of the main databases that have been used for speaker recognition...

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 449.00; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

Cieri, C., Campbell, J.P., Nakasone, H., Miller, D., Walker, K.: The Mixer Corpus of Multilingual, Multichannel Speaker Recognition Data, LREC 2004: Fourth International Conference on Language Resources and Evaluation, Lisbon (2004)
Google Scholar
Cieri, C., Andrews, W., Campbell, J.P., Doddington, G., Godfrey, J., Huang, S., Liberman, M., Martin, A., Nakasone, H., Przybocki, M., Walker, K.: The Mixer and Transcript Reading Corpora: Resources for Multilingual, Crosschannel Speaker Recognition Research, LREC 2006: Fifth International Conference on Language Resources and Evaluation (2006)
Google Scholar
Cieri, C., Corson, L., Graff, D., Walker, K.: Resources for New Research Directions in Speaker Recognition: The Mixer 3, 4 and 5 Corpora, Interspeech 2007, Antwerp (August 2007)
Google Scholar
Martin, A.F., et al.: The DET curve in assessment of detection task performance. In: Proceedings of Eurospeech ’97, vol. 4, pp. 1899–1903. Rhodes, Greece (September 1997)
Google Scholar
Brummer, N., du Preez, J.: Application-independent evaluation of speaker detection. Comput. Speech Lang. 20(2–3), 230–275 (April–July 2006)
Article Google Scholar
Doddington, G.: Speaker recognition based on idiolectal differences between speakers. In: Proceedings of Eurospeech ’01, vol. 4, pp. 2521–2524. Aalborg, Denmark (September 2001)
Google Scholar
Martin, A.F., Przybocki, M.A.: The NIST speaker recognition evaluations: 1996–2001. In: Proceedings of 2001: A Speaker Odyssey, pp. 39–43. Chainia, Crete, Greece (June 2001)
Google Scholar
Martin, A.F., Przybocki, M.A., Campbell, J.P.: The NIST speaker recognition evaluation program. In: Wayman, J. (eds.) et al. Biometric Systems: Technology, Design and Performance Evaluation, Chapter 8, pp. 241–262. Springer, Berlin (2005)
Google Scholar
Przybocki, M.A., Martin, A.F.: (2004)NIST speaker recognition evaluation chronicles. In: Proceedings of Odyssey 2004: The Speaker and Language Recognition Workshop. Toledo, Spain
Google Scholar
Przybocki, M.A., Martin, A.F., Le, A.N.: NIST speaker recognition evaluation chronicles – Part 2. In: Proceedings of Odyssey 2006: The Speaker and Language Recognition Workshop. San Juan, PR (2006)
Google Scholar
Przybocki, M.A., Martin, A.F., Le, A.N.: NIST speaker recognition evaluations utililizing the mixer corpora – 2004, 2005, 2006. IEEE Trans. Audio Speech Lang. Process. 15(7), (2007)
Google Scholar
Martin, A.F.: Evaluations of automatic speaker classification systems. In: Muller, C. (ed.) Speaker Classification I, pp. 313–329. Springer, Berlin (2007)
Chapter Google Scholar
Reynolds, D.A.: (January 2008)Keynote talk. In: Proceedings of Odyssey 2008: The Speaker and Language Recognition Workshop. Stellenbosch, South Africa
Google Scholar
Leeuwen, D.A., et al.: van NIST and NFI-TNO evaluations of automatic speaker recognition. Comput. Speech Lang. 20(2), 128–158 (2006)
Article Google Scholar

Download references

Author information

Authors and Affiliations

National Institute of Standards and Technology, Gaithersburg, Maryland, USA
Alvin F. Martin

Authors

Alvin F. Martin
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Center for Biometrics and Security Research, Chinese Academy of Sciences, Beijing, China
Stan Z. Li (Professor) (Professor)
Departments of Computer Science & Engineering, Michigan State University, East Lansing, MI, USA
Anil Jain (Professor) (Professor)

Rights and permissions

Reprints and permissions

Copyright information

About this entry

Cite this entry

Martin, A.F. (2009). Speaker Databases and Evaluation. In: Li, S.Z., Jain, A. (eds) Encyclopedia of Biometrics. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-73003-5_204

Download citation

DOI: https://doi.org/10.1007/978-0-387-73003-5_204
Publisher Name: Springer, Boston, MA
Print ISBN: 978-0-387-73002-8
Online ISBN: 978-0-387-73003-5
eBook Packages: Computer ScienceReference Module Computer Science and Engineering

Publish with us

Policies and ethics