Speaker Corpora and Evaluation

Martin, Alvin F.

doi:10.1007/978-1-4899-7488-4_204


Synonyms

Speaker databases; SRE

Definition

A speech corpus (plural: corpora) is an organized collection of speech data designed either to provide extensive examples of particular types of speech or to support research and development of systems for particular types of speech processing. An alternative term is speech database.

Speaker recognition is the task of determining, from a given speech segment, who is speaking. One form, speaker identification, decides which of a given set of n known speakers is speaking; it may be closed set (the speaker must be one of the n) or open set (the speaker may be none of the known speakers). The other form decides whether or not one particular known speaker is speaking in a given speech segment; this is referred to as speaker detection or speaker verification.
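The three decision tasks described above can be illustrated with a minimal Python sketch. It assumes some upstream system has already produced a similarity score for each known speaker, where higher means a closer match; the function names, the example scores, and the threshold values are hypothetical and are not taken from this entry.

```python
from typing import Dict, Optional


def identify_closed_set(scores: Dict[str, float]) -> str:
    """Closed-set identification: the speaker must be one of the known
    speakers, so return whichever has the highest score."""
    return max(scores, key=lambda speaker: scores[speaker])


def identify_open_set(scores: Dict[str, float], threshold: float) -> Optional[str]:
    """Open-set identification: return the best-scoring known speaker,
    or None when even the best score falls below the threshold."""
    best = identify_closed_set(scores)
    return best if scores[best] >= threshold else None


def verify(score: float, threshold: float) -> bool:
    """Detection/verification: accept or reject the claim that one
    particular known speaker is speaking."""
    return score >= threshold


if __name__ == "__main__":
    # Hypothetical scores for one speech segment against three known speakers.
    scores = {"alice": 0.42, "bob": 0.61, "carol": 0.18}
    print(identify_closed_set(scores))       # always names a known speaker
    print(identify_open_set(scores, 0.80))   # may answer "none of them"
    print(verify(scores["bob"], 0.50))       # yes/no for a single claimed speaker
```

The only structural difference between the closed-set and open-set cases in this sketch is the rejection threshold; verification reduces to the same threshold test applied to a single claimed speaker.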

A speaker corpus refers to a collection of speech data (a speech corpus) containing multiple speakers and for each speaker...



Author information

Authors and Affiliations

National Institute of Standards and Technology, Gaithersburg, MD, USA
Alvin F. Martin


Editor information

Editors and Affiliations

Center for Biometrics and Security, Research & National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing, China
Stan Z. Li
Departments of Computer Science and Engineering, Michigan State University, East Lansing, MI, USA
Anil K. Jain


Cite this entry

Martin, A.F. (2015). Speaker Corpora and Evaluation. In: Li, S.Z., Jain, A.K. (eds) Encyclopedia of Biometrics. Springer, Boston, MA. https://doi.org/10.1007/978-1-4899-7488-4_204


DOI: https://doi.org/10.1007/978-1-4899-7488-4_204
Published: 03 July 2015
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4899-7487-7
Online ISBN: 978-1-4899-7488-4
eBook Packages: Computer Science; Reference Module Computer Science and Engineering
