Synonyms
Speaker databases; SRE
Definition
An organized collection of speech data designed to provide extensive examples of particular types of speech or to support research and development of systems for particular types of speech processing is typically referred to as a speech corpus (plural corpora). An alternative term is a speech database.
Speaker recognition refers to the challenge of determining from a given speech segment who is speaking. It may involve deciding which of a given set of n different known subjects is speaking. This is referred to as speaker identification and may be either closed set (must be one of the n) or open set (may be none of the known speakers). Alternatively, it may involve deciding whether or not one particular known speaker is speaking in a given speech segment. This may be referred to as speaker detection or speaker verification.
A speaker corpus refers to a collection of speech data (a speech corpus) containing multiple speakers and for each speaker...
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
C. Cieri, J.P. Campbell, H. Nakasone, D. Miller, K. Walker, The Mixer corpus of multilingual, multichannel speaker recognition data, in LREC 2004: Fourth International Conference on Language Resources and Evaluation, Lisbon
C. Cieri, W. Andrews, J.P. Campbell, G. Doddington, J. Godfrey, S. Huang, M. Liberman, A. Martin, H. Nakasone, M. Przybocki, K. Walker, The Mixer and transcript reading corpora: resources for multilingual, cross-channel speaker recognition research, in LREC 2006: Fifth International Conference on Language Resources and Evaluation, Genoa
C. Cieri, L. Corson, D. Graff, K. Walker, Resources for new research directions in speaker recognition: the Mixer 3, 4 and 5 corpora, Interspeech, Antwerp, Aug 2007
L. Brandschain, D. Graff, C. Cieri, K. Walker, C. Caruso, A. Neely, The Mixer 6 corpus: resources for cross-channel and text independent speaker recognition, in LREC, Malta, May 2010, pp. 2441–2444
Heman A Patil, T.K. Basu, Development of speech corpora for speaker recognition research and evaluation in Indian languages. Int. J. Speech Technol. 11, 17–32 (2008)
A.F. Martin et al., The DET curve in assessment of detection task performance, in Proceedings of Eurospeech’97, Rhodes, vol. 4, Sept 1997, pp. 1899–1903
N. Brummer, J. du Preez, Application-independent evaluation of speaker detection. Comput. Speech Lang. 20(2–3), 230–275 (2006)
G. Doddington, Speaker recognition based on idiolectal differences between speakers, in Proceedings of Eurospeech’01, Aalborg, vol. 4, Sept 2001, pp. 2521–2524
A.F. Martin, M.A. Przybocki, The NIST speaker recognition evaluations: 1996–2001, in Proceedings of 2001: A Speaker Odyssey, Chainia, Crete, June 2001, pp. 39–43
A.F. Martin, M.A. Przybocki, J.P. Campbell, The NIST speaker recognition evaluation program, in Biometric Systems: Technology, Design and Performance Evaluation, chap. 8, ed. by J. Wayman et al. (Springer, London, 2005), pp. 241–262
M.A. Przybocki, A.F. Martin, NIST speaker recognition evaluation chronicles, in Proceedings of Odyssey 2004: The Speaker and Language Recognition Workshop, Toledo, June 2004
M.A. Przybocki, A.F. Martin, A.N. Le, NIST speaker recognition evaluation chronicles – part 2, in Proceedings of Odyssey 2006: The Speaker and Language Recognition Workshop, San Juan, June 2006
M.A. Przybocki, A.F. Martin, A.N. Le, NIST speaker recognition evaluations utililizing the Mixer corpora – 2004, 2005, 2006. IEEE Trans. Audio Speech Lang. Process. 15(7), 1951–1959 (2007)
A.F. Martin, Evaluations of automatic speaker classification systems, in Speaker Classification I, ed. by C. Muller (Springer, Berlin/Heidelberg, 2007), pp. 313–329
A. Martin, C. Greenberg, NIST 2008 speaker recognition evaluation: performance across telephone and room microphone channels, in Proceedings of Interspeech, Brighton, Sept 2009, pp. 2579–2582
A.F. Martin, Craig S. Greenberg, The NIST 2010 speaker recognition evaluation, in Proceedings of Interspeech, Makuhari, 2010, pp. 2726–2729
C. Greenberg, A. Martin, M. Przybocki, The 2011 BEST speaker recognition interim assessment, in Proceedings of Odyssey, Singapore, June 2012
D.A. Reynolds, Keynote talk “speaker and language recognition: a guided safari”, in Proceedings of Odyssey 2008: The Speaker and Language Recognition Workshop, Stellenbosch, Jan 2008
D.A. van Leeuwen et al., NIST and NFI-TNO evaluations of automatic speaker recognition. Comput. Speech Lang. 20(2/3), 128–158 (2006)
C. Greenberg, A. Martin, L. Brandschain, J. Campbell, C. Cieri, G. Doddington, J. Godfrey, Human assisted speaker recognition in NIST SRE10, in Proceedings of Odyssey, Brno, June 2010, paper 032
C.S. Greenberg, A.F. Martin, G.R. Doddington, J.J. Godfrey, Including human expertise in speaker recognition systems: report on a pilot evaluation, in Proceedings of ICASSP, Prague, May 2011, pp. 5896–5899
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer Science+Business Media New York
About this entry
Cite this entry
Martin, A.F. (2015). Speaker Corpora and Evaluation. In: Li, S.Z., Jain, A.K. (eds) Encyclopedia of Biometrics. Springer, Boston, MA. https://doi.org/10.1007/978-1-4899-7488-4_204
Download citation
DOI: https://doi.org/10.1007/978-1-4899-7488-4_204
Published:
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4899-7487-7
Online ISBN: 978-1-4899-7488-4
eBook Packages: Computer ScienceReference Module Computer Science and Engineering