Acoustic Speaker Identification: The LIMSI CLEAR’07 System

Barras, Claude; Zhu, Xuan; Leung, Cheung-Chi; Gauvain, Jean-Luc; Lamel, Lori

doi:10.1007/978-3-540-68585-2_21

Claude Barras^1,2,
Xuan Zhu^1,2,
Cheung-Chi Leung¹,
Jean-Luc Gauvain¹ &
…
Lori Lamel¹

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 4625))

Included in the following conference series:

1215 Accesses
2 Citations

Abstract

The CLEAR 2007 acoustic speaker identification task aims to identify speakers in CHIL seminars via the acoustic channel. The LIMSI system for this task consists of a standard Gaussian mixture model based system working on cepstral coefficients, with MAP adaptation of a Universal Background Model (UBM). It builds upon the LIMSI CLEAR’06 system with several modifications: removal of feature normalization and frames filtering, and pooling of all speaker enrollment data for UBM training. The primary system uses a beamforming of all audio channels, while a single channel is selected for the contrastive system. This latter system performs the best and improves the baseline system by 50% relative for the 1 second and 5 seconds test conditions.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Anguera, X., Wooters, C., Hernando, J.: Speaker Diarization for Multi-Party Meetings Using Acoustic Fusion. In: Automatic Speech Recognition and Understanding (IEEE, ASRU 2005), San Juan, Puerto Rico (2005)
Google Scholar
Barras, C., Zhu, X., Gauvain, J.-L., Lamel, L.: The CLEAR 2006 LIMSI Acoustic Speaker Identification System for CHIL Seminars. In: Stiefelhagen, R., Garofolo, J.S. (eds.) CLEAR 2006. LNCS, vol. 4122, pp. 233–240. Springer, Heidelberg (2007)
Chapter Google Scholar
Gauvain, J.-L., Lee, C.H.: Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains. IEEE Transactions on Speech and Audio Processing 2(2), 291–298 (1994)
Article Google Scholar
Pelecanos, J., Sridharan, S.: Feature warping for robust speaker verification. In: Proc. ISCA Workshop on Speaker Recognition - Odyssey (June 2001)
Google Scholar
Reynolds, D., Quatieri, T., Dunn, R.: Speaker verification using adapted Gaussian mixture models. Digital Signal Processing 10, 19–41 (2000)
Article Google Scholar
Luque, J., Hernando, J.: Robust Speaker Identification for Meetings: UPC CLEAR 2007 Meeting Room Evaluation System. LNCS, vol. 4625. Springer, Heidelberg (2008)
Google Scholar

Download references

Author information

Authors and Affiliations

Spoken Language Processing Group, LIMSI-CNRS, BP 133, 91403, Orsay cedex, France
Claude Barras, Xuan Zhu, Cheung-Chi Leung, Jean-Luc Gauvain & Lori Lamel
Univ Paris-Sud, F-91405, Orsay, France
Claude Barras & Xuan Zhu

Authors

Claude Barras
View author publications
You can also search for this author in PubMed Google Scholar
Xuan Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Cheung-Chi Leung
View author publications
You can also search for this author in PubMed Google Scholar
Jean-Luc Gauvain
View author publications
You can also search for this author in PubMed Google Scholar
Lori Lamel
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Rainer Stiefelhagen Rachel Bowers Jonathan Fiscus

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Barras, C., Zhu, X., Leung, CC., Gauvain, JL., Lamel, L. (2008). Acoustic Speaker Identification: The LIMSI CLEAR’07 System. In: Stiefelhagen, R., Bowers, R., Fiscus, J. (eds) Multimodal Technologies for Perception of Humans. RT CLEAR 2007 2007. Lecture Notes in Computer Science, vol 4625. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-68585-2_21

Download citation

DOI: https://doi.org/10.1007/978-3-540-68585-2_21
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-68584-5
Online ISBN: 978-3-540-68585-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics