Sparsity Analysis and Compensation for i-Vector Based Speaker Verification

Li, Wei; Fu, Tian Fan; Zhu, Jie; Chen, Ning

doi:10.1007/978-3-319-23132-7_47

Sparsity Analysis and Compensation for i-Vector Based Speaker Verification

Wei Li⁷,
Tian Fan Fu⁸,
Jie Zhu⁷ &
…
Ning Chen⁹

Conference paper
First Online: 01 January 2015

1630 Accesses
2 Citations

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9319))

Abstract

Over recent years, i-vector based framework has been proven to provide state-of-art performance in speaker verification. Most of the researches focus on compensating the channel variability of i-vector. In this paper we will give an analysis that in the case that the duration of enrollment or test utterance is limited, i-vector based system may suffer from biased estimation problem. In order to solve this problem, we propose an improved i-vector extraction algorithm which we term Adapted First order Baum-Welch Statistics Analysis (AFSA). This new algorithm suppresses and compensates the deviation of first order Baum-Welch statistics caused by phonetic sparsity and phonetic imbalance. Experiments were performed based on NIST 2008 SRE data sets, Experimental results show that 10 %–15 % relative improvement is achieved compared to the baseline of traditional i-vector based system.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

Bonastre, J.F., Scheffer, N., Matrouf, D., Fredouille, C., Larcher, A., Preti, A., Pouchoulin, G., Evans, N.W., Fauve, B.G., Mason, J.S.: Alize/spkdet: a state-of-the-art open source software for speaker recognition. In: Odyssey, p. 20 (2008)
Google Scholar
Bousquet, P.M., Larcher, A., Matrouf, D., Bonastre, J.F., Plchot, O.: Variance-spectra based normalization for i-vector standard and probabilistic linear discriminant analysis. In: Speaker and Language Recognition Workshop (IEEE Odyssey) (2012)
Google Scholar
Bousquet, P.M., Matrouf, D., Bonastre, J.F.: Intersession compensation and scoring methods in the i-vectors space for speaker recognition. In: INTERSPEECH, pp. 485–488 (2011)
Google Scholar
Dehak, N., Kenny, P., Dehak, R., Dumouchel, P., Ouellet, P.: Front-end factor analysis for speaker verification. IEEE Trans. Audio Speech Lang. Process. 19(4), 788–798 (2011)
Article Google Scholar
Kenny, P.: Joint factor analysis of speaker and session variability: Theory and algorithms. CRIM, Montreal, (Report) CRIM-06/08-13 (2005)
Google Scholar
Kenny, P.: Bayesian speaker verification with heavy-tailed priors. In: Odyssey, p. 14 (2010)
Google Scholar
Kenny, P., Boulianne, G., Dumouchel, P.: Eigenvoice modeling with sparse training data. IEEE Trans. Speech Audio Process. 13(3), 345–354 (2005)
Article Google Scholar
Kenny, P., Boulianne, G., Ouellet, P., Dumouchel, P.: Joint factor analysis versus eigenchannels in speaker recognition. IEEE Trans. Audio Speech Lang. Process. 15(4), 1435–1447 (2007)
Article Google Scholar
Kenny, P., Ouellet, P., Dehak, N., Gupta, V., Dumouchel, P.: A study of interspeaker variability in speaker verification. IEEE Trans. Audio Speech Lang. Process. 16(5), 980–988 (2008)
Article Google Scholar
Pelecanos, J., Sridharan, S.: Feature warping for robust speaker verification (2001)
Google Scholar
Reynolds, D.A., Quatieri, T.F., Dunn, R.B.: Speaker verification using adapted gaussian mixture models. Digital Sig. Process. 10(1), 19–41 (2000)
Article Google Scholar

Download references

Acknowledgments

This article was supported by the National Natural Science Foundation of China (NSFC) under Grants No. 61271349, 61371147 and 11433002.

Author information

Authors and Affiliations

Department of Electronic Engineering, Shanghai Jiao Tong University, Shanghai, 200240, China
Wei Li & Jie Zhu
Department of Computer Science and Engineering (CSE), Shanghai Jiao Tong University, Shanghai, 200240, China
Tian Fan Fu
School of Information Science and Engineering, East China University of S&T, Shanghai, 200237, China
Ning Chen

Authors

Wei Li
View author publications
You can also search for this author in PubMed Google Scholar
Tian Fan Fu
View author publications
You can also search for this author in PubMed Google Scholar
Jie Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Ning Chen
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Wei Li .

Editor information

Editors and Affiliations

SPIIRAS, Saint-Petersburg, Russia
Andrey Ronzhin
Moscow State Linguistic University, Moscow, Russia
Rodmonga Potapova
University of Patras, Patras, Greece
Nikos Fakotakis

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Li, W., Fu, T.F., Zhu, J., Chen, N. (2015). Sparsity Analysis and Compensation for i-Vector Based Speaker Verification. In: Ronzhin, A., Potapova, R., Fakotakis, N. (eds) Speech and Computer. SPECOM 2015. Lecture Notes in Computer Science(), vol 9319. Springer, Cham. https://doi.org/10.1007/978-3-319-23132-7_47

Download citation

DOI: https://doi.org/10.1007/978-3-319-23132-7_47
Published: 04 September 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-23131-0
Online ISBN: 978-3-319-23132-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics