Exploration of Local Variability in Text-Independent Speaker Verification

Chen, Liping; Lee, Kong Aik; Ma, Bin; Guo, Wu; Li, Haizhou; Dai, Li-Rong

doi:10.1007/s11265-015-0997-1

Exploration of Local Variability in Text-Independent Speaker Verification

Published: 17 April 2015

Volume 82, pages 217–228, (2016)
Cite this article

Journal of Signal Processing Systems Aims and scope Submit manuscript

Liping Chen¹,
Kong Aik Lee²,
Bin Ma²,
Wu Guo¹,
Haizhou Li² &
…
Li-Rong Dai¹

288 Accesses
2 Citations
Explore all metrics

Abstract

Total variability model has shown to be effective for text-independent speaker verification. It provisions a tractable way to estimate the so-called i-vector, which describes the speaker and session variability rendered in a whole utterance. In order to extract the local session variability that is neglected by an i-vector, local variability models were proposed, including the Gaussian- and the dimension-oriented local variability models. This paper presents a consolidated study of the total and local variability models and gives a full comparison between them under the same framework. Besides, new extensions are proposed for the existing local variability models. The comparison between the total variability model and the local variability models is fulfilled with the experiments on NIST SRE’08 and SRE’10 datasets. Furthermore, in the experiments, the dimension-oriented local variability models show their capability to capture the session variability which is complementary to that estimated by the total variability model.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A study on the roles of total variability space and session variability modeling in speaker recognition

Article 07 December 2015

Two-space variability compensation technique for speaker verification in short length and reverberant environments

Article 12 May 2017

Identify the Benefits of the Different Steps in an i-Vector Based Speaker Verification System

References

Reynolds, D.A., Quatieri, T.F., & Dumn, R.B. (2000). Speaker verification using adapted Gaussian mixture model. Digital Signal Processing, 10(1–3), 19–41.
Article Google Scholar
Kinnunen, T., & Li, H. (2010). An overview of text-independent speaker recognition: from features to supervectors. Speech Communication, 52(1), 12–40.
Article Google Scholar
Kenny, P., Boulianne, G., Ouellet, P., & Dumouchel, P. (2007). Speaker and session variability in GMM-Based speaker verification. IEEE Trans. Audio Speech and Language Processing, 15(4), 1448–1460.
Article Google Scholar
Dehak, N., Kenny, P., Dehak, R., Dumouchel, P., & Ouellet, P. (2011). Front-end factor analysis for speaker verification. IEEE Trans. Audio Speech and Language Processing, 19(4), 788–798.
Article Google Scholar
Bishop, C.M. (2006). Pattern recognition and machine learning: Springer.
Kenny, P., Stafylakis, T., Ouellet, P., Alam, M.J., & Dumouchel, P. (2013). PLDA for speaker verification with utterance of arbitrary duration. In: Proceedings of IEEE ICASSP, (pp. 7649–7653).
Hatch, A., Kajarekar, S., & Stolcke, A. (2006). Within-class covariance normalization for SVM-based speaker recognition. In: International conference on spoken language processing, Pittsburgh.
Prince, S.J.D., & Elder, J.H. (2007). Probabilistic linear discriminant analysis for inferences about identity. In: Proceedings of the international conference on computer vision.
Chen, L., Lee, K.A., Ma, B., Guo, W., Li, H., & Dai, L.R. (2014). Local variability modeling for text-independent speaker verification. In: Proceedings of Odyssey: Speaker and Language Recognition Workshop.
Chen, L., Lee, K.A., Ma, B., Guo, W., Li, H., & Dai, L.R. (2014). Local variability vector for text-independent speaker verification. In: Proceedings of ISCSLP, (pp. 54–58).
Kenny, P. (2012). A small footprint i-vector extractor. In: Proceedings of the Odyssey: speaker and language recognition workshop.
Matejka, P., Glembek, O., Castaldo, F., Alam, J., Plchot, O., Kenny, P., Burget, L., & Cernocky, J. (2011). Full-covariance ubm and heavy-tailed plda in i-vector speaker verification. In: Proceedings of the IEEE ICASSP, (pp. 4828–4831).
Kenny, P. (2010). Bayesian speaker verification with heavy-tailed priors. In: Proceedings of the Odyssey: speaker and language recognition workshop.
Prince, S.J.D. (2012). Computer vision: models, learning, and inference, Cambridge University Press.
Jiang, Y., Lee, K.A., Tang, Z., Ma, B., Larcher, A., & Li, H. (2012). PLDA modeling in i-vector and supervector space for speaker verification. In: Proceedings if the INTERSPEECH, paper 198.
Lee, K.A., Larcher, A., You, C.H., Ma, B., & Li, H. (2013). Multi-session PLDA scoring of i-vector for partially open-set speaker detection. In: Proceedings of the INTERSPEECH, (pp. 3651–3655).
Kenny, P., Stafylakis, T., Ouellet, P., Alam, J., & Dumouchel, P. (2013). PLDA for Speaker Verification with Utterances of Arbitrary Duration. In: Proceedings of the IEEE ICASSP, (pp. 7649–7653).
Chen, L., Lee, K. A., Ma, B., Guo, W., Li, H., & Dai, L.R. (2014). Minimum divergence estimation of speaker prior in multi-session PLDA scoring. In: Proceedings of the ICASSP, (pp. 4035–4036).
Brmmer, N., & du Preez, J. (2006). Application-independent evaluation of speaker detection. Computer Speech & Language, 20(2), 230–275.
Article Google Scholar

Download references

Acknowledgments

The work of Liping Chen was partially supported by the National Nature Science Foundation of China (Grant No. 61273264) and the electronic information industry development fund of China (Grant No. 2013-472).

Author information

Authors and Affiliations

EEIS, USTC, Mailbox 4, Hefei, China
Liping Chen, Wu Guo & Li-Rong Dai
Singapore, 138632, Singapore
Kong Aik Lee, Bin Ma & Haizhou Li

Authors

Liping Chen
View author publications
You can also search for this author in PubMed Google Scholar
Kong Aik Lee
View author publications
You can also search for this author in PubMed Google Scholar
Bin Ma
View author publications
You can also search for this author in PubMed Google Scholar
Wu Guo
View author publications
You can also search for this author in PubMed Google Scholar
Haizhou Li
View author publications
You can also search for this author in PubMed Google Scholar
Li-Rong Dai
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Li-Rong Dai.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Chen, L., Lee, K.A., Ma, B. et al. Exploration of Local Variability in Text-Independent Speaker Verification. J Sign Process Syst 82, 217–228 (2016). https://doi.org/10.1007/s11265-015-0997-1

Download citation

Received: 21 November 2014
Revised: 16 February 2015
Accepted: 18 March 2015
Published: 17 April 2015
Issue Date: February 2016
DOI: https://doi.org/10.1007/s11265-015-0997-1

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Exploration of Local Variability in Text-Independent Speaker Verification

Abstract

Access this article

Similar content being viewed by others

A study on the roles of total variability space and session variability modeling in speaker recognition

Two-space variability compensation technique for speaker verification in short length and reverberant environments

Identify the Benefits of the Different Steps in an i-Vector Based Speaker Verification System

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Exploration of Local Variability in Text-Independent Speaker Verification

Abstract

Access this article

Similar content being viewed by others

A study on the roles of total variability space and session variability modeling in speaker recognition

Two-space variability compensation technique for speaker verification in short length and reverberant environments

Identify the Benefits of the Different Steps in an i-Vector Based Speaker Verification System

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation