Efficient Speaker Identification Based on Robust VQ-PCA

Lee, Younjeong; Lee, Joohun; Lee, Ki Yong

doi:10.1007/3-540-44843-8_69

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 2668)

Included in the following conference series:

International Conference on Computational Science and Its Applications

Abstract

This paper proposes an efficient speaker identification method based on robust vector quantization principal component analysis (VQ-PCA), addressing the problems caused by outliers and by the high dimensionality of training feature vectors. First, the method partitions the data space into several disjoint regions using robust VQ based on M-estimation. Second, a robust PCA is obtained from the covariance matrix of each region. Finally, a Gaussian mixture model (GMM) is built for each speaker from the feature vectors transformed to reduced dimension by the robust PCA of each region. Compared with the conventional GMM with diagonal covariance matrices, the proposed method runs faster and needs less storage at the same level of performance, and it is also robust to outliers.
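The first two steps of the abstract (robust partitioning, then a per-region robust PCA) can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the M-estimator, so the Huber-type weight, the squared weight used for the covariance, the threshold `c`, the power-iteration eigen-solver, and all function names are assumptions made for illustration. Only the leading principal axis is extracted, and GMM training on the reduced features is omitted.

```python
import math

def huber_weight(dist, c):
    """Weight 1 within radius c, c/dist beyond it (assumed Huber-type loss)."""
    return 1.0 if dist <= c else c / dist

def nearest(x, centroids):
    """Index of the centroid closest to x in Euclidean distance."""
    return min(range(len(centroids)), key=lambda j: math.dist(x, centroids[j]))

def robust_vq(data, init_centroids, c=2.0, iters=20):
    """Partition data into disjoint regions with Huber-weighted centroid updates."""
    centroids = [list(m) for m in init_centroids]
    for _ in range(iters):
        labels = [nearest(x, centroids) for x in data]
        for j, old in enumerate(centroids):
            members = [x for x, lab in zip(data, labels) if lab == j]
            if not members:
                continue  # keep the centroid of an empty region unchanged
            w = [huber_weight(math.dist(x, old), c) for x in members]
            total = sum(w)
            centroids[j] = [sum(wi * x[k] for x, wi in zip(members, w)) / total
                            for k in range(len(old))]
    return centroids, [nearest(x, centroids) for x in data]

def robust_covariance(points, mean, c=2.0):
    """Weighted covariance of one region; squared Huber weights are an assumption."""
    w = [huber_weight(math.dist(x, mean), c) ** 2 for x in points]
    total = sum(w)
    dim = len(mean)
    cov = [[0.0] * dim for _ in range(dim)]
    for x, wi in zip(points, w):
        d = [x[k] - mean[k] for k in range(dim)]
        for a in range(dim):
            for b in range(dim):
                cov[a][b] += wi * d[a] * d[b] / total
    return cov

def leading_axis(cov, iters=200):
    """Leading eigenvector by power iteration (fails if the start is orthogonal to it)."""
    dim = len(cov)
    v = [1.0 / (k + 1) for k in range(dim)]
    for _ in range(iters):
        u = [sum(cov[a][b] * v[b] for b in range(dim)) for a in range(dim)]
        norm = math.sqrt(sum(t * t for t in u))
        if norm == 0.0:
            break
        v = [t / norm for t in u]
    return v

def project(x, mean, axis):
    """Reduce x to one dimension: coordinate along the region's leading axis."""
    return sum((x[k] - mean[k]) * axis[k] for k in range(len(x)))

# Toy 2-D data (hypothetical): two clusters plus one outlier at (0, 30).
data = [(-2.0, 0.0), (-1.0, 0.1), (0.0, -0.1), (1.0, 0.1), (2.0, 0.0),
        (10.0, 8.0), (10.1, 9.0), (9.9, 10.0), (10.1, 11.0), (10.0, 12.0),
        (0.0, 30.0)]
centroids, labels = robust_vq(data, [(0.0, 0.0), (10.0, 10.0)])
for j, mean in enumerate(centroids):
    region = [x for x, lab in zip(data, labels) if lab == j]
    axis = leading_axis(robust_covariance(region, mean))
    reduced = [project(x, mean, axis) for x in region]
    print(j, [round(m, 2) for m in mean], [round(a, 2) for a in axis])
```

In this toy setup the outlier is assigned to the second region but receives a small weight, so that region's centroid stays close to the bulk of its points, whereas an unweighted mean would be pulled toward the outlier. In a full pipeline, the reduced features of each region would then be used to fit the speaker's GMM.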

Author information

Authors and Affiliations

School of Electronic Engineering, Soongsil University, 1-1, Sangdo-dong, Dongjak-gu, Seoul, 156-743, Korea
Younjeong Lee & Ki Yong Lee
Biometric Engineering Research Center, Soongsil University, 1-1, Sangdo-dong, Dongjak-gu, Seoul, 156-743, Korea
Younjeong Lee & Ki Yong Lee
Department of Internet Broadcasting, Dong-Ah Broadcasting College, Jinchon-ri, Samjuk-myeon, Anseong, Gyeonggi-do, 456-880, Korea
Joohun Lee

Editor information

Editors and Affiliations

Army High Performance Computing Research Center, USA
Vipin Kumar
Department of Computer Science and Engineering, University of Minnesota, MN, 55455, USA
Vipin Kumar
Department of Computer Science, University of Calgary, Calgary, AB, T2N1N4, Canada
Marina L. Gavrilova
Heuchera Technologies Inc., 122 9251-8 Yonge Street, Richmond Hill, ON, Canada, L4C 9T3
Chih Jeng Kenneth Tan
School of Computer Science, The Queen’s University of Belfast, Belfast, BT7 1NN, Northern Ireland, UK
Chih Jeng Kenneth Tan
Département d’informatique et de recherche opérationnelle, Université de Montréal, Montréal, Québec, H3C 3J7, Canada
Pierre L’Ecuyer

About this paper

Cite this paper

Lee, Y., Lee, J., Lee, K.Y. (2003). Efficient Speaker Identification Based on Robust VQ-PCA. In: Kumar, V., Gavrilova, M.L., Tan, C.J.K., L’Ecuyer, P. (eds) Computational Science and Its Applications — ICCSA 2003. ICCSA 2003. Lecture Notes in Computer Science, vol 2668. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44843-8_69

DOI: https://doi.org/10.1007/3-540-44843-8_69
Published: 18 June 2003
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40161-2
Online ISBN: 978-3-540-44843-3
eBook Packages: Springer Book Archive
