Abstract
In this paper, an efficient speaker identification based on robust vector quantization principal component analysis (VQ-PCA) is proposed to solve the problems from outliers and high dimensionality of training feature vectors in speaker identification. Firstly, the proposed method partitions the data space into several disjoint regions by roust VQ based on M-estimation. Secondly, the robust PCA is obtained from the covariance matrix in each region. Finally, our method obtains the Gaussian Mixture model (GMM) for speaker from the transformed feature vectors with reduced dimension by the robust PCA in each region. Compared to the conventional GMM with diagonal covariance matrix, under the same performance, the proposed method gives faster results with less storage and, moreover, shows robust performance to outliers.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
D.A. Reynolds and R.C. Rose, “Robust text-independent speaker identification using Gaussian mixture speaker models,” IEEE Tr. SAP., 3,1, (1995) 72–82.
Sadaoki Furui, “Recent advances in speaker recognition, Pattern Recognition Letters, 18, (1997) 859–872.
L. Liu and J. He, “On the use of orthogonal GMM in speaker recognition,” Proc. ICASSP, (1999) 845–849.
C. Seo, K.Y. Lee and J. Lee, “GMM based on local PCA for speaker identification,” IEEE Electronic Letters, vol.37, no. 24, (2001) 1486–1488.
Ariki, Y. & Tagashira, S. & Nishijima, M. (1996). Speaker recognition and speaker normalization by projection to speaker subspace, International Conference on Acoustics, Speech, and Signal Processing 96, 319–322
Croux, C. & Haesbroeck, G. (2000). Principal component analysis based on robust estimators of the covariance or correlation matrix: influence functions and efficiencies, Biometrika 87,3, 603–618
Gersho, A. & Gray, R.M. Vector quantization and signal compression, Kluwer Academic
Huber, P. (1981). Robust Statistics, New York: Wiley.
Kambhatla, N. & Leen, T.K. (1997). Dimension reduction by local PCA, Neural Computation 9, 1493–1503.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lee, Y., Lee, J., Lee, K.Y. (2003). Efficient Speaker Identification Based on Robust VQ-PCA. In: Kumar, V., Gavrilova, M.L., Tan, C.J.K., L’Ecuyer, P. (eds) Computational Science and Its Applications — ICCSA 2003. ICCSA 2003. Lecture Notes in Computer Science, vol 2668. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44843-8_69
Download citation
DOI: https://doi.org/10.1007/3-540-44843-8_69
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40161-2
Online ISBN: 978-3-540-44843-3
eBook Packages: Springer Book Archive