Abstract
In this paper, we propose the principal component analysis (PCA) fuzzy mixture model for speaker identification. The PCA fuzzy mixture model combines PCA with the fuzzy version of the mixture model with diagonal covariance matrices. In this method, the feature vectors are first transformed by each speaker’s PCA transformation matrix to reduce the correlation among their elements. The fuzzy mixture model for each speaker is then obtained from these transformed, reduced-dimension feature vectors. The orthogonal Gaussian mixture model (GMM) can be derived as a special case of the PCA fuzzy mixture model. In our experiments, with the same number of mixtures, the proposed method requires less training time and less storage than the conventional GMM and achieves a higher speaker identification rate. Its identification performance is also equal to or better than that of the orthogonal GMM.
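The PCA step described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the helper names (`fit_pca`, `pca_transform`, `max_abs_offdiag`), the synthetic 12-dimensional frames, and the choice of 8 retained components are all assumptions made for illustration, and the fuzzy mixture training itself is not shown because the abstract does not give its update equations. The sketch only shows why a speaker-specific PCA transform makes a diagonal-covariance model a reasonable fit: after projection, the sample covariance of that speaker's features is diagonal up to rounding error.

```python
import numpy as np


def fit_pca(features, n_components):
    """Estimate a PCA transform from one speaker's feature vectors.

    features: array of shape (n_frames, n_dims).
    Returns the feature mean and a (n_dims, n_components) basis whose
    columns are the leading eigenvectors of the sample covariance.
    """
    mean = features.mean(axis=0)
    cov = np.cov(features, rowvar=False)
    eigvals, eigvecs = np.linalg.eigh(cov)  # eigenvalues in ascending order
    order = np.argsort(eigvals)[::-1][:n_components]
    return mean, eigvecs[:, order]


def pca_transform(features, mean, basis):
    """Center the features and project them onto the PCA basis."""
    return (features - mean) @ basis


def max_abs_offdiag(cov):
    """Largest absolute off-diagonal entry of a covariance matrix."""
    return np.max(np.abs(cov - np.diag(np.diag(cov))))


# Synthetic, deliberately correlated "speaker" frames (hypothetical data).
rng = np.random.default_rng(0)
mixing = rng.normal(size=(12, 12))
speaker_frames = rng.normal(size=(500, 12)) @ mixing

# Speaker-specific transform with reduced dimension (8 is an arbitrary choice).
mean, basis = fit_pca(speaker_frames, n_components=8)
reduced = pca_transform(speaker_frames, mean, basis)

raw_offdiag = max_abs_offdiag(np.cov(speaker_frames, rowvar=False))
pca_offdiag = max_abs_offdiag(np.cov(reduced, rowvar=False))
print("max |off-diagonal| before PCA:", raw_offdiag)
print("max |off-diagonal| after PCA: ", pca_offdiag)
```

In a full system, one such transform would presumably be estimated per enrolled speaker, and that speaker's mixture model would be trained on the projected vectors.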
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lee, Y., Lee, J., Lee, K.Y. (2003). PCA Fuzzy Mixture Model for Speaker Identification. In: Liu, J., Cheung, Ym., Yin, H. (eds) Intelligent Data Engineering and Automated Learning. IDEAL 2003. Lecture Notes in Computer Science, vol 2690. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-45080-1_140
DOI: https://doi.org/10.1007/978-3-540-45080-1_140
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40550-4
Online ISBN: 978-3-540-45080-1
eBook Packages: Springer Book Archive