Abstract
We have been building a text-independent speaker recognition system in noisy conditions. In this paper, we propose a novel feature using hybrid PCA/LDA. The feature is created from the convectional MFCC(mel-frequency cepstral coefficients) by transforming them using a matrix. The matrix consists of some components from the PCA and LDA transformation matrices. We tested the new feature using Aurora project Database 2 which is intended for the evaluation of algorithms for front-end feature extraction algorithms in background noise. The proposed method outperformed in all noise types and noise levels. It reduced the relative recognition error by 63.6% than using the baseline feature when the SNR is 15dB.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Campbell, J.P.: Speaker Recognition: A Tutorial. Proceedings of the IEEE 85(9), 1437–1462 (1997)
Acero, A.: Acoustical and Environmental Robustness in Automatic Speech Recognition. Kluwer Academic Publishers, Boston (1993)
Huang, X., Acero, A., Hon, H.: Spoken Language Processing, A Guide to Theory, Algorithm, and System Development. Prentice-Hall, Englewood Cliffs (2001)
Tsai, S.-N., Lee, L.-S.: Improved Robust Features for Speech Recognition by Integrating Time-Frequency Principal Components (TFPC) and Histogram Equalization (HEQ). In: IEEE Workshop on Automatic Speech Recognition and Understanding, pp. 297–302 (2003)
Wanfeng, Z., Yingchun, Y., Zhaohui, W., Lifeng, S.: Experimental Evaluation of a New Speaker Identification Framework using PCA. In: IEEE International Conference on Systems, Man and Cybernetics, vol. 5, pp. 4147–4152 (2003)
Ding, P., Liming, Z.: Speaker Recognition using Principal Component Analysis. In: Proceedings of ICONIP 2001, 8th International Conference on Neural Information Processing, Shanghai (2001)
Jin, Q., Waibel, A.: Application of LDA to Speaker Recognition. In: International Conference on Speech and Language Processing, October 2000, Beijing, China (2000)
Openshaw, J.P., Sun, Z.P., Mason, J.S.: A comparison of composite features under degraded speech in speaker recognition. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 1993, April 1993, vol. 2, pp. 371–374 (1993)
Su, H.-T., Feng, D.-D., Wang, X.-Y., Zhao, R.-C.: Face Recognition Using Hybrid Feature. In: Proceedings of the Second International Conference on Machine Learning and Cybernetics, Xi’an, November 2003, pp. 3045–3049 (2003)
Zhao, W., Chellappa, R., Krishnaswamy, A.: Discriminant analysis of principal components for face recognition. In: Proceedings of the Third IEEE International Conference on Automatic Face and Gesture Recognition, April 1998, pp. 336–341 (1998)
Duda, R.O., Hart, P.E., Stork, D.G.: Pattern Classification, 2nd edn. Wiley-Interscience, Chichester (2000)
Reynolds, D.A., Rose, R.C.: Robust Text-Independent Speaker Identification Using Gaussian Mixture Speaker Models. IEEE Transactions on Speech Audio Processing 3(1), 72–83 (1995)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kim, MS., Yu, HJ., Kwak, KC., Chi, SY. (2006). Robust Text-Independent Speaker Identification Using Hybrid PCA&LDA. In: Gelbukh, A., Reyes-Garcia, C.A. (eds) MICAI 2006: Advances in Artificial Intelligence. MICAI 2006. Lecture Notes in Computer Science(), vol 4293. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11925231_102
Download citation
DOI: https://doi.org/10.1007/11925231_102
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-49026-5
Online ISBN: 978-3-540-49058-6
eBook Packages: Computer ScienceComputer Science (R0)