Robust Text-Independent Speaker Identification Using Hybrid PCA&LDA

Kim, Min-Seok; Yu, Ha-Jin; Kwak, Keun-Chang; Chi, Su-Young

doi:10.1007/11925231_102

Min-Seok Kim²⁰,
Ha-Jin Yu²⁰,
Keun-Chang Kwak²¹ &
…
Su-Young Chi²¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4293))

Included in the following conference series:

Mexican International Conference on Artificial Intelligence

960 Accesses
2 Citations

Abstract

We have been building a text-independent speaker recognition system in noisy conditions. In this paper, we propose a novel feature using hybrid PCA/LDA. The feature is created from the convectional MFCC(mel-frequency cepstral coefficients) by transforming them using a matrix. The matrix consists of some components from the PCA and LDA transformation matrices. We tested the new feature using Aurora project Database 2 which is intended for the evaluation of algorithms for front-end feature extraction algorithms in background noise. The proposed method outperformed in all noise types and noise levels. It reduced the relative recognition error by 63.6% than using the baseline feature when the SNR is 15dB.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 189.00; Price excludes VAT (USA)

Softcover Book: USD 239.00; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Campbell, J.P.: Speaker Recognition: A Tutorial. Proceedings of the IEEE 85(9), 1437–1462 (1997)
Article Google Scholar
Acero, A.: Acoustical and Environmental Robustness in Automatic Speech Recognition. Kluwer Academic Publishers, Boston (1993)
Google Scholar
Huang, X., Acero, A., Hon, H.: Spoken Language Processing, A Guide to Theory, Algorithm, and System Development. Prentice-Hall, Englewood Cliffs (2001)
Google Scholar
Tsai, S.-N., Lee, L.-S.: Improved Robust Features for Speech Recognition by Integrating Time-Frequency Principal Components (TFPC) and Histogram Equalization (HEQ). In: IEEE Workshop on Automatic Speech Recognition and Understanding, pp. 297–302 (2003)
Google Scholar
Wanfeng, Z., Yingchun, Y., Zhaohui, W., Lifeng, S.: Experimental Evaluation of a New Speaker Identification Framework using PCA. In: IEEE International Conference on Systems, Man and Cybernetics, vol. 5, pp. 4147–4152 (2003)
Google Scholar
Ding, P., Liming, Z.: Speaker Recognition using Principal Component Analysis. In: Proceedings of ICONIP 2001, 8th International Conference on Neural Information Processing, Shanghai (2001)
Google Scholar
Jin, Q., Waibel, A.: Application of LDA to Speaker Recognition. In: International Conference on Speech and Language Processing, October 2000, Beijing, China (2000)
Google Scholar
Openshaw, J.P., Sun, Z.P., Mason, J.S.: A comparison of composite features under degraded speech in speaker recognition. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 1993, April 1993, vol. 2, pp. 371–374 (1993)
Google Scholar
Su, H.-T., Feng, D.-D., Wang, X.-Y., Zhao, R.-C.: Face Recognition Using Hybrid Feature. In: Proceedings of the Second International Conference on Machine Learning and Cybernetics, Xi’an, November 2003, pp. 3045–3049 (2003)
Google Scholar
Zhao, W., Chellappa, R., Krishnaswamy, A.: Discriminant analysis of principal components for face recognition. In: Proceedings of the Third IEEE International Conference on Automatic Face and Gesture Recognition, April 1998, pp. 336–341 (1998)
Google Scholar
Duda, R.O., Hart, P.E., Stork, D.G.: Pattern Classification, 2nd edn. Wiley-Interscience, Chichester (2000)
Google Scholar
Reynolds, D.A., Rose, R.C.: Robust Text-Independent Speaker Identification Using Gaussian Mixture Speaker Models. IEEE Transactions on Speech Audio Processing 3(1), 72–83 (1995)
Article Google Scholar

Download references

Author information

Authors and Affiliations

School of Computer Science, University of Seoul, Dongdaemungu, Seoul, 130-743, Korea
Min-Seok Kim & Ha-Jin Yu
Human-Robot Interaction Research Team, Intelligent Robot Research Division, Electronics and Telecommunication Research Institute (ETRI), 305-700, Korea
Keun-Chang Kwak & Su-Young Chi

Authors

Min-Seok Kim
View author publications
You can also search for this author in PubMed Google Scholar
Ha-Jin Yu
View author publications
You can also search for this author in PubMed Google Scholar
Keun-Chang Kwak
View author publications
You can also search for this author in PubMed Google Scholar
Su-Young Chi
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Center for Computing Research, National Polytechnic Institute, 07738, Mexico City, México
Alexander Gelbukh
Instituto Nacional de Astrofísica, Óptica y Electrónica (INAOE), Luis Enrique Erro No. 1, Sta. Ma. Tonanzintla, 72840, Puebla, México
Carlos Alberto Reyes-Garcia

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kim, MS., Yu, HJ., Kwak, KC., Chi, SY. (2006). Robust Text-Independent Speaker Identification Using Hybrid PCA&LDA. In: Gelbukh, A., Reyes-Garcia, C.A. (eds) MICAI 2006: Advances in Artificial Intelligence. MICAI 2006. Lecture Notes in Computer Science(), vol 4293. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11925231_102

Download citation

DOI: https://doi.org/10.1007/11925231_102
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-49026-5
Online ISBN: 978-3-540-49058-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics