A New Text-Independent Speaker Identification Using Vector Quantization and Multi-layer Perceptron

Keum, Ji-Soo; Park, Chan-Ho; Lee, Hyon-Soo

doi:10.1007/11760023_25

Ji-Soo Keum²¹,
Chan-Ho Park²² &
Hyon-Soo Lee²¹

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 3972))

Included in the following conference series:

International Symposium on Neural Networks

87 Accesses
1 Citations

Abstract

In this paper, we propose a new text-independent speaker identification method using VQ and MLP. It consists of three parts: a new spectral peak analysis based feature extraction, speaker clustering and model selection using VQ, and MLP based speaker identification. The feature vector reflects the speaker specific characteristics and has a long-term feature for which makes it text-independent. The proposed method has a computational efficient for feature extraction and identification. To evaluate the proposed method, we calculated the correct identification ratio (CIR), the average CIR of the proposed and GMM method was 92.27% and 85.78% for 5 seconds segments in 15-speaker identification. Experimental results, we have achieved a performance comparable to GMM-method.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Closed-Set Text-Independent Automatic Speaker Recognition System Using VQ/GMM

Supervised and Unsupervised Data Mining Techniques for Speaker Verification Using Prosodic + Spectral Features

Text-dependent speaker verification using classical LBG, adaptive LBG and FCM vector quantization

Article 31 May 2016

References

Joseph, P., Campbell, J.R.: Speaker Recognition A Tutorial. Proceeding of The IEEE 85(9), 1437–1462 (1997)
Article Google Scholar
Sadaoki, F.: Recent Advances in Speaker Recognition. Pattern Recognition Letter 18, 859–872 (1997)
Article Google Scholar
Herbert, G., Michael, S.: Text-independent Speaker Identification. IEEE Signal Processing Magazine, 18–32 (1994)
Google Scholar
Reynolds, D.A., Rose, R.C.: Robust Text-independent Speaker Identification using Gaussian Mixture Speaker Models. IEEE Trans. on Speech and Audio Processing 3(1), 72–83 (1995)
Article Google Scholar
Narayanaswamy, B., Gangadharaiah, R.: Extracting Additional Information from Gaussian Mixture Model Probabilities for Improved Text-Independent Speaker Identification. In: IEEE International Conference on Acoustics Speech and Signal Processing, vol. 1, pp. 621–624 (2005)
Google Scholar
Farrell, K.R., Mammone, R.J., Assaleh, K.T.: Speaker Recognition using Neural Networks and Conventional Classifiers. IEEE Trans. on Speech and Audio Processing 2(1), 194–205 (1994)
Article Google Scholar
Hiroaki, H.: Text-Independent Speaker Recognition using Neural Networks. IEICE Trans. INF. & SYST. E76-D(3), 345–351 (1993)
Google Scholar
Lu, L., Zhang, H.J., Jiang, H.: Content Analysis for Audio Classification and Segmentation. IEEE Trans. on Speech and Audio Processing 10(7), 504–516 (2002)
Article Google Scholar
Zhang, T., Kuo, J.: Audio Content Analysis for Online Audiovisual Data Segmentation and Classification. IEEE Trans. on Speech and Audio Processing 9(4), 441–457 (2001)
Article Google Scholar
Keum, J.S., Lee, H.S.: Speaker Change Detection Based on Spectral Peak Track Analysis for Korean Broadcast News. In: The Fifth International Conference on Information Communications and Signal Processing, pp. 724–728 (2005)
Google Scholar
Mohamed, Q.: Vector Quantization, http://www.geocities.com/mohamedqasem/vectorquantization/vq.html
Laurene, F.: Fundamentals of Neural Networks. Prentice Hall, Englewood Cliffs (1994)
MATH Google Scholar

Download references

Author information

Authors and Affiliations

Dept. of Computer Engineering, Kyung Hee University, 1, Seocheon-dong, Giheung-gu, Yongin-si, Gyeonggi-do, Korea
Ji-Soo Keum & Hyon-Soo Lee
Dept. of Internet Information Science, Bucheon College, 424, Simgok-dong, Wonmi-gu, Bucheon-si, Gyeonggi-do, Korea
Chan-Ho Park

Authors

Ji-Soo Keum
View author publications
You can also search for this author in PubMed Google Scholar
Chan-Ho Park
View author publications
You can also search for this author in PubMed Google Scholar
Hyon-Soo Lee
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Mechanical and Automation Engineering, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong, China
Jun Wang
Computational Intelligence Laboratory, School of Computer Science and Engineering, University of Electronic Science and Technology of China, 610054, Chengdu, P.R. China
Zhang Yi
Department of Electrical Engineering, University of Louisville, 40292, Louisville, KY, U.S.A
Jacek M. Zurada
Laboratory for Computational Biology, Shanghai Center for Systems Biomedicine, 800 Dong Chuan Rd., 200240, Shanghai, China
Bao-Liang Lu
School of Electrical and Electronic Engineering, University of Manchester, UK
Hujun Yin

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Keum, JS., Park, CH., Lee, HS. (2006). A New Text-Independent Speaker Identification Using Vector Quantization and Multi-layer Perceptron. In: Wang, J., Yi, Z., Zurada, J.M., Lu, BL., Yin, H. (eds) Advances in Neural Networks - ISNN 2006. ISNN 2006. Lecture Notes in Computer Science, vol 3972. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11760023_25

Download citation

DOI: https://doi.org/10.1007/11760023_25
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-34437-7
Online ISBN: 978-3-540-34438-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

A New Text-Independent Speaker Identification Using Vector Quantization and Multi-layer Perceptron

Abstract

Access this chapter

Preview

Similar content being viewed by others

Closed-Set Text-Independent Automatic Speaker Recognition System Using VQ/GMM

Supervised and Unsupervised Data Mining Techniques for Speaker Verification Using Prosodic + Spectral Features

Text-dependent speaker verification using classical LBG, adaptive LBG and FCM vector quantization

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

A New Text-Independent Speaker Identification Using Vector Quantization and Multi-layer Perceptron

Abstract

Access this chapter

Preview

Similar content being viewed by others

Closed-Set Text-Independent Automatic Speaker Recognition System Using VQ/GMM

Supervised and Unsupervised Data Mining Techniques for Speaker Verification Using Prosodic + Spectral Features

Text-dependent speaker verification using classical LBG, adaptive LBG and FCM vector quantization

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation