The Clustering Solution of Speech Recognition Models with SOM

Du, Xiu-Ping; He, Pi-Lian

doi:10.1007/11760023_23

Xiu-Ping Du²¹ &
Pi-Lian He²¹

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 3972))

Included in the following conference series:

International Symposium on Neural Networks

89 Accesses
3 Citations

Abstract

This paper first introduces the system requirement and the system flow of the auto-plotting system. As the data points needed by the auto-plotting system coming from the remote speech signals, to reach high recognition accuracy, the Hidden Markov Model (HMM) approach was chosen as the speech recognition approach. Then the paper is detailed on the speaker dependent (SD), speaker independent (SI) and speaker adaptive (SA) speech recognition methods. We proposed the n-speech models SD system as the recognition system to gain the highest recognition performance in varying speech environments. However the system required that searching for the optimal model from the database should finish in 5 minutes, so the paper finally describes how the Self-Organizing Map (SOM) was used to pre clustering to the n-speech models, to decrease the time for speech recognition and results evaluation and decrease matching time, Experiments show the n-speech models SD system can select the best-matching model in the limited time and improve the average speech recognition accuracy to 97.2. It ideally suits the system requirements.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Veeravalli, A.G., Pan, W.D., Adhami, R., Cox, P.G.: A Tutorial on Using Hidden Markov Models for Phoneme Recognition. In: SSST 2005 (eds.): Proceedings of the Thirty-Seventh Southeastern Symposium on System Theory, pp. 154–157. IEEE, Tuskegee (2005)
Chapter Google Scholar
Legetter, C.J., Woodland, P.C.: Maximum Likelihood Linear Regression for Speaker Adaptation of Continuous Density Hidden Markov Models. Computer Speech and Language 9(2), 171–186 (1995)
Article Google Scholar
Gauvain, J.L., Lee, C.H.: Maximum a Posteriori Estimation for Multivariate Gaussian Mixture Observations of Markov Chains. IEEE Trans. Speech and Audio Processing 2(2), 291–298 (1994)
Article Google Scholar
Nishida, S., Ishii, K., Ura, T.: Adaptive Learning to Environment Using Self-Organizing Map and Its Application for Underwater Vehicles. In: UT 2004 (ed.) Proceedings of the 2004 International Symposium on Underwater Technology UT 2004, pp. 223–228. IEEE, Taipei (2004)
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

School of Electronic & Information Engineering, Tianjin University, 300072, Tianjin, China
Xiu-Ping Du & Pi-Lian He

Authors

Xiu-Ping Du
View author publications
You can also search for this author in PubMed Google Scholar
Pi-Lian He
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Mechanical and Automation Engineering, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong, China
Jun Wang
Computational Intelligence Laboratory, School of Computer Science and Engineering, University of Electronic Science and Technology of China, 610054, Chengdu, P.R. China
Zhang Yi
Department of Electrical Engineering, University of Louisville, 40292, Louisville, KY, U.S.A
Jacek M. Zurada
Laboratory for Computational Biology, Shanghai Center for Systems Biomedicine, 800 Dong Chuan Rd., 200240, Shanghai, China
Bao-Liang Lu
School of Electrical and Electronic Engineering, University of Manchester, UK
Hujun Yin

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Du, XP., He, PL. (2006). The Clustering Solution of Speech Recognition Models with SOM. In: Wang, J., Yi, Z., Zurada, J.M., Lu, BL., Yin, H. (eds) Advances in Neural Networks - ISNN 2006. ISNN 2006. Lecture Notes in Computer Science, vol 3972. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11760023_23

Download citation

DOI: https://doi.org/10.1007/11760023_23
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-34437-7
Online ISBN: 978-3-540-34438-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics