Abstract
This paper first introduces the system requirement and the system flow of the auto-plotting system. As the data points needed by the auto-plotting system coming from the remote speech signals, to reach high recognition accuracy, the Hidden Markov Model (HMM) approach was chosen as the speech recognition approach. Then the paper is detailed on the speaker dependent (SD), speaker independent (SI) and speaker adaptive (SA) speech recognition methods. We proposed the n-speech models SD system as the recognition system to gain the highest recognition performance in varying speech environments. However the system required that searching for the optimal model from the database should finish in 5 minutes, so the paper finally describes how the Self-Organizing Map (SOM) was used to pre clustering to the n-speech models, to decrease the time for speech recognition and results evaluation and decrease matching time, Experiments show the n-speech models SD system can select the best-matching model in the limited time and improve the average speech recognition accuracy to 97.2. It ideally suits the system requirements.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Veeravalli, A.G., Pan, W.D., Adhami, R., Cox, P.G.: A Tutorial on Using Hidden Markov Models for Phoneme Recognition. In: SSST 2005 (eds.): Proceedings of the Thirty-Seventh Southeastern Symposium on System Theory, pp. 154–157. IEEE, Tuskegee (2005)
Legetter, C.J., Woodland, P.C.: Maximum Likelihood Linear Regression for Speaker Adaptation of Continuous Density Hidden Markov Models. Computer Speech and Language 9(2), 171–186 (1995)
Gauvain, J.L., Lee, C.H.: Maximum a Posteriori Estimation for Multivariate Gaussian Mixture Observations of Markov Chains. IEEE Trans. Speech and Audio Processing 2(2), 291–298 (1994)
Nishida, S., Ishii, K., Ura, T.: Adaptive Learning to Environment Using Self-Organizing Map and Its Application for Underwater Vehicles. In: UT 2004 (ed.) Proceedings of the 2004 International Symposium on Underwater Technology UT 2004, pp. 223–228. IEEE, Taipei (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Du, XP., He, PL. (2006). The Clustering Solution of Speech Recognition Models with SOM. In: Wang, J., Yi, Z., Zurada, J.M., Lu, BL., Yin, H. (eds) Advances in Neural Networks - ISNN 2006. ISNN 2006. Lecture Notes in Computer Science, vol 3972. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11760023_23
Download citation
DOI: https://doi.org/10.1007/11760023_23
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-34437-7
Online ISBN: 978-3-540-34438-4
eBook Packages: Computer ScienceComputer Science (R0)