Skip to main content

The Clustering Solution of Speech Recognition Models with SOM

  • Conference paper
Advances in Neural Networks - ISNN 2006 (ISNN 2006)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 3972))

Included in the following conference series:

Abstract

This paper first introduces the system requirement and the system flow of the auto-plotting system. As the data points needed by the auto-plotting system coming from the remote speech signals, to reach high recognition accuracy, the Hidden Markov Model (HMM) approach was chosen as the speech recognition approach. Then the paper is detailed on the speaker dependent (SD), speaker independent (SI) and speaker adaptive (SA) speech recognition methods. We proposed the n-speech models SD system as the recognition system to gain the highest recognition performance in varying speech environments. However the system required that searching for the optimal model from the database should finish in 5 minutes, so the paper finally describes how the Self-Organizing Map (SOM) was used to pre clustering to the n-speech models, to decrease the time for speech recognition and results evaluation and decrease matching time, Experiments show the n-speech models SD system can select the best-matching model in the limited time and improve the average speech recognition accuracy to 97.2. It ideally suits the system requirements.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • Veeravalli, A.G., Pan, W.D., Adhami, R., Cox, P.G.: A Tutorial on Using Hidden Markov Models for Phoneme Recognition. In: SSST 2005 (eds.): Proceedings of the Thirty-Seventh Southeastern Symposium on System Theory, pp. 154–157. IEEE, Tuskegee (2005)

    Chapter  Google Scholar 

  • Legetter, C.J., Woodland, P.C.: Maximum Likelihood Linear Regression for Speaker Adaptation of Continuous Density Hidden Markov Models. Computer Speech and Language 9(2), 171–186 (1995)

    Article  Google Scholar 

  • Gauvain, J.L., Lee, C.H.: Maximum a Posteriori Estimation for Multivariate Gaussian Mixture Observations of Markov Chains. IEEE Trans. Speech and Audio Processing 2(2), 291–298 (1994)

    Article  Google Scholar 

  • Nishida, S., Ishii, K., Ura, T.: Adaptive Learning to Environment Using Self-Organizing Map and Its Application for Underwater Vehicles. In: UT 2004 (ed.) Proceedings of the 2004 International Symposium on Underwater Technology UT 2004, pp. 223–228. IEEE, Taipei (2004)

    Chapter  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Du, XP., He, PL. (2006). The Clustering Solution of Speech Recognition Models with SOM. In: Wang, J., Yi, Z., Zurada, J.M., Lu, BL., Yin, H. (eds) Advances in Neural Networks - ISNN 2006. ISNN 2006. Lecture Notes in Computer Science, vol 3972. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11760023_23

Download citation

  • DOI: https://doi.org/10.1007/11760023_23

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-34437-7

  • Online ISBN: 978-3-540-34438-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics