Skip to main content

Language Model Comparison for Ukrainian Real-Time Speech Recognition System

  • Conference paper
Book cover Speech and Computer (SPECOM 2013)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8113))

Included in the following conference series:

  • 1176 Accesses

Abstract

This paper describes a real-time speech recognition system for Ukrainian designed basically for text dictation purpose targeting moderate computation requirements. The research is focused on language model parameter estimation. As a Slavonic language Ukrainian is highly inflective and tolerates relatively free word order. These features motivate transition from word- to class-based statistical language model. According to our experimental research, class-based LMs occupy less space and potentially outperform a 3-gram word-based model. We also describe several tools developed to visualize HMMs, to predict word stress, and to manage cluster-based language modeling.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Vintsiuk, T., Sazhok, M.: Multi-Level Multi-Decision Models for ASR. In: Proc. SpeCom 2005, Patras, pp. 69–76 (2005)

    Google Scholar 

  2. Gales, M., Young, S.: The Application of Hidden Markov Models in Speech Recognition. Foundations and Trends in Signal Processing 1(3), 195–304 (2007)

    Article  MATH  Google Scholar 

  3. Robeiko, V., Sazhok, M.: Real-time spontaneous Ukrainian speech recognition system based on word acoustic composite models. In: Proc. UkrObraz 2012, Kyiv, pp. 77–81 (2012)

    Google Scholar 

  4. Lee, A., Kawahara, T.: Recent Development of Open-Source Speech Recognition Engine Julius. In: APSIPA ASC, pp. 131–137 (2009)

    Google Scholar 

  5. Young, S.J., et al.: The HTK Book Version 3.4. Cambridge University (2006)

    Google Scholar 

  6. Pylypenko, V., Robeiko, V., Sazhok, M., Vasylieva, N., Radoutsky, O.: Ukrainian Broadcast Speech Corpus Development. In: Proc. Specom 2011, Kazan, RF, pp. 244–247 (2011)

    Google Scholar 

  7. Robeiko, V., Sazhok, M.: Bidirectional Text-To-Pronunciation Conversion with Word Stress Prediction for Ukrainian. In: Proc. UkrObraz 2012, Kyiv, pp. 43–46 (2012)

    Google Scholar 

  8. Hsu, B.-J(P.), Glass, J.: Iterative Language Model Estimation: Efficient Data Structure and Algorithms. In: Proc. Interspeech (2008)

    Google Scholar 

  9. Martin, S., Liermann, J., Ney, H.: Algorithms for bigram and trigram word clustering. In: Proc. Eurospeech, Madrid, vol. 2, pp. 1253–1256 (1995)

    Google Scholar 

  10. http://lcorp.ulif.org.ua/dictua/

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer International Publishing Switzerland

About this paper

Cite this paper

Sazhok, M., Robeiko, V. (2013). Language Model Comparison for Ukrainian Real-Time Speech Recognition System. In: Železný, M., Habernal, I., Ronzhin, A. (eds) Speech and Computer. SPECOM 2013. Lecture Notes in Computer Science(), vol 8113. Springer, Cham. https://doi.org/10.1007/978-3-319-01931-4_28

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-01931-4_28

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-01930-7

  • Online ISBN: 978-3-319-01931-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics