Language Model Comparison for Ukrainian Real-Time Speech Recognition System

Sazhok, Mykola; Robeiko, Valentyna

doi:10.1007/978-3-319-01931-4_28

Mykola Sazhok^22,23 &
Valentyna Robeiko²³

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8113))

Included in the following conference series:

International Conference on Speech and Computer

1176 Accesses

Abstract

This paper describes a real-time speech recognition system for Ukrainian designed basically for text dictation purpose targeting moderate computation requirements. The research is focused on language model parameter estimation. As a Slavonic language Ukrainian is highly inflective and tolerates relatively free word order. These features motivate transition from word- to class-based statistical language model. According to our experimental research, class-based LMs occupy less space and potentially outperform a 3-gram word-based model. We also describe several tools developed to visualize HMMs, to predict word stress, and to manage cluster-based language modeling.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Vintsiuk, T., Sazhok, M.: Multi-Level Multi-Decision Models for ASR. In: Proc. SpeCom 2005, Patras, pp. 69–76 (2005)
Google Scholar
Gales, M., Young, S.: The Application of Hidden Markov Models in Speech Recognition. Foundations and Trends in Signal Processing 1(3), 195–304 (2007)
Article MATH Google Scholar
Robeiko, V., Sazhok, M.: Real-time spontaneous Ukrainian speech recognition system based on word acoustic composite models. In: Proc. UkrObraz 2012, Kyiv, pp. 77–81 (2012)
Google Scholar
Lee, A., Kawahara, T.: Recent Development of Open-Source Speech Recognition Engine Julius. In: APSIPA ASC, pp. 131–137 (2009)
Google Scholar
Young, S.J., et al.: The HTK Book Version 3.4. Cambridge University (2006)
Google Scholar
Pylypenko, V., Robeiko, V., Sazhok, M., Vasylieva, N., Radoutsky, O.: Ukrainian Broadcast Speech Corpus Development. In: Proc. Specom 2011, Kazan, RF, pp. 244–247 (2011)
Google Scholar
Robeiko, V., Sazhok, M.: Bidirectional Text-To-Pronunciation Conversion with Word Stress Prediction for Ukrainian. In: Proc. UkrObraz 2012, Kyiv, pp. 43–46 (2012)
Google Scholar
Hsu, B.-J(P.), Glass, J.: Iterative Language Model Estimation: Efficient Data Structure and Algorithms. In: Proc. Interspeech (2008)
Google Scholar
Martin, S., Liermann, J., Ney, H.: Algorithms for bigram and trigram word clustering. In: Proc. Eurospeech, Madrid, vol. 2, pp. 1253–1256 (1995)
Google Scholar
http://lcorp.ulif.org.ua/dictua/

Download references

Author information

Authors and Affiliations

Hlushkov Institute of Cybernetics, Kyiv, Ukraine
Mykola Sazhok
International Research/Training Center for Information Technology and Systems, Kyiv, Ukraine
Mykola Sazhok & Valentyna Robeiko

Authors

Mykola Sazhok
View author publications
You can also search for this author in PubMed Google Scholar
Valentyna Robeiko
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Faculty of Applied Sciences, Department of Cybernetics, University of West Bohemia, Univerzitní 8, 306 14, Plzeň, Czech Republic
Miloš Železný
University of West Bohemia, 306 14, Pilsen, Czech Republic
Ivan Habernal
Speech and Multimodal Interfaces Laboratory, St. Petersburg Institute of Informatics and Automation for the Russian Academy of Sciences, 14-th line, 39, 199178, St. Petersburg, Russia
Andrey Ronzhin

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Sazhok, M., Robeiko, V. (2013). Language Model Comparison for Ukrainian Real-Time Speech Recognition System. In: Železný, M., Habernal, I., Ronzhin, A. (eds) Speech and Computer. SPECOM 2013. Lecture Notes in Computer Science(), vol 8113. Springer, Cham. https://doi.org/10.1007/978-3-319-01931-4_28

Download citation

DOI: https://doi.org/10.1007/978-3-319-01931-4_28
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-01930-7
Online ISBN: 978-3-319-01931-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics