Abstract
This paper describes a real-time speech recognition system for Ukrainian designed basically for text dictation purpose targeting moderate computation requirements. The research is focused on language model parameter estimation. As a Slavonic language Ukrainian is highly inflective and tolerates relatively free word order. These features motivate transition from word- to class-based statistical language model. According to our experimental research, class-based LMs occupy less space and potentially outperform a 3-gram word-based model. We also describe several tools developed to visualize HMMs, to predict word stress, and to manage cluster-based language modeling.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Vintsiuk, T., Sazhok, M.: Multi-Level Multi-Decision Models for ASR. In: Proc. SpeCom 2005, Patras, pp. 69–76 (2005)
Gales, M., Young, S.: The Application of Hidden Markov Models in Speech Recognition. Foundations and Trends in Signal Processing 1(3), 195–304 (2007)
Robeiko, V., Sazhok, M.: Real-time spontaneous Ukrainian speech recognition system based on word acoustic composite models. In: Proc. UkrObraz 2012, Kyiv, pp. 77–81 (2012)
Lee, A., Kawahara, T.: Recent Development of Open-Source Speech Recognition Engine Julius. In: APSIPA ASC, pp. 131–137 (2009)
Young, S.J., et al.: The HTK Book Version 3.4. Cambridge University (2006)
Pylypenko, V., Robeiko, V., Sazhok, M., Vasylieva, N., Radoutsky, O.: Ukrainian Broadcast Speech Corpus Development. In: Proc. Specom 2011, Kazan, RF, pp. 244–247 (2011)
Robeiko, V., Sazhok, M.: Bidirectional Text-To-Pronunciation Conversion with Word Stress Prediction for Ukrainian. In: Proc. UkrObraz 2012, Kyiv, pp. 43–46 (2012)
Hsu, B.-J(P.), Glass, J.: Iterative Language Model Estimation: Efficient Data Structure and Algorithms. In: Proc. Interspeech (2008)
Martin, S., Liermann, J., Ney, H.: Algorithms for bigram and trigram word clustering. In: Proc. Eurospeech, Madrid, vol. 2, pp. 1253–1256 (1995)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer International Publishing Switzerland
About this paper
Cite this paper
Sazhok, M., Robeiko, V. (2013). Language Model Comparison for Ukrainian Real-Time Speech Recognition System. In: Železný, M., Habernal, I., Ronzhin, A. (eds) Speech and Computer. SPECOM 2013. Lecture Notes in Computer Science(), vol 8113. Springer, Cham. https://doi.org/10.1007/978-3-319-01931-4_28
Download citation
DOI: https://doi.org/10.1007/978-3-319-01931-4_28
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-01930-7
Online ISBN: 978-3-319-01931-4
eBook Packages: Computer ScienceComputer Science (R0)