Comparative Study on Bigram Language Models for Spoken Czech Recognition

Nejedlová, Dana

doi:10.1007/3-540-46154-X_27

Dana Nejedlová³

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2448))

Included in the following conference series:

International Conference on Text, Speech and Dialogue

563 Accesses
2 Citations

Abstract

The article deals with the problem of continuous speech recognition of Czech language. The main goal of this study is to compare various kinds of bigram language models with respect to the accuracy and speed of speech recognition. The main types of bigram language models are described here as well as multiple parameters that affect the performance of a speech recognition system. A comparison with a zerogram model is also made. Different models and various parameter settings are compared by means of the accuracy rate in extensive experiments done with a large test database of 1,600 Czech sentences recorded by 40 speakers.

This work has been supported by the Grant Agency of the Czech Republic (grant no. 102/02/0124) and through research goal project MSM 242200001.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Nouza, J.: A Czech Large Vocabulary Recognition System for Real-Time Applications. In: P. Sojka et al. (Eds.): Proc. of 3^rd International Workshop on Text, Speech, Dialogue, Springer-Verlag, Heidelberg, Germany (2000) 217–222.
Google Scholar
Nouza, J.: Strategies for Developing a Real-Time Continuous Speech Recognition System for Czech Language. In: Sojka P. et al. (Eds.): Text, Speech and Dialogue, Proceedings of the Fifth International Conference, Brno, Czech Republic, September 9–12, 2002, pp. 189–196.
Google Scholar
Witten, I. H. and Bell, T.C.: The Zero-Frequency Problem: Estimating the Probabilities of Novel Events in Adaptive Text Compression. IEEE Transactions on Information Theory, 37(4), (1991) 1085–1094.
Article Google Scholar
Jelinek, F. and Mercer, R. L.: Interpolated Estimation on Markov Source Parameters from Sparse Data. In Gelsema, E. S. and Kanal, L.N. (Eds.), Proceedings, Workshop on Pattern Recognition in Practice. North Holland, Amsterdam (1980) 381–397.
Google Scholar
Katz, S. M.: Estimation of Probabilities from Sparse Data for the Language Model Component of a Speech Recognizer. IEEE Transactions on Acoustics, Speech, and Signal Processing, 35(3), (1987) 400–401.
Article Google Scholar

Download references

Author information

Authors and Affiliations

SpeechLab, Technical University of Liberec, Hálkova 6, 461 17, Liberec, Czech Republic
Dana Nejedlová

Authors

Dana Nejedlová
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Faculty of Informatics Department of Programming Systems and Communication, Masaryk University, Botanická 68a, 602 00, Brno, Czech Republic
Petr Sojka
Faculty of Informatics Department of Information Technologies, Masaryk University, Botanická 68a, 602 00, Brno, Czech Republic
Ivan Kopeček & Karel Pala &

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Nejedlová, D. (2002). Comparative Study on Bigram Language Models for Spoken Czech Recognition. In: Sojka, P., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2002. Lecture Notes in Computer Science(), vol 2448. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-46154-X_27

Download citation

DOI: https://doi.org/10.1007/3-540-46154-X_27
Published: 23 August 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44129-8
Online ISBN: 978-3-540-46154-8
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics