A Large Czech Vocabulary Recognition System for Real-Time Applications

Nouza, Jan

doi:10.1007/3-540-45323-7_37

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 1902)

Included in the following conference series:

International Workshop on Text, Speech and Dialogue

Abstract

In this paper, we propose two methods for speeding up discrete-utterance recognition with vocabularies of several hundred to several thousand words. We show that acceptable accuracy and a short response time can both be achieved if the words are represented by concatenated monophone models (multi-mixture HMMs). In that case, the computational load of the classic Viterbi procedure can be reduced significantly if a proper caching scheme is used. In several experiments with test vocabularies containing hundreds and thousands of Czech words, we demonstrate that the recognition procedure can be sped up by a factor of 50 to 100 without loss of accuracy. The method is well suited to voice-controlled systems with a large branching factor and little syntactic constraint, such as voice portals and telephone directory assistance.
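The abstract does not spell out the caching scheme, so the following is only a minimal sketch of one plausible reading: because every word is a concatenation of the same small set of monophone states, the emission log-likelihood of a given state for a given frame can be computed once and reused by every word that contains that phone. All names here (`EmissionCache`, `viterbi_word`, `recognize`) and the toy one-dimensional models are hypothetical, and transition probabilities are omitted for brevity.

```python
import math

def log_gauss(x, mean, var):
    """Log density of a diagonal-covariance Gaussian."""
    return -0.5 * sum(
        math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v
        for xi, m, v in zip(x, mean, var)
    )

class EmissionCache:
    """Memoizes log b_state(frame_t) per (phone, state, t). Hypothetical helper."""

    def __init__(self, models, frames):
        self.models = models      # phone -> list of states; state = [(weight, mean, var), ...]
        self.frames = frames      # list of feature vectors
        self.table = {}
        self.calls = 0            # how often a score was requested
        self.evaluations = 0      # how often a mixture was actually evaluated

    def score(self, phone, state, t):
        self.calls += 1
        key = (phone, state, t)
        if key not in self.table:
            self.evaluations += 1
            comps = [math.log(w) + log_gauss(self.frames[t], m, v)
                     for w, m, v in self.models[phone][state]]
            top = max(comps)      # log-sum-exp over the mixture components
            self.table[key] = top + math.log(sum(math.exp(c - top) for c in comps))
        return self.table[key]

def viterbi_word(phones, cache):
    """Best-path log score of a left-to-right word model (self-loop or next state)."""
    states = [(p, s) for p in phones for s in range(len(cache.models[p]))]
    neg_inf = float("-inf")
    prev = [neg_inf] * len(states)
    prev[0] = cache.score(states[0][0], states[0][1], 0)
    for t in range(1, len(cache.frames)):
        cur = [neg_inf] * len(states)
        for j, (p, s) in enumerate(states):
            best = prev[j] if j == 0 else max(prev[j], prev[j - 1])
            if best > neg_inf:    # skip states that are not yet reachable
                cur[j] = best + cache.score(p, s, t)
        prev = cur
    return prev[-1]               # the path must end in the last state

def recognize(lexicon, cache):
    """Return the word whose concatenated monophone model scores highest."""
    return max(lexicon, key=lambda w: viterbi_word(lexicon[w], cache))

# Toy data: three single-state, single-mixture "phones" on a 1-D feature.
models = {
    "a": [[(1.0, [0.0], [1.0])]],
    "b": [[(1.0, [1.0], [1.0])]],
    "c": [[(1.0, [5.0], [1.0])]],
}
lexicon = {"ab": ["a", "b"], "ba": ["b", "a"], "abc": ["a", "b", "c"]}
frames = [[0.0], [0.1], [1.0], [0.9]]

cache = EmissionCache(models, frames)
best = recognize(lexicon, cache)
```

In this sketch the number of mixture evaluations is bounded by (number of monophone states) × (number of frames), regardless of vocabulary size, while the number of score requests grows with the lexicon. The toy example is not meant to reproduce the speed-up factors reported in the abstract.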

Author information

Authors and Affiliations

Department of Electronics and Signal Processing, Technical University of Liberec, Hálkova 5, 461 17, Liberec, Czech Republic
Jan Nouza

Editor information

Editors and Affiliations

Faculty of Informatics, Department of Programming Systems and Communication, Masaryk University, Botanická 68a, 602 00, Brno, Czech Republic
Petr Sojka
Faculty of Informatics, Department of Information Technologies, Masaryk University, Botanická 68a, 602 00, Brno, Czech Republic
Ivan Kopeček & Karel Pala

About this paper

Cite this paper

Nouza, J. (2000). A Large Czech Vocabulary Recognition System for Real-Time Applications. In: Sojka, P., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2000. Lecture Notes in Computer Science, vol 1902. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45323-7_37

DOI: https://doi.org/10.1007/3-540-45323-7_37
Published: 15 August 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-41042-3
Online ISBN: 978-3-540-45323-9
eBook Packages: Springer Book Archive
