Abstract
This work is part of the effort to develop a speech recognition system for Brazilian Portuguese. The resources for the training and test stages of this system, such as corpora, pronunciation dictionary, language and acoustic models, are publicly available. Here, an application programming interface is proposed in order to facilitate using the open-source Julius speech decoder. Performance tests are presented, comparing the developed systems with a commercial software.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
http://julius.sourceforge.jp/en/ (Visited in May 2009)
Siravenha, A., Neto, N., Macedo, V., Klautau, A.: Uso de regras fonológicas com determinação de vogal tônica para conversão grafema-fone em português brasileiro. In: 7th International Information and Telecommunication Technologies Symposium (2008)
Silva, P., Neto, N., Klautau, A.: Novos recursos e utilização de adaptação de locutor no desenvolvimento de um sistema de reconhecimento de voz para o português brasileiro. In: XXVII Simpósio Brasileiro de Telecomunicaçães (2009)
Silva, P., Neto, N., Klautau, A., Adami, A., Trancoso, I.: Speech recognition for brazilian portuguese using the spoltech and OGI-22 corpora. In: XXVI Simpósio Brasileiro de Telecomunicações - SBrt 2008 (2008)
Young, S., Ollason, D., Valtchev, V., Woodland, P.: The HTK Book (for HTK Version 3.4). Cambridge University Engineering Department, Cambridge (2006)
Stolcke, A.: SRILM an extensible language modeling toolkit. In: Proc. Intl. Conf. Spoken Language Processing, Denver, Colorado (2002)
http://www.ibm.com/software/speech/ (Visited in September 2009)
Rotovnik, T., Maucec, M.S., Horvat, B., Kacic, Z.: A comparison of HTK, ISIP and julius in slovenian large vocabulary continuous speech recognition. In: 7th International Conference on Spoken Language Processing, ICSLP (2002)
http://www.laps.ufpa.br/falabrasil (Visited in October 2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Silva, P., Batista, P., Neto, N., Klautau, A. (2010). An Open-Source Speech Recognizer for Brazilian Portuguese with a Windows Programming Interface. In: Pardo, T.A.S., Branco, A., Klautau, A., Vieira, R., de Lima, V.L.S. (eds) Computational Processing of the Portuguese Language. PROPOR 2010. Lecture Notes in Computer Science(), vol 6001. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-12320-7_17
Download citation
DOI: https://doi.org/10.1007/978-3-642-12320-7_17
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-12319-1
Online ISBN: 978-3-642-12320-7
eBook Packages: Computer ScienceComputer Science (R0)