Abstract
This paper presents our most recent work on adapting a foreign-language speech recognition engine to the recognition of Lithuanian speech commands. As discussed in our earlier papers, speakers of less widely used languages (such as Lithuanian) have two choices: to develop their own speech recognition engines, or to adapt speech recognition models developed and trained for foreign languages to the recognition of their native spoken language. The second approach can lead to faster deployment of Lithuanian speech recognition modules in practical tasks, but proper adaptation and optimization procedures must first be found and investigated. Here we describe our efforts to improve the recognition of Lithuanian voice commands by using multiple transcriptions per command with an English recognizer.
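The idea of multiple transcriptions per command can be illustrated with a minimal sketch. It assumes a hypothetical lexicon in which each Lithuanian command is written several ways using English phoneme symbols, and a recognizer that returns one score per transcription. The names `LEXICON` and `best_command`, the phoneme strings, and the scores are all illustrative assumptions and are not taken from the chapter.

```python
from typing import Dict, List, Tuple

# Hypothetical lexicon: each Lithuanian command maps to several candidate
# transcriptions built from English (ARPAbet-style) phoneme symbols.
# These strings are illustrative placeholders, not the authors' transcriptions.
LEXICON: Dict[str, List[str]] = {
    "vienas": ["V IY EH N AH S", "V Y EH N AA S"],
    "du": ["D UW", "D UH"],
}


def best_command(
    scores: Dict[str, float], lexicon: Dict[str, List[str]]
) -> Tuple[str, float]:
    """Return the command whose best-scoring transcription scores highest.

    `scores` maps a transcription string to a recognizer score (higher is
    better). Transcriptions that the recognizer did not score are ignored.
    If nothing matches, the result is ("", -inf).
    """
    best: Tuple[str, float] = ("", float("-inf"))
    for command, variants in lexicon.items():
        for variant in variants:
            score = scores.get(variant, float("-inf"))
            if score > best[1]:
                best = (command, score)
    return best


# Assumed recognizer output for one utterance (made-up scores).
example_scores = {"V Y EH N AA S": 0.8, "D UW": 0.6, "V IY EH N AH S": 0.2}
result = best_command(example_scores, LEXICON)
```

In this sketch a command is accepted if any one of its transcriptions matches well, so adding a variant can only help a command that the English acoustic models would otherwise miss. Whether extra variants also increase confusion between commands is an empirical question that the sketch does not address.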
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Maskeliunas, R., Rudzionis, A., Rudzionis, V. (2010). Advances on the Use of the Foreign Language Recognizer. In: Esposito, A., Campbell, N., Vogel, C., Hussain, A., Nijholt, A. (eds) Development of Multimodal Interfaces: Active Listening and Synchrony. Lecture Notes in Computer Science, vol 5967. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-12397-9_18
DOI: https://doi.org/10.1007/978-3-642-12397-9_18
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-12396-2
Online ISBN: 978-3-642-12397-9
eBook Packages: Computer Science, Computer Science (R0)