Abstract
This paper describes our study on solving two basic problems of large vocabulary continuous speech recognition (LVCSR) of Vietnamese, which can be used as a standard reference for Vietnamese researchers and other researchers who are interested in Vietnamese language. First, a standard phoneme set is proposed with its corresponding grapheme-to-phoneme map. This phoneme set is the core to solve other problems related to LVCSR of Vietnamese. Then the creation of standard pronouncing dictionary based on the grapheme-to-phoneme map and the analysis of Vietnamese syllable is also described. Finally, we present the results on LVCSR using different types of pronouncing dictionary, which show some interesting aspects of Vietnamese language such as the structure of Vietnamese syllable and the effect of tone in the relationship with syllable.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Chuong, N.T.: Selection of Sentence Set for Vietnamese Audio-Visual Corpus Design. In: IDAACS 2011, Praha, Czech Republic, pp. 492–495 (2011)
Vu, T.T., Nguyen, D.T., Luong, C.M., Hosom, J.P.: Vietnamese large vocabulary continuous speech recognition. In: INTERSPEECH 2005, pp. 1689–1692 (2005)
Vu, Q., Demuynck, K., Van Compernolle, D.: Vietnamese Automatic Speech Recognition: The FLaVoR Approach. In: Huo, Q., Ma, B., Chng, E.-S., Li, H. (eds.) ISCSLP 2006. LNCS (LNAI), vol. 4274, pp. 464–474. Springer, Heidelberg (2006)
Nguyen, H.Q., Nocera, P., Castelli, E., Trinh, V.L.: Using tone information for Vietnamese continuous speech recognition. In: RIVF 2008, pp. 103–106 (2008)
Nguyen, H.Q., Trinh, V.L., Le, T.D.: Automatic Speech Recognition for Vietnamese Using HTK System. In: RIVF 2010, pp. 1–4 (2010)
Vu, Q., et al.: A Robust Transcription System for Soccer Video Database. In: ICALIP, Shanghai (2010)
Nguyen, T., Vu, Q.: Advances in Acoustic Modeling for Vietnamese LVCSR. In: IALP 2009, pp. 280–284 (2009)
Vu, N.T., Schultz, T.: Vietnamese Large Vocabulary Continuous Speech Recognition. In: ASRU IEEE, Italy, pp. 333–338 (2009)
Vu, N.T., Schultz, T.: Optimization On Vietnamese Large Vocabulary Speech Recognition. In: Workshop on Spoken Languages Technologies for Under-Resourced Languages, SLTU 2010, Penang, Malaysia (May 03, 2010)
Le, V.B., Besacier, L.: Comparison of Acoustic Modeling Techniques for Vietnamese and Khmer ASR. In: ICSLP 2006, Pittsburgh, PA (September 2006)
Nguyen, H.Q., Nocera, P., Castelli, E., Trinh, V.L.: Large vocabulary continuous speech recognition for Vietnamese, an under-resourced language. In: SLTU 2008, Ha Noi, Vietnam, May 5-7 (2008)
Le, V.B., Tran, D.D., Besacier, L., Castelli, E., Serignat, J.-F.: First steps in building a large vocabulary continuous speech recognition system for Vietnamese. In: RIVF 2005, Can Tho, Vietnam (February 2005)
Le, T., Nguyen, H., Vu, Q.: Progress in Transcription of Vietnamese Broadcast News. In: Proc. International Conference on Communications and Electronics, ICCE 2006 (October 2006)
Hoang, P.: Syllable Dictionary. Danang Publisher, Vietnam (1996)
Steve, Y., Odel, J., Ollason, D., Valtchev, V., Woodland, P.: The HTK Book, version 3.2. Cambr. Univ., UK (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Nguyen, T.C., Chaloupka, J. (2013). Phoneme Set and Pronouncing Dictionary Creation for Large Vocabulary Continuous Speech Recognition of Vietnamese. In: Habernal, I., Matoušek, V. (eds) Text, Speech, and Dialogue. TSD 2013. Lecture Notes in Computer Science(), vol 8082. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-40585-3_50
Download citation
DOI: https://doi.org/10.1007/978-3-642-40585-3_50
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-40584-6
Online ISBN: 978-3-642-40585-3
eBook Packages: Computer ScienceComputer Science (R0)