Abstract
This paper presents the implementation of a Hakka text-to-speech (TTS) system. The system follows the same design principles as the previously proposed Mandarin and Min-Nan TTS systems. It uses 671 base-syllables as its basic synthesis units and a recurrent neural network (RNN)-based prosody generator to produce the prosodic parameters needed to synthesize natural-sounding speech. The whole system is implemented in software and runs in real time on a PC. An informal subjective listening test confirmed that the system performs well: the synthetic speech sounded good for well-tokenized texts and fair for automatically tokenized texts.
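To make the idea of an RNN-based prosody generator concrete, the following is a minimal sketch, not the authors' implementation. It assumes a simple Elman-style network with untrained random weights. The four input features per syllable and the four output names (pitch, duration, energy, pause) are hypothetical choices for illustration; the abstract does not specify them.

```python
import math
import random

# Hypothetical output layout; the abstract does not list the actual parameters.
PROSODY_NAMES = ("pitch", "duration", "energy", "pause")


class ElmanProsodyRNN:
    """Toy Elman-style recurrent network with one hidden layer (untrained)."""

    def __init__(self, n_in, n_hidden, n_out, seed=0):
        rng = random.Random(seed)

        def init(rows, cols):
            # Small random weights; a real system would learn these from speech data.
            return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]

        self.w_in = init(n_hidden, n_in)       # input -> hidden
        self.w_rec = init(n_hidden, n_hidden)  # previous hidden -> hidden
        self.w_out = init(n_out, n_hidden)     # hidden -> output
        self.n_hidden = n_hidden

    def forward(self, features):
        """Map a sequence of per-syllable feature vectors to prosodic parameters."""
        h = [0.0] * self.n_hidden
        outputs = []
        for x in features:
            # The hidden state mixes the current syllable with the preceding context.
            h = [
                math.tanh(
                    sum(w * xi for w, xi in zip(self.w_in[j], x))
                    + sum(w * hi for w, hi in zip(self.w_rec[j], h))
                )
                for j in range(self.n_hidden)
            ]
            outputs.append([sum(w * hj for w, hj in zip(row, h)) for row in self.w_out])
        return outputs


# Hypothetical features per syllable:
# [scaled tone id, position in word, scaled word length, phrase-final flag]
syllable_features = [
    [0.2, 0.5, 0.5, 1.0],
    [0.2, 0.5, 0.5, 1.0],
    [0.4, 1.0, 0.5, 1.0],
]

model = ElmanProsodyRNN(n_in=4, n_hidden=8, n_out=len(PROSODY_NAMES))
for params in model.forward(syllable_features):
    print(dict(zip(PROSODY_NAMES, (round(p, 3) for p in params))))
```

The first two syllables have identical features but receive different outputs, because the recurrent state carries context from earlier syllables. This context dependence is the usual motivation for using a recurrent network to predict prosody.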
References
Fu, H.: Automatic Generation of Synthesis Units for Taiwanese Text-to-Speech System. Master Thesis, EE Dept., Chang Gung University (June 2000)
Sher, Y.J., Chung, K.C., Wu, C.H.: Establish Taiwanese 7-tones Syllable-based Synthesis units Database for the Prototype Development of Text-to-Speech System. In: Proceedings of ROCLING XII (August 1999)
Yang, Y.C.: An Implementation of Taiwanese Text-to-Speech System. Master Thesis, Communication Engg. Dept., National Chiao Tung Univ (June 1999)
Chen, S.H., Ho, C.C.: A Min-Nan Text-to-Speech System. In: ISCSLP 2000, Beijing (October 2000)
Kuo, W.-C., Zhong, X.-R., Wang, Y.-R., Chen, S.-H.: A High-Performance Min-Nan/Taiwanese TTS System. In: ICASSP 2003 (2003)
Chen, S.H., Hwang, S.H., Wang, Y.R.: An RNN-Based Prosodic Information Synthesizer for Mandarin Text-to-Speech. IEEE Trans. Speech and Audio Processing 6(3), 226–239 (1998)
Haykin, S.: Neural networks – A comprehensive foundation. Macmillan College Publishing Company (1994)
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Yu, HM., Hwang, HT., Lin, DY., Chen, SH. (2006). A Hakka Text-To-Speech System. In: Huo, Q., Ma, B., Chng, ES., Li, H. (eds) Chinese Spoken Language Processing. ISCSLP 2006. Lecture Notes in Computer Science, vol 4274. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11939993_28
DOI: https://doi.org/10.1007/11939993_28
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-49665-6
Online ISBN: 978-3-540-49666-3
eBook Packages: Computer Science, Computer Science (R0)