A Hakka Text-To-Speech System

Yu, Hsiu-Min; Hwang, Hsin-Te; Lin, Dong-Yi; Chen, Sin-Horng

doi:10.1007/11939993_28

Hsiu-Min Yu²²,
Hsin-Te Hwang²³,
Dong-Yi Lin²³ &
…
Sin-Horng Chen²³

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4274))

Included in the following conference series:

International Symposium on Chinese Spoken Language Processing

1582 Accesses
2 Citations

Abstract

In this paper, the implementation of a Hakka text-to-speech (TTS) system is presented. The system is designed based on the same principle of developing a Mandarin and a Min-Nan TTS systems proposed previously. It takes 671 base-syllables as basic synthesis units and uses a recurrent neural network (RNN)-based prosody generator to generate proper prosodic parameters for synthesizing natural output speech. The whole system is implemented by software and runs in real-time on PC. Informal subjective listening test confirmed that the system performed well. All synthetic speeches sounded well for well-tokenized texts and fair for texts with automatic tokenization.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Fu, H.: Automatic Generation of Synthesis Units for Taiwanese Text-to-Speech System. Master Thesis, EE Dept., Chang Gung University (June 2000)
Google Scholar
Sher, Y.J., Chung, K.C., Wu, C.H.: Establish Taiwanese 7-tones Syllable-based Synthesis units Database for the Prototype Development of Text-to-Speech System. In: Proceedings of ROCLING XII (August 1999)
Google Scholar
Yang, Y.C.: An Implementation of Taiwanese Text-to-Speech System. Master Thesis, Communication Engg. Dept., National Chiao Tung Univ (June 1999)
Google Scholar
Chen, S.H., Ho, C.C.: A Min-Nan Text-to-Speech System. In: ISCSLP 2000, Beijing (October 2000)
Google Scholar
Kuo, W.-C., Zhong, X.-R., Wang, Y.-R., Chen, S.-H.: A High- Performance Min-Nan/ Taiwanese TTS System. In: ICASSP 2003 (2003)
Google Scholar
Chen, S.H., Hwang, S.H., Wang, Y.R.: An RNN-Based Prosodic Information Synthesizer for Mandarin Text-to-Speech. IEEE Trans. Speech and Audio Processing 6(3), 226–239 (1998)
Article Google Scholar
Haykin, S.: Neural networks – A comprehensive foundation. Macmillan College Publishing Company (1994)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Foreign Languages, Chung Hua University, Hsinchu
Hsiu-Min Yu
Department of Communication Engineering, National Chiao Tung University, Hsinchu
Hsin-Te Hwang, Dong-Yi Lin & Sin-Horng Chen

Authors

Hsiu-Min Yu
View author publications
You can also search for this author in PubMed Google Scholar
Hsin-Te Hwang
View author publications
You can also search for this author in PubMed Google Scholar
Dong-Yi Lin
View author publications
You can also search for this author in PubMed Google Scholar
Sin-Horng Chen
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science, The University of Hong Kong, Hong Kong
Qiang Huo
Human Language Technology Department, Institute for Infocomm Research (I2R), 119613, Singapore
Bin Ma
School of Computer Engineering, Nanyang Technological University (NTU), 639798, Singapore
Eng-Siong Chng
Institute for Infocomm Research, 21 Heng Mui Keng Terrace, 119613, Singapore
Haizhou Li

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Yu, HM., Hwang, HT., Lin, DY., Chen, SH. (2006). A Hakka Text-To-Speech System. In: Huo, Q., Ma, B., Chng, ES., Li, H. (eds) Chinese Spoken Language Processing. ISCSLP 2006. Lecture Notes in Computer Science(), vol 4274. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11939993_28

Download citation

DOI: https://doi.org/10.1007/11939993_28
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-49665-6
Online ISBN: 978-3-540-49666-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics