Skip to main content

A Hakka Text-To-Speech System

  • Conference paper
Chinese Spoken Language Processing (ISCSLP 2006)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4274))

Included in the following conference series:

Abstract

In this paper, the implementation of a Hakka text-to-speech (TTS) system is presented. The system is designed based on the same principle of developing a Mandarin and a Min-Nan TTS systems proposed previously. It takes 671 base-syllables as basic synthesis units and uses a recurrent neural network (RNN)-based prosody generator to generate proper prosodic parameters for synthesizing natural output speech. The whole system is implemented by software and runs in real-time on PC. Informal subjective listening test confirmed that the system performed well. All synthetic speeches sounded well for well-tokenized texts and fair for texts with automatic tokenization.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Fu, H.: Automatic Generation of Synthesis Units for Taiwanese Text-to-Speech System. Master Thesis, EE Dept., Chang Gung University (June 2000)

    Google Scholar 

  2. Sher, Y.J., Chung, K.C., Wu, C.H.: Establish Taiwanese 7-tones Syllable-based Synthesis units Database for the Prototype Development of Text-to-Speech System. In: Proceedings of ROCLING XII (August 1999)

    Google Scholar 

  3. Yang, Y.C.: An Implementation of Taiwanese Text-to-Speech System. Master Thesis, Communication Engg. Dept., National Chiao Tung Univ (June 1999)

    Google Scholar 

  4. Chen, S.H., Ho, C.C.: A Min-Nan Text-to-Speech System. In: ISCSLP 2000, Beijing (October 2000)

    Google Scholar 

  5. Kuo, W.-C., Zhong, X.-R., Wang, Y.-R., Chen, S.-H.: A High- Performance Min-Nan/ Taiwanese TTS System. In: ICASSP 2003 (2003)

    Google Scholar 

  6. Chen, S.H., Hwang, S.H., Wang, Y.R.: An RNN-Based Prosodic Information Synthesizer for Mandarin Text-to-Speech. IEEE Trans. Speech and Audio Processing 6(3), 226–239 (1998)

    Article  Google Scholar 

  7. Haykin, S.: Neural networks – A comprehensive foundation. Macmillan College Publishing Company (1994)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Yu, HM., Hwang, HT., Lin, DY., Chen, SH. (2006). A Hakka Text-To-Speech System. In: Huo, Q., Ma, B., Chng, ES., Li, H. (eds) Chinese Spoken Language Processing. ISCSLP 2006. Lecture Notes in Computer Science(), vol 4274. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11939993_28

Download citation

  • DOI: https://doi.org/10.1007/11939993_28

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-49665-6

  • Online ISBN: 978-3-540-49666-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics