Loading [a11y]/accessibility-menu.js
Break prediction of prosody for Hakka'S TTS systems based on data mining approaches | IEEE Conference Publication | IEEE Xplore

Break prediction of prosody for Hakka'S TTS systems based on data mining approaches


Abstract:

This paper aims at the prosody generation for Hakka's language based on the data mining approaches, and implement the TTS system on Internet. Our system is composed of th...Show More

Abstract:

This paper aims at the prosody generation for Hakka's language based on the data mining approaches, and implement the TTS system on Internet. Our system is composed of the following four components: 1) Text analysis, 2) Mandarin to Hakka word translation, 3) Prosody prediction, and 4) Speech generation module. More than 2427 monosyllabic speech units and 2234 word speech units of Hakka and several silences with various durations have been recorded as basic units for speech synthesis. We focus on adding breaks to speeches, with emphasis on predicting the types of break. There are three kinds of breaks: major break, minor break and no-break between words. We train a break model and predict break based on the data mining approaches - Bayesian network (BN) and CART classifier. The best precision rate for testing achieves 80.17% based on the CART. Fourteen students familiar with Hakka joined to evaluate the prosody quality of synthesized speeches. The results with 10 scale achieves 7.54 score in average. Based on the comprehensive evaluation, it is obvious that our system can synthesize the clear and natural Hakka's speeches.
Date of Conference: 10-13 July 2011
Date Added to IEEE Xplore: 12 September 2011
ISBN Information:

ISSN Information:

Conference Location: Guilin, China

Contact IEEE to Subscribe

References

References is not available for this document.