Loading [a11y]/accessibility-menu.js
Discourse prosody and its application to speech synthesis | IEEE Conference Publication | IEEE Xplore

Discourse prosody and its application to speech synthesis


Abstract:

This paper reveals the correlations between discourse structure and acoustic parameters and presents a method of manipulating discourse prosody in relation to discourse s...Show More

Abstract:

This paper reveals the correlations between discourse structure and acoustic parameters and presents a method of manipulating discourse prosody in relation to discourse structure to improve the naturalness of synthesis speech. The text material included 1229 passages. The texts were annotated using Rhetorical Structure Theory. Prosody measurements were extracted from the corresponding speech annotation and then the statistic analysis were conducted. The results showed that: 1) segments at higher hierarchical level were preceded with longer pause durations; 2) segments bearing nucleus possessed longer average duration than satellites did. To test if rhetorical structure would benefit synthesized speech prosody, 15 passages were synthesized with discourse features implemented. The evaluation results indicated that the modified synthesis speech excelled the baseline system by 0.1 MOS point, suggesting that implementing prosodic features into synthesized speech would improve overall prosody.speech annotation
Date of Conference: 17-20 October 2016
Date Added to IEEE Xplore: 04 May 2017
ISBN Information:
Conference Location: Tianjin, China

Contact IEEE to Subscribe

References

References is not available for this document.