ABSTRACT
Contour is a voice-guided speech re-synthesis system we previously developed for efficient TTS (Text-to-Speech) content production. In this follow-up evaluation study, we investigate qualities of synthetic speech produced using Contour against a conventional parametric-based workflow by evaluating expressive dimensions of produced TTS content using vocal prosodic parameters. Based on the quantitative and qualitative results, we discuss user preferences between these two workflows for producing TTS content.
- Véronique Aubergé, Nicolas Audibert, and Albert Rilliard. 2004. Acoustic morphology of expressive speech: What about contours?. In Speech Prosody 2004, International Conference.Google Scholar
- Yuan-Yi Fan, Soyoung Shin, and Vids Samanta. 2017. Contour: An Efficient Voice-enabled Workflow for Producing Text-to-Speech Content. In Adjunct Publication of the 30th Annual ACM Symposium on User Interface Software and Technology. ACM, 133--135. Google ScholarDigital Library
- Klaus R Scherer, Tom Johnstone, and Gundrun Klasmeyer. 2003. Vocal expression of emotion. Handbook of affective sciences (2003), 433--456.Google Scholar
Index Terms
- Evaluating expressiveness of a voice-guided speech re-synthesis system using vocal prosodic parameters
Recommendations
Speech-Input Speech-Output Communication for Dysarthric Speakers Using HMM-Based Speech Recognition and Adaptive Synthesis System
Dysarthria is a motor speech disorder that causes inability to control and coordinate one or more articulators. This makes it difficult for a dysarthric speaker to utter certain speech sound units, thereby producing poorly articulated, slurred, and ...
Lithuanian Speech Corpus Liepa for Development of Human-Computer Interfaces Working in Voice Recognition and Synthesis Mode
The problem of speech corpus for design of human-computer interfaces working in voice recognition and synthesis mode is investigated. Specific requirements of speech corpus for speech recognizers and synthesizers were accented. It has been discussed that ...
Prosodic Events Recognition in Evaluation of Speech-Synthesis System Performance
TSD '08: Proceedings of the 11th international conference on Text, Speech and DialogueWe present an objective-evaluation method of the prosody modeling in an HMM-based Slovene speech-synthesis system. Method is based on the results of the automatic recognition of syntactic-prosodic boundary positions and accented words in the synthetic ...
Comments