Unit-Selection Speech Synthesis Method Using Words as Search Units

Unit-Selection Speech Synthesis Method Using Words as Search Units

Hiroyuki Segi
Copyright: © 2016 |Volume: 7 |Issue: 2 |Pages: 15
ISSN: 1947-8534|EISSN: 1947-8542|EISBN13: 9781466690455|DOI: 10.4018/IJMDEM.2016040104
Cite Article Cite Article

MLA

Segi, Hiroyuki. "Unit-Selection Speech Synthesis Method Using Words as Search Units." IJMDEM vol.7, no.2 2016: pp.1-15. http://doi.org/10.4018/IJMDEM.2016040104

APA

Segi, H. (2016). Unit-Selection Speech Synthesis Method Using Words as Search Units. International Journal of Multimedia Data Engineering and Management (IJMDEM), 7(2), 1-15. http://doi.org/10.4018/IJMDEM.2016040104

Chicago

Segi, Hiroyuki. "Unit-Selection Speech Synthesis Method Using Words as Search Units," International Journal of Multimedia Data Engineering and Management (IJMDEM) 7, no.2: 1-15. http://doi.org/10.4018/IJMDEM.2016040104

Export Reference

Mendeley
Favorite Full-Issue Download

Abstract

Unit-selection speech-synthesis systems have been proposed. In most of the unit-selection speech-synthesis systems, search units are rather short such as syllables, phonemes and diphones. However, when applied to large speech databases, shorter units produce more voice-waveform candidates and a larger speech database cannot be used without narrow pruning for practical use. Narrow pruning impairs the quality of the synthesized speech. Here the author examined the possibility of using words as search units. Subjective evaluations indicated that 70% of the speech synthesized by the proposed method sounded more natural than that synthesized by a conventional method. The five-point mean opinion score of the synthesized speech was 3.5, and 21% was judged to sound as natural as human speech. These results demonstrate the effectiveness of unit-selection speech synthesis using words as search units.