short-paper

Evaluating expressiveness of a voice-guided speech re-synthesis system using vocal prosodic parameters

Authors:
Yuan-Yi Fan

I.AM+, LLC

I.AM+, LLC
View Profile

,
Soyoung Shin

I.AM+, LLC

I.AM+, LLC
View Profile

,
Vids Samanta

I.AM+, LLC

I.AM+, LLC
View Profile

IUI '19 Companion: Companion Proceedings of the 24th International Conference on Intelligent User InterfacesMarch 2019Pages 67–68https://doi.org/10.1145/3308557.3308715

Published:16 March 2019Publication History

IUI '19 Companion: Companion Proceedings of the 24th International Conference on Intelligent User Interfaces

Pages 67–68

ABSTRACT

Contour is a voice-guided speech re-synthesis system we previously developed for efficient TTS (Text-to-Speech) content production. In this follow-up evaluation study, we investigate qualities of synthetic speech produced using Contour against a conventional parametric-based workflow by evaluating expressive dimensions of produced TTS content using vocal prosodic parameters. Based on the quantitative and qualitative results, we discuss user preferences between these two workflows for producing TTS content.

References

Véronique Aubergé, Nicolas Audibert, and Albert Rilliard. 2004. Acoustic morphology of expressive speech: What about contours?. In Speech Prosody 2004, International Conference.Google Scholar
Yuan-Yi Fan, Soyoung Shin, and Vids Samanta. 2017. Contour: An Efficient Voice-enabled Workflow for Producing Text-to-Speech Content. In Adjunct Publication of the 30th Annual ACM Symposium on User Interface Software and Technology. ACM, 133--135. Google ScholarDigital Library
Klaus R Scherer, Tom Johnstone, and Gundrun Klasmeyer. 2003. Vocal expression of emotion. Handbook of affective sciences (2003), 433--456.Google Scholar

Index Terms

Evaluating expressiveness of a voice-guided speech re-synthesis system using vocal prosodic parameters
1. Human-centered computing
  1. Human computer interaction (HCI)
    1. HCI design and evaluation methods
    2. Interaction paradigms
      1. Natural language interfaces

Recommendations

Speech-Input Speech-Output Communication for Dysarthric Speakers Using HMM-Based Speech Recognition and Adaptive Synthesis System

Dysarthria is a motor speech disorder that causes inability to control and coordinate one or more articulators. This makes it difficult for a dysarthric speaker to utter certain speech sound units, thereby producing poorly articulated, slurred, and ...
Read More
Lithuanian Speech Corpus Liepa for Development of Human-Computer Interfaces Working in Voice Recognition and Synthesis Mode

The problem of speech corpus for design of human-computer interfaces working in voice recognition and synthesis mode is investigated. Specific requirements of speech corpus for speech recognizers and synthesizers were accented. It has been discussed that ...
Read More
Prosodic Events Recognition in Evaluation of Speech-Synthesis System Performance
TSD '08: Proceedings of the 11th international conference on Text, Speech and Dialogue

We present an objective-evaluation method of the prosody modeling in an HMM-based Slovene speech-synthesis system. Method is based on the results of the automatic recognition of syntactic-prosodic boundary positions and accented words in the synthetic ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in

IUI '19 Companion: Companion Proceedings of the 24th International Conference on Intelligent User Interfaces
March 2019
173 pages
ISBN:9781450366731
DOI:10.1145/3308557

Copyright © 2019 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 16 March 2019
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
production tool design
speech synthesis
voice user interface
Qualifiers
- short-paper
Conference

Acceptance Rates
Overall Acceptance Rate746of2,811submissions,27%
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 1
  Total Citations
  View Citations
- 84
  Total Downloads
- Downloads (Last 12 months)9
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Evaluating expressiveness of a voice-guided speech re-synthesis system using vocal prosodic parameters

IUI '19 Companion: Companion Proceedings of the 24th International Conference on Intelligent User Interfaces

ABSTRACT

References

Cited By

Index Terms

Recommendations

Speech-Input Speech-Output Communication for Dysarthric Speakers Using HMM-Based Speech Recognition and Adaptive Synthesis System

Lithuanian Speech Corpus Liepa for Development of Human-Computer Interfaces Working in Voice Recognition and Synthesis Mode

Prosodic Events Recognition in Evaluation of Speech-Synthesis System Performance

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Evaluating expressiveness of a voice-guided speech re-synthesis system using vocal prosodic parameters

IUI '19 Companion: Companion Proceedings of the 24th International Conference on Intelligent User Interfaces

ABSTRACT

References

Cited By

Index Terms

Recommendations

Speech-Input Speech-Output Communication for Dysarthric Speakers Using HMM-Based Speech Recognition and Adaptive Synthesis System

Lithuanian Speech Corpus Liepa for Development of Human-Computer Interfaces Working in Voice Recognition and Synthesis Mode

Prosodic Events Recognition in Evaluation of Speech-Synthesis System Performance

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media