skip to main content
10.1145/1543834.1543972acmconferencesArticle/Chapter ViewAbstractPublication PagesgecConference Proceedingsconference-collections
poster

Emotional speech synthesis by XML file using interactive genetic algorithms

Published: 12 June 2009 Publication History

Abstract

As a technique that can "let computer speak", speech synthesis is drawing more and more attention. Today, much speech synthesis software can synthesize neutral speech naturally and knowingly. However, it is hard to make computers speak with "emotion" as that in our daily life, because of the complexity of emotion model. Interactive Genetic Algorithms which can be acted self-organizingly, adaptively and self-learningly can just resolve the problem of difficulty in modeling emotional speech synthesis. As a result, this paper designs an emotional speech synthesis process, which adjusts the parameters (XML-tags) used to synthesize emotional speech dynamically, using interactive Genetic Algorithms, to optimize the quality of emotional speech. Also, the paper includes an evaluation experiment, which proves the feasibility of the algorithms.

References

[1]
Marc Schroder,Emotional Speech Synthesis: A Review, DFKI, Saarbrucken, Germany Institute of Phonetics, University of the Saarland
[2]
Paul Ekrnan and Harriet Oster Facial Expressions Of Emotion, Department of Psychiatry, University of California, San Francisco, California 94143, Ann. Rev. Psvchol. 1979. 30:527--54
[3]
Jianhua Tao, Member, IEEE, Yongguo Kang, and Aijun Li, Prosody Conversion From Neutral Speech to Emotional Speech, IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 14, NO. 4, JULY 2006, 1558--7916/$20.00 @ 2006 IEEE
[4]
Iain R. Murray, John L. Arnott, Applying an analysis of acted vocal emotions to improve the simulation of synthetic speech, Computer Speech and Language 22 (2008) 107--129
[5]
Yuji Sato. Voice quality conversion using interactive evolution of prosodic control. Faculty of Computer and Information Sciences, Hosei University. Applied Soft Computing 5 (2005) 181--192
[6]
Murtaza Bulut etc. A Statistical Approach For Modeling Prosody Features Using POS Tags For Emotional Speech Synthesis. 1-4244-0728-1/07/$20.00 @ 2007 IEEE
[7]
Shinya Mori etc. Emotional Speech Synthesis Using Subspace Constraints In Prosody. 1424403677/06/$20.00 @ 2006 IEEE
[8]
Mariet Theune etc. Generating Expressive Speech for Storytelling Applications. 1558--7916/$20.00 @ 2006 IEEE
[9]
Janet E. Cahn. Generating Expression in Synthesized Speech. Massachusetts Institute of Technology 1990
[10]
Iain R. Murray etc. Applying an analysis of acted vocal emotions to improve the simulation of synthetic speech. I.R. Murray, J.L. Arnott / Computer Speech and Language 22 (2008) 107--129
[11]
Makoto TACHIBANA, Junichi YAMAGISHI, Student Members, Takashi MASUKO, and Takao KOBAYASHI, Members, Speech Synthesis with Various Emotional Expressions and Speaking Styles by Style Interpolation and Morphing,IEICE TRANS. INF. SYST., VOL.E88-D, NO.11 NOVEMBER 2005
[12]
Marc Schroder, Expressing Degree of Activation in Synthetic Speech, IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 14, NO. 4, JULY 2006
[13]
Pierre-Yves Oudeyer, The production and recognition of emotions in speech: features and algorithms, Int. J. Human-Computer Studies 59 (2003) 157--183
[14]
H Scheffe, An analysis of variance for paired comparisons, Journal of the American Statistical Association, 1952 -- JSTOR
[15]
Microsoft Corporation, Microsoft Speech SDK 5.1 SAPI

Cited By

View all
  • (2020)Generating Kranok patterns with an interactive evolutionary algorithmApplied Soft Computing10.1016/j.asoc.2020.10612189:COnline publication date: 1-Apr-2020
  • (2011)Emotion representation, analysis and synthesis in continuous space: A surveyFace and Gesture 201110.1109/FG.2011.5771357(827-834)Online publication date: Mar-2011
  • (2011)Interactive Intonation Optimisation Using CMA-ES and DCT Parameterisation of the F0 Contour for Speech SynthesisNature Inspired Cooperative Strategies for Optimization (NICSO 2011)10.1007/978-3-642-24094-2_4(57-71)Online publication date: 2011
  • Show More Cited By

Index Terms

  1. Emotional speech synthesis by XML file using interactive genetic algorithms

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      GEC '09: Proceedings of the first ACM/SIGEVO Summit on Genetic and Evolutionary Computation
      June 2009
      1112 pages
      ISBN:9781605583266
      DOI:10.1145/1543834

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 12 June 2009

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. emotion
      2. interactive genetic algorithms
      3. speech synthesis

      Qualifiers

      • Poster

      Conference

      GEC '09
      Sponsor:

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)0
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 01 Mar 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2020)Generating Kranok patterns with an interactive evolutionary algorithmApplied Soft Computing10.1016/j.asoc.2020.10612189:COnline publication date: 1-Apr-2020
      • (2011)Emotion representation, analysis and synthesis in continuous space: A surveyFace and Gesture 201110.1109/FG.2011.5771357(827-834)Online publication date: Mar-2011
      • (2011)Interactive Intonation Optimisation Using CMA-ES and DCT Parameterisation of the F0 Contour for Speech SynthesisNature Inspired Cooperative Strategies for Optimization (NICSO 2011)10.1007/978-3-642-24094-2_4(57-71)Online publication date: 2011
      • (2009)Evaluating emotional algorithms using psychological scalesProceedings of the International Workshop on Affective-Aware Virtual Agents and Social Robots10.1145/1655260.1655262(1-6)Online publication date: 6-Nov-2009

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Figures

      Tables

      Media

      Share

      Share

      Share this Publication link

      Share on social media