Analysis of the Suitability of Common Corpora for Emotional Speech Modeling in Standard Basque

Navas, Eva; Hernáez, Inmaculada; Luengo, Iker; Sánchez, Jon; Saratxaga, Ibon

doi:10.1007/11551874_34

Eva Navas¹⁹,
Inmaculada Hernáez¹⁹,
Iker Luengo¹⁹,
Jon Sánchez¹⁹ &
…
Ibon Saratxaga¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3658))

Included in the following conference series:

International Conference on Text, Speech and Dialogue

712 Accesses

Abstract

This paper presents the analysis made to assess the suitability of neutral semantic corpora to study emotional speech. Two corpora have been used: one having neutral texts that were common to all emotions and the other having texts related to the emotion. Subjective and objective analysis have been performed. In the subjective test common corpus has achieved good recognition rates, although worse than those obtained with specific texts. In the objective analysis, differences among emotions are larger for common texts than for specific texts, indicating that in common corpus expression of emotions was more exaggerated. This is convenient for emotional speech synthesis, but no for emotion recognition. So, in this case, common corpus is suitable for the prosodic modeling of emotions to be used in speech synthesis, but for emotion recognition specific texts are more convenient.

This work has been partially funded by the Spanish Ministry of Science and Technology (TIC2003-08382-C05-03). Authors would also like to thank all the evaluators that took part in the subjective evaluation process.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Developing a Thai emotional speech corpus from Lakorn (EMOLA)

Article 28 November 2018

Emotional Speech Datasets for English Speech Synthesis Purpose: A Review

Current State of Speech Emotion Dataset-National and International Level

References

Campbell, N.: Databases of Emotional Speech. In: Proc. ISCA Tutorial and Research Workshop (ITRW) on Speech and Emotion. ISCA Archive, pp. 34–38 (2000)
Google Scholar
Montero, J.M., Gutiérrez-Arriola, J., Colás, J., Enríquez, E., Pardo, J.M.: Analysis and Modelling of Emotional Speech in Spanish. In: Proc. ICPhS 1999, pp. 957–960 (1999)
Google Scholar
Hozjan, V., Kacic, Z., Moreno, A., Bonafonte, A., Nogueiras, A.: Interface databases: Design and Collection of a Multilingual Emotional Speech Database. In: Proc. 3rd International Conference on Language Resources and Evaluation, pp. 2019–2023 (2000)
Google Scholar
Seppánen, T., Väyrynen, E., Toivanen, J.: Prosody-based classification of emotions in spoken Finnish. In: Proc. Eurospeech 2003, vol. 1, pp. 717–720 (2003)
Google Scholar
Alvarez, J.L.: The Future of Standard Basque. Uztaro 11, 47–54 (1994)
Google Scholar
Scherrer, K.R.: Vocal Communication of Emotion: A Review of Research Paradigms. Speech Communication 40, 227–256 (2003)
Article Google Scholar
Cowie, R., Cornelius, R.R.: Describing the Emotional States that Are Expressed in Speech. Speech Communication 40(1,2), 2–32 (2003)
Google Scholar
Lay Nwe, T., Wei Foo, S., De Silva, L.: Speech Emotion Recognition Using Hidden Markov Models. Speech Communication 41(4), 603–623 (2003)
Article Google Scholar
Boula deMareüil, P., Célérier, P., Toen, J.: Generation of Emotions by a Morphing Technique in English, French and Spanish. In: Proc. Speech Prosody. Laboratoire Parole et Langage CNRS, Aix-en Provence, pp. 187–190 (2002)
Google Scholar
Enberg, I.S., Hansen, A.V., Andersen, O., Dalsgaard, P.: Design, Recording and Verification of a Danish Emotional Speech Database. In: Proc. 5th European Conference on Speech Communication and Technology, pp. 1695–1698 (1997)
Google Scholar
Paeschke, A., Sendlmeier, W.F.: Prosodic characteristics of Emotional Speech; Measurements of Fundamental Frequency Movements. In: Proc. ISCA Workshop on Speech and Emotion. ISCA Archive, pp. 75–80 (2000)
Google Scholar
Iida, A., Campbell, N., Higuchi, F., Yasumura, M.: A Corpus-based Speech Synthesis System with Emotion. Speech Communication 40(1,2), 161–187 (2003)
Article MATH Google Scholar
Burkhardt, F., Sendlmeier, W.F.: Verification of Acoustical Correlates of Emotional Speech using Formant-Synthesis. In: Proc. ISCA Workshop on Speech and Emotion. ISCA Archive, pp. 151–156 (2000)
Google Scholar
Iriondo, I., Guaus, R., Rodríguez, A., Lázaro, P., Montoya, N., Blanco, J.M., Bernardas, D., Oliver, J.M., Tena, D., Longhi, L.: Validation of an Acoustical Modelling of Emotional Expression in Spanish using Speech Synthesis Techniques. In: Proc. ISCA Workshop on Speech and Emotion. ISCA Archive, pp. 161–166 (2000)
Google Scholar

Download references

Author information

Authors and Affiliations

Departamento de Electrónica y Telecomunicaciones Escuela Técnica Superior de Ingeniería, University of the Basque Country, Alameda Urquijo s/n, 48013, Bilbao, Spain
Eva Navas, Inmaculada Hernáez, Iker Luengo, Jon Sánchez & Ibon Saratxaga

Authors

Eva Navas
View author publications
You can also search for this author in PubMed Google Scholar
Inmaculada Hernáez
View author publications
You can also search for this author in PubMed Google Scholar
Iker Luengo
View author publications
You can also search for this author in PubMed Google Scholar
Jon Sánchez
View author publications
You can also search for this author in PubMed Google Scholar
Ibon Saratxaga
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science, University of West Bohemia in Pilsen, Univerzitni 8, 30614, Plzen, Czech Republic
Václav Matoušek , Pavel Mautner & Tomáš Pavelka , &

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Navas, E., Hernáez, I., Luengo, I., Sánchez, J., Saratxaga, I. (2005). Analysis of the Suitability of Common Corpora for Emotional Speech Modeling in Standard Basque. In: Matoušek, V., Mautner, P., Pavelka, T. (eds) Text, Speech and Dialogue. TSD 2005. Lecture Notes in Computer Science(), vol 3658. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11551874_34

Download citation

DOI: https://doi.org/10.1007/11551874_34
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28789-6
Online ISBN: 978-3-540-31817-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics