Abstract
This paper presents the analysis made to assess the suitability of neutral semantic corpora to study emotional speech. Two corpora have been used: one having neutral texts that were common to all emotions and the other having texts related to the emotion. Subjective and objective analysis have been performed. In the subjective test common corpus has achieved good recognition rates, although worse than those obtained with specific texts. In the objective analysis, differences among emotions are larger for common texts than for specific texts, indicating that in common corpus expression of emotions was more exaggerated. This is convenient for emotional speech synthesis, but no for emotion recognition. So, in this case, common corpus is suitable for the prosodic modeling of emotions to be used in speech synthesis, but for emotion recognition specific texts are more convenient.
This work has been partially funded by the Spanish Ministry of Science and Technology (TIC2003-08382-C05-03). Authors would also like to thank all the evaluators that took part in the subjective evaluation process.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Campbell, N.: Databases of Emotional Speech. In: Proc. ISCA Tutorial and Research Workshop (ITRW) on Speech and Emotion. ISCA Archive, pp. 34–38 (2000)
Montero, J.M., Gutiérrez-Arriola, J., Colás, J., Enríquez, E., Pardo, J.M.: Analysis and Modelling of Emotional Speech in Spanish. In: Proc. ICPhS 1999, pp. 957–960 (1999)
Hozjan, V., Kacic, Z., Moreno, A., Bonafonte, A., Nogueiras, A.: Interface databases: Design and Collection of a Multilingual Emotional Speech Database. In: Proc. 3rd International Conference on Language Resources and Evaluation, pp. 2019–2023 (2000)
Seppánen, T., Väyrynen, E., Toivanen, J.: Prosody-based classification of emotions in spoken Finnish. In: Proc. Eurospeech 2003, vol. 1, pp. 717–720 (2003)
Alvarez, J.L.: The Future of Standard Basque. Uztaro 11, 47–54 (1994)
Scherrer, K.R.: Vocal Communication of Emotion: A Review of Research Paradigms. Speech Communication 40, 227–256 (2003)
Cowie, R., Cornelius, R.R.: Describing the Emotional States that Are Expressed in Speech. Speech Communication 40(1,2), 2–32 (2003)
Lay Nwe, T., Wei Foo, S., De Silva, L.: Speech Emotion Recognition Using Hidden Markov Models. Speech Communication 41(4), 603–623 (2003)
Boula deMareüil, P., Célérier, P., Toen, J.: Generation of Emotions by a Morphing Technique in English, French and Spanish. In: Proc. Speech Prosody. Laboratoire Parole et Langage CNRS, Aix-en Provence, pp. 187–190 (2002)
Enberg, I.S., Hansen, A.V., Andersen, O., Dalsgaard, P.: Design, Recording and Verification of a Danish Emotional Speech Database. In: Proc. 5th European Conference on Speech Communication and Technology, pp. 1695–1698 (1997)
Paeschke, A., Sendlmeier, W.F.: Prosodic characteristics of Emotional Speech; Measurements of Fundamental Frequency Movements. In: Proc. ISCA Workshop on Speech and Emotion. ISCA Archive, pp. 75–80 (2000)
Iida, A., Campbell, N., Higuchi, F., Yasumura, M.: A Corpus-based Speech Synthesis System with Emotion. Speech Communication 40(1,2), 161–187 (2003)
Burkhardt, F., Sendlmeier, W.F.: Verification of Acoustical Correlates of Emotional Speech using Formant-Synthesis. In: Proc. ISCA Workshop on Speech and Emotion. ISCA Archive, pp. 151–156 (2000)
Iriondo, I., Guaus, R., Rodríguez, A., Lázaro, P., Montoya, N., Blanco, J.M., Bernardas, D., Oliver, J.M., Tena, D., Longhi, L.: Validation of an Acoustical Modelling of Emotional Expression in Spanish using Speech Synthesis Techniques. In: Proc. ISCA Workshop on Speech and Emotion. ISCA Archive, pp. 161–166 (2000)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Navas, E., Hernáez, I., Luengo, I., Sánchez, J., Saratxaga, I. (2005). Analysis of the Suitability of Common Corpora for Emotional Speech Modeling in Standard Basque. In: Matoušek, V., Mautner, P., Pavelka, T. (eds) Text, Speech and Dialogue. TSD 2005. Lecture Notes in Computer Science(), vol 3658. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11551874_34
Download citation
DOI: https://doi.org/10.1007/11551874_34
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28789-6
Online ISBN: 978-3-540-31817-0
eBook Packages: Computer ScienceComputer Science (R0)