Skip to main content

Feature Subset Selection Based on Evolutionary Algorithms for Automatic Emotion Recognition in Spoken Spanish and Standard Basque Language

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4188))

Abstract

The study of emotions in human-computer interaction is a growing research area. Focusing on automatic emotion recognition, work is being performed in order to achieve good results particularly in speech and facial gesture recognition. In this paper we present a study performed to analyze different Machine Learning techniques validity in automatic speech emotion recognition area. Using a bilingual affective database, different speech parameters have been calculated for each audio recording. Then, several Machine Learning techniques have been applied to evaluate their usefulness in speech emotion recognition. In this particular case, techniques based on evolutive algorithms (EDA) have been used to select speech feature subsets that optimize automatic emotion recognition success rate. Achieved experimental results show a representative increase in the abovementioned success rate.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Aha, D., Kibler, D., Albert, M.K.: Instance-Based learning algorithms. Machine Learning 6, 37–66 (1991)

    Google Scholar 

  2. Bachorowski, J.A., Owren, M.J.: Vocal expression of emotion: Acoustic properties of speech are associated with emotional intensity and context. Psychological Science 6, 219–224 (1995)

    Article  Google Scholar 

  3. Casacuberta, D.: La mente humana: Diez Enigmas y 100 preguntas (The human mind: Ten Enigmas and 100 questions). In: Océano (ed). Barcelona, Spain (2001) ISBN: 84-7556-122-5

    Google Scholar 

  4. Cowie, R., Douglas-Cowie, E., Cox, C.: Beyond emotion archetypes: Databases for emotion modelling using neural networks. Neural Networks 18, 371–388 (2005)

    Article  Google Scholar 

  5. Cowie, R., Douglas-Cowie, E., Tsapatsoulis, N., Votsis, G., Kollias, S., Fellenz, W., Taylor, J.: Emotion recognition in human-computer interaction (2001)

    Google Scholar 

  6. Dasarathy, B.V.: Nearest Neighbor (NN) Norms: NN Pattern Recognition Classification Techniques. IEEE Computer Society Press, Los Alamitos (1991)

    Google Scholar 

  7. Dellaert, F., Polzin, T., Waibel, A.: Recognizing Emotion in Speech. In: Proc. of ICSLP (1996)

    Google Scholar 

  8. Ekman, P., Friesen, W.: Pictures of facial affect. Consulting Psychologist Press, Palo Alto (1976)

    Google Scholar 

  9. Fernández, R.: A Computational Model for the Automatic Recognition of Affect in Speech. Massachusetts Institute of Technology (2004)

    Google Scholar 

  10. Gunes, V., Menard, M., Loonis, P., Petit-Renaud, S.: Combination, cooperation and selection of classiers: A state of the art. International Journal of Pattern Recognition 17, 1303–1324 (2003)

    Article  Google Scholar 

  11. Huber, R., Batliner, A., Buckow, J., Noth, E., Warnke, V., Niemann, H.: Recognition of emotion in a realistic dialogue scenario. In: Proc. ICSLP, pp. 665–668 (2000)

    Google Scholar 

  12. Humaine (retrieved March 10, 2006), http://emotion-research.net/

  13. Inza, I., Larrañaga, P., Etxeberria, R., Sierra, B.: Feature subsetselection by Bayesian network-based optimization. Artificial Intelligence 123, 157–184 (2000)

    Article  MATH  Google Scholar 

  14. Iriondo, I., Guaus, R., Rodríguez, A., Lázaro, P., Montoya, N., Blanco, J.M., Bernadas, D., Oliver, J.M., Tena, D., Longhi, L.: Validation of an acoustical modelling of emotional expression in Spanish using speech synthesis techniques. In: SpeechEmotion, pp. 161–166 (2000)

    Google Scholar 

  15. Kazemzadeh, A., Lee, S., Narayanan, S.: Acoustic correlates of user response to errors in human-computer dialogues. In: Proc. IEEE ASRU (St. Thomas, U.S. Virgin Islands) (December 2003)

    Google Scholar 

  16. Kohavi, R., Sommerfield, D., Dougherty, J.: Data mining using MLC++, a Machine Learning Library in C++. International Journal of Artificial Intelligence Tools 6(4), 537–566 (1997), http://www.sgi.com/Technology/mlc/

    Article  Google Scholar 

  17. Laukka, P.: Vocal Expression of Emotion. Discrete-emotions and Dimensional Accounts. Acta Universitatis Upsaliensis. Comprehensive Summaries of Uppsala Dissertations from the Faculty of Social Sciences, 141, p. 80, Uppsala (2004) ISBN 91-554-6091-7

    Google Scholar 

  18. Liu, H., Motoda, H.: Feature Selection for Knowledge Discovery and Data Mining. Kluwer Academic Publishers, Dordrecht (1998)

    MATH  Google Scholar 

  19. López, J.M., Cearreta, I., Fajardo, I., Garay, N.: Evaluating the validity of RekEmozio affective multimodal database with experimental subjects. Technical Report EHU-KAT-IK-04-06. Computer Architecture and Technology department, University of the Basque Country (2006)

    Google Scholar 

  20. López, J.M., Cearreta, I., Garay, N., López de Ipiña, K., Beristain, A.: RekEmozio project: bilingual and multimodal affective database. Technical Report EHU-KAT-IK-03-06. Computer Architecture and Technology department, University of the Basque Country (2006)

    Google Scholar 

  21. Martin, J.K.: An exact probability metric for Decision Tree splitting and stopping. Machine Learning 28(2/3) (1997)

    Google Scholar 

  22. Mingers, J.: A comparison of methods of pruning induced Rule Trees, Technical Report. Coventry, England: University of Warwick, School of Indutrial and Business Studies (1988)

    Google Scholar 

  23. Minsky, M.: Steps towards artificial intelligence. Proceedings of the IRE 49, 8–30 (1961)

    Article  MathSciNet  Google Scholar 

  24. Montero, J.M., Gutiérrez-Arriola, J., Palazuelos, S., Enríquez, E., Aguilera, S., Pardo, J.M.: Emotional speech synthesis: from speech database to tts. In: Proceedings of the 5th International Conference of Spoken Language Processing, Sydney, Australia, pp. 923–926 (1998)

    Google Scholar 

  25. Navas, E., Hernáez, I., Castelruiz, A., Luengo, I.: Obtaining and Evaluating an Emotional Database for Prosody Modelling in Standard Basque. In: Sojka, P., Kopeček, I., Pala, K. (eds.) TSD 2004. LNCS (LNAI), vol. 3206, pp. 393–400. Springer, Heidelberg (2004)

    Chapter  Google Scholar 

  26. Pelikan, M., Goldberg, D.E., Lobo, F.: A Survey of Optimization by Building and Using Probabilistic Models. Technical Report 99018, IlliGAL (1999)

    Google Scholar 

  27. Picard, R.W.: Affective Computing. MIT Press, Cambridge (1997)

    Google Scholar 

  28. Quinlan, J.R.: Induction of Decision Trees. Machine Learning 1, 81–106 (1986)

    Google Scholar 

  29. Quinlan, J.R.: C4.5: Programs for Machine Learning. Morgan Kaufmann Publishers, Los Altos (1993)

    Google Scholar 

  30. Rodríguez, A., Lázaro, P., Montoya, N., Blanco, J.M., Bernadas, D., Oliver, J.M., Longhi, L.: Modelización acústica de la expresión emocional en el español. Procesamiento del Lenguaje Natural, No. 25, Lérida, España, 159–166 (1999) ISSN: 1135-5948

    Google Scholar 

  31. Rothkrantz, L.J.M., Wiggers, P., van Wees, J.W.A., van Vark, R.J.: Voice stress analysis. In: Sojka, P., Kopeček, I., Pala, K. (eds.) TSD 2004. LNCS, vol. 3206, pp. 449–456. Springer, Heidelberg (2004)

    Chapter  Google Scholar 

  32. Schröder, M.: Speech and Emotion Research: An overview of research frameworks and a dimensional approach to emotional speech synthesis. Ph.D. thesis, PHONUS 7, Research Report of the Institute of Phonetics, Saarland University (2004)

    Google Scholar 

  33. Stone, M.: Cross-validation choice and assessment of statistical procedures. Journal Royal of Statistical Society 36, 111–147 (1974)

    MATH  Google Scholar 

  34. Sun, X.: Pitch determination and voice quality analysis using subharmonic-to-harmonic ratio (2002), http://mel.speech.nwu.edu/sunxj/pda.htm

  35. Tao, J., Tan, T.: Affective computing: A review. In: Tao, J., Tan, T., Picard, R.W. (eds.) ACII 2005. LNCS, vol. 3784, pp. 981–995. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  36. Taylor, J.G., Scherer, K., Cowie, R.: Neural Networks. special issue on Emotion and Brain 18(4), 313–455 (2005)

    Google Scholar 

  37. Ting, K.M.: Common issues in Instance-Based and Naive-Bayesian classifiers, Ph.D. Thesis, Basser Department of Computer Science. The Univesity of Sydney, Australia (1995)

    Google Scholar 

  38. Wettschereck, D.: A study of distance-based Machine Learning Algorithms, Ph.D. Thesis, Oregon State University (1994)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Álvarez, A. et al. (2006). Feature Subset Selection Based on Evolutionary Algorithms for Automatic Emotion Recognition in Spoken Spanish and Standard Basque Language. In: Sojka, P., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2006. Lecture Notes in Computer Science(), vol 4188. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11846406_71

Download citation

  • DOI: https://doi.org/10.1007/11846406_71

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-39090-9

  • Online ISBN: 978-3-540-39091-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics