Actor level emotion magnitude prediction in text and speech

Calix, Ricardo A.; Knapp, Gerald M.

doi:10.1007/s11042-011-0909-8

Actor level emotion magnitude prediction in text and speech

Published: 29 October 2011

Volume 62, pages 319–332, (2013)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

Ricardo A. Calix¹ &
Gerald M. Knapp¹

400 Accesses
4 Citations
Explore all metrics

Abstract

The digital universe is expanding at very high rates. New ways of retrieving and enriching text and audio content are required. In this work, a methodology for actor level emotion magnitude prediction in text and speech is proposed. A model is trained to predict emotion magnitudes per actor at any point in a story using previous emotion magnitudes plus current text and speech features which act on the actor’s emotional state. The methodology compares linear and non-linear regression techniques to determine the optimal model that fits the data. Results of the analysis show that non-linear regression models based on Support Vector Regression (SVR) using a Radial Basis Function (RBF) kernel provide the most accurate prediction model. An analysis of the contribution of the features for emotion magnitude prediction is performed.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

Akenine-Moller T, Haines E, Hoffman N (2008) Real-Time Rendering. A K Peters, Wellesley, Massachusetts
Book Google Scholar
Alm CO, Roth D, Sproat R (2005) Emotions from text: machine learning for text-based emotion prediction. In Proceedings of the Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing, pp. 579–586
Alm CO (2011) Affect data. http://lrc.cornell.edu/swedish/dataset/affectdata/index.html. Accessed 30 March 2011
Alm CO (2008) Affect in Text and Speech. Dissertation, University of Illinois at Urbana-Champaign
Bird S, Klein E, Loper E (2009) Natural Language Processing with Python. 1st ed., O’Reilly Media
Boersma P, Weenink D (2011) Praat: doing phonetics by computer. Version 5.2.21. http://www.praat.org/. Accessed 30 March 2011
Burns B, Morrison C (2003) Temporal Abstraction in Bayesian Networks. In Working Notes of Association for the Advancement of Artificial Intelligence (AAAI), Spring Symposium Workshop: Foundation and Applications of Spatio-Temporal Reasoning, AAAI,Technical Report SS-03-03, 2003
Busso C, Lee S, Narayanan S (2009) Analysis of emotionally salient aspects of fundamental frequency for emotion detection. IEEE Transactions on Audio, Speech, and Language Processing Vol. 17, No. 4
Calix RA, Mallepudi S, Chen B, Knapp GM (2010) Emotion recognition in text for 3D facial expression rendering. IEEE Transactions on Multimedia, Special Issue on Multimodal Affective Interaction 12(6):544–551
Article Google Scholar
Calix RA, Knapp GM (2011) Affect Corpus 2.0: An extension of a corpus for actor level emotion magnitude detection. In Proceedings of the 2nd ACM Multimedia Systems (MMSys) conference, Feb. 2011, San Jose, California, U.S.A.
Chang CC, Lin C (2001) LIBSVM: a library for support vector machines. http://www.csie.ntu.edu.tw/~cjlin/libsvm. Accessed 30 March 2011
El-Nasr M, Loerger T, Yen J (1999) PETEEI: A pet with evolving emotional intelligence. Proceedings of the third annual conference on autonomous agents, Seattle, Washington, USA, pp. 9–15
Gantz J, Reinsel D (2010) The digital universe decade—Are you ready? IDC Report. http://www.emc.com/collateral/demos/microsites/idc-digital-universe/iview.htm. Accessed 30 March 2011
Grimm M, Kroschel K, Narayanan S (2007) Support Vector Regression for automatic recognition of spontaneous emotions in speech. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2007), Vol. 4, pp. IV-1085-IV-1088
Jurafsky D, Martin J (2008) Speech and Language Processing, 2nd edn. Prentice Hall, New Jersey
Google Scholar
Lu CY, Hong J, Cruz-Lara S (2006) Emotion detection in textual information by semantic role labeling and web mining techniques. Third Taiwanese-French Conference on Information Technology—TFIT
Luengo I, Navas E, Hernaez I, (2010) Feature analysis and evaluation for automatic emotion identification in speech. IEEE Transactions on Multimedia, Vol. 12, No. 6
Mao Y, Lebanon G (2006) Sequential models for sentiment prediction. In Proceedings of the International Conference on Machine Learning (ICML), Workshop on Learning in Structured Output Spaces, Pittsburg, PA
Moilanen K, Pulman S (2007) Sentiment Composition. In Proceedings of Recent Advances in Natural Language Processing (RANLP 2007), September 27–29, Borovets, Bulgaria, pp. 378–382
Neviarouskaya A, Prendinger H, Ishizuka M (2009) Semantically distinct verb classes involved in sentiment analysis. In Proceedings IADIS international conference on applied computing, AC 1:27–35
Google Scholar
Nuance (2011) Naturally speaking software. http://www.nuance.com/dragon/index.htm. Accessed 30 March 2011
Pang B, Lee L (2008) Opinion mining and sentiment analysis. Foundations and Trends in Information Retrieval 2(1–2):1–135
Article Google Scholar
Smola A, Scholkopf B (2004) A tutorial on Support Vector Regression. Stat Comput 14:199–222
Article MathSciNet Google Scholar
Tokuhisa R, Inui K, Matsumoto Y (2008) Emotion classification using massive examples extracted from the web. In Proceedings of the 22nd International Conference on Computational Linguistics (Coling 2008), Manchester, pp. 881–888
Witten I, Frank E (2005) Data Mining: Practical Machine Learning Tools and Techniques, 2 dth edn. Morgan Kaufmann Publishers Inc., San Francisco
MATH Google Scholar

Download references

Author information

Authors and Affiliations

College of Engineering, Louisiana State University, Baton Rouge, LA, USA
Ricardo A. Calix & Gerald M. Knapp

Authors

Ricardo A. Calix
View author publications
You can also search for this author in PubMed Google Scholar
Gerald M. Knapp
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Gerald M. Knapp.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Calix, R.A., Knapp, G.M. Actor level emotion magnitude prediction in text and speech. Multimed Tools Appl 62, 319–332 (2013). https://doi.org/10.1007/s11042-011-0909-8

Download citation

Published: 29 October 2011
Issue Date: January 2013
DOI: https://doi.org/10.1007/s11042-011-0909-8

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Actor level emotion magnitude prediction in text and speech

Abstract

Access this article

Similar content being viewed by others

A Longitudinal Study of the Emotional Content in Indian Political Speeches

Relevance units machine based dimensional and continuous speech emotion prediction

Emotion Recognition and Sentiment Analysis of Extemporaneous Speech Transcriptions in Russian

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Actor level emotion magnitude prediction in text and speech

Abstract

Access this article

Similar content being viewed by others

A Longitudinal Study of the Emotional Content in Indian Political Speeches

Relevance units machine based dimensional and continuous speech emotion prediction

Emotion Recognition and Sentiment Analysis of Extemporaneous Speech Transcriptions in Russian

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation