Modeling Latent Discriminative Dynamic of Multi-dimensional Affective Signals

Ramirez, Geovany A.; Baltrušaitis, Tadas; Morency, Louis-Philippe

doi:10.1007/978-3-642-24571-8_51

Modeling Latent Discriminative Dynamic of Multi-dimensional Affective Signals

Geovany A. Ramirez¹⁹,
Tadas Baltrušaitis²⁰ &
Louis-Philippe Morency²¹

Conference paper

4584 Accesses
43 Citations

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 6975))

Abstract

During face-to-face communication, people continuously exchange para-linguistic information such as their emotional state through facial expressions, posture shifts, gaze patterns and prosody. These affective signals are subtle and complex. In this paper, we propose to explicitly model the interaction between the high level perceptual features using Latent-Dynamic Conditional Random Fields. This approach has the advantage of explicitly learning the sub-structure of the affective signals as well as the extrinsic dynamic between emotional labels. We evaluate our approach on the Audio-Visual Emotion Challenge (AVEC 2011) dataset. By using visual features easily computable using off-the-shelf sensing software (vertical and horizontal eye gaze, head tilt and smile intensity), we show that our approach based on LDCRF model outperforms previously published baselines for all four affective dimensions. By integrating audio features, our approach also outperforms the audio-visual baseline.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Argyle, M., Dean, J.: Eye-contact, distance and affiliation. Sociometry 28, 233–304 (1965)
Article Google Scholar
Bavelas, J.B., Coates, L., Johnson, T.: Listeners as co-narrators. Journal of Personality and Social Psychology 79(6), 941–952 (2000)
Article Google Scholar
Blitzer, J., McDonald, R., Pereira, F.: Domain Adaptation with Structural Correspondence Learning. In: EMNLP, pp. 120–128 (2006)
Google Scholar
Ekman, P.: An argument for basic emotions. Cognition & Emotion 6(3), 169–200 (1992)
Article Google Scholar
Eyben, F., Wollmer, M., Valstar, M., Gunes, H., Schuller, B., Pantic, M.: String-based audiovisual fusion of behavioural events for the assessment of dimensional affect. In: IEEE FG 2011 (2011)
Google Scholar
Fontaine, J.R., Scherer, K.R., Roesch, E.B., Ellsworth, P.: The world of emotion is not two-dimensional. Psychological Science 18, 1050–1057 (2007)
Article Google Scholar
Gunes, H., Pantic, M.: Automatic, dimensional and continuous emotion recognition. Int’l Journal of Synthetic Emotion 1(1), 68–99 (2010)
Article Google Scholar
Hall, M.: Correlation-based Feature Selection for Machine Learning. Ph.D. thesis, University of Waikato (1999)
Google Scholar
Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.H.: The weka data mining software: an update. SIGKDD Explor. Newsl. 11, 10–18 (2009)
Article Google Scholar
HCRF: library for crf and ldcrf, http://sourceforge.net/projects/hcrf/
Krämer, N.C.: Nonverbal Communication. In: Human Behavior in Military Contexts, pp. 150–188. The National Academies Press, Washington (2008)
Google Scholar
Lafferty, J., McCallum, A., Pereira, F.: Conditional random fields: probabilistic models for segmenting and labelling sequence data. In: ICML 2001 (2001)
Google Scholar
Morency, L.P., Quattoni, A., Darrell, T.: Latent-dynamic discriminative models for continuous gesture recognition. In: CVPR 2007 (2007)
Google Scholar
Nicolaou, M., Gunes, H., Pantic, M.: Audio-visual classification and fusion of spontaneous affective data in likelihood space. In: ICPR 2010 (2010)
Google Scholar
Nicolaou, M., Gunes, H., Pantic, M.: Output-associative rvm regression for dimensional and continuous emotion prediction. In: IEEE FG 2011 (2011)
Google Scholar
Ojala, T., Pietikainen, M., Maenpaa, T.: Multiresolution gray-scale and rotation invariant texture classification with local binary patterns. TPAMI 24(7) (2002)
Google Scholar
OKAO: Software, http://www.omron.com/r_d/coretech/vision/okao.html
Pantic, M., Rothkrantz, L.: Toward an affect-sensitive multimodal human-computer interaction. Proceedings of the IEEE 91(9), 1370–1390 (2003)
Article Google Scholar
Quinlan, J.R.: C4.5: programs for machine learning. Morgan Kaufmann Publishers Inc., San Francisco (1993)
Google Scholar
Rozin, P., Cohen, A.B.: High frequency of facial expressions corresponding to confusion, concentration, and worry in an analysis of naturally occurring facial expressions of Americans. Emotion 3(1), 68–75 (2003)
Article Google Scholar
Schuller, B., Valstar, M., Eyben, F., McKeown, G., Cowie, R., Pantic, M.: Avec 2011– the first international audio/visual emotion challenge. In: D´Mello, S., et al. (eds.) ACII 2011, Part II. LNCS, vol. 6975, pp. 415–424. Springer, Heidelberg (2011)
Google Scholar
Wöllmer, M., Eyben, F., Reiter, S., Schuller, B., Cox, C., Douglas-Cowie, E., Cowie, R.: Abandoning emotion classes - towards continuous emotion recognition with modelling of long-range dependencies. In: INTERSPEECH. ISCA (2008)
Google Scholar
Zeng, Z., Pantic, M., Roisman, G.I., Huang, T.S.: A survey of affect recognition methods: Audio, visual, and spontaneous expressions. TPAMI 31(1) (2009)
Google Scholar

Download references

Author information

Authors and Affiliations

Computer Science Department, University of Texas at El Paso, USA
Geovany A. Ramirez
Computer Laboratory, University of Cambridge, United Kingdom
Tadas Baltrušaitis
Institute for Creative Technologies, University of Southern California, USA
Louis-Philippe Morency

Authors

Geovany A. Ramirez
View author publications
You can also search for this author in PubMed Google Scholar
Tadas Baltrušaitis
View author publications
You can also search for this author in PubMed Google Scholar
Louis-Philippe Morency
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

University of Memphis, 202 Psychology Building, 38152, Memphis, TN, USA
Sidney D’Mello & Arthur Graesser &
Technische Universität München, Arcisstraße 21, 80333, München, Germany
Björn Schuller
Laboratoire d’Informatique pour la Mécanique et les Sciences de l’Ingénieur (LIMSI-CNRS), Bâtiment 508, 91403, Orsay Cedex, France
Jean-Claude Martin

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ramirez, G.A., Baltrušaitis, T., Morency, LP. (2011). Modeling Latent Discriminative Dynamic of Multi-dimensional Affective Signals. In: D’Mello, S., Graesser, A., Schuller, B., Martin, JC. (eds) Affective Computing and Intelligent Interaction. ACII 2011. Lecture Notes in Computer Science, vol 6975. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-24571-8_51

Download citation

DOI: https://doi.org/10.1007/978-3-642-24571-8_51
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-24570-1
Online ISBN: 978-3-642-24571-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics