Characterizing Emotion in the Soundtrack of an Animated Film: Credible or Incredible?

  • Conference paper

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 4738)

Abstract

In this study we present a novel emotional speech corpus consisting of dialog extracted from an animated film. This type of corpus offers an interesting compromise between the sparsity of emotion found in spontaneous speech and the contrived emotion found in speech acted solely for research purposes. The dialog was segmented into 453 short units and judged for emotional content by native and non-native English speakers. Emotion was rated on two scales: Activation and Valence. Acoustic analysis yielded a comprehensive set of 100 features covering F0, intensity, voice quality, and spectrum. We found that Activation is more strongly correlated with our acoustic features than Valence is. Activation was correlated with several types of features, whereas Valence was correlated mainly with intensity-related features. Further, ANOVA showed some interesting contrasts between the two scales, and interesting differences between the judgments of native and non-native English speakers.
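The abstract does not say which correlation measure was used. As a rough illustration only, the sketch below assumes a Pearson correlation between one acoustic feature and the mean listener rating per unit. The function name `pearson_r` and every number in the example are hypothetical placeholders, not data or results from the paper.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sums of cross-products and squared deviations from the means.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)


# Made-up values for six speech units (NOT taken from the corpus):
# one intensity feature and mean ratings on the two scales.
mean_intensity_db = [62.0, 65.5, 70.1, 58.3, 74.2, 67.8]
activation_rating = [2.0, 3.0, 4.5, 1.5, 5.0, 3.5]
valence_rating = [3.0, 2.5, 3.5, 3.0, 2.0, 4.0]

r_activation = pearson_r(mean_intensity_db, activation_rating)
r_valence = pearson_r(mean_intensity_db, valence_rating)
print(f"intensity vs. Activation: r = {r_activation:.2f}")
print(f"intensity vs. Valence:    r = {r_valence:.2f}")
```

Repeating this for each of the 100 features and each scale would give a table of coefficients from which one could compare how strongly Activation and Valence track the acoustics, which is the kind of comparison the abstract reports.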



Editor information

Ana C. R. Paiva, Rui Prada, Rosalind W. Picard

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Amir, N., Cohen, R. (2007). Characterizing Emotion in the Soundtrack of an Animated Film: Credible or Incredible?. In: Paiva, A.C.R., Prada, R., Picard, R.W. (eds) Affective Computing and Intelligent Interaction. ACII 2007. Lecture Notes in Computer Science, vol 4738. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74889-2_14

  • DOI: https://doi.org/10.1007/978-3-540-74889-2_14

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-74888-5

  • Online ISBN: 978-3-540-74889-2

  • eBook Packages: Computer Science, Computer Science (R0)
