ABSTRACT
In the current work, a conducted subjective evaluation of three basic components of a framework for applied Speech Emotion Recognition (SER) for theatrical performance and social media communication and interaction is presented. The multidisciplinary survey group used for the evaluation is consisted of participants with Theatrical and Performance Arts background, as well as Journalism and Mass Communications Studies. Initially, a publically available database of emotional speech utterances, Acted Emotional Speech Dynamic Database (AESDD) is evaluated. We examine the degree of agreement between the perceived emotion by the participants and the intended expressed emotion in the AESDD recordings. Furthermore, the participants are asked to choose between different coloured lighting of certain scenes captured on video. Correlations between the emotional content of the scenes and selected colors are observed and discussed. Finally, a prototype application for SER and multimodal speech emotion data gathering is evaluated in terms of Usefulness, Ease of Use, Ease of Learning and Satisfaction.
- El Ayadi, Moataz, Mohamed S. Kamel, and Fakhri Karray. 2011. Survey on speech emotion recognition: Features, classification schemes, and databases. Pattern Recognition 44,3 (2011), 572--587. Google ScholarDigital Library
- E.O. Bonde, E.K. Hansen, and G. Triantafyllidis. 2017. Auditory and Visual based Intelligent Lighting Design for Music Concerts. Eai Endrosed Trasactions on Creative Technologies (2017).Google Scholar
- N. Vryzas, A. Liatsou, R. Kotsakis, C. Dimoulas, and G. Kalliris. 2017. Augmenting Drama: A Speech Emotion-Controlled Stage Lighting Framework. In Proceedings of the 12th International Audio Mostly Conference on Augmented and Participatory Sound and Music Experiences. ACM. Google ScholarDigital Library
- S. Senecal, L. Cuel, A. Aristidou, and N. Magnenat-Thalmann. 2016. Continuous body emotion recognition system during theater performances. Computer Animation and Virtual Worlds, 27(3-4), 311--320. Google ScholarDigital Library
- M. Grassi, E. Cambria, A. Hussain, and F. Piazza. 2011. Sentic web: A new paradigm for managing social media affective information. Cognitive Computation 3, 3, 480--489.Google ScholarCross Ref
- P. Weerasinghe, A. Marasinghe, R. Ranaweera, S. Amarakeerthi, and M. Cohen. 2014. Emotion Expression for Affective Social Communication. International Journal of Affective Engineering 13, 4, 261--268.Google ScholarCross Ref
- C.N. Anagnostopoulos, T. Iliou, and I. Giannoukos. 2015. Features and classifiers for emotion recognition from speech: a survey from 2000 to 2011. Artificial Intelligence Review, 43, 2, 155--177. Google ScholarDigital Library
- C. Busso, M. Bulut, S. Narayanan, J. Gratch, and S. Marsella. 2013. Toward effective automatic recognition systems of emotion in speech. Social emotions in nature and artifact: emotions in human and human-computer interaction. J. Gratch and S. Marsella, Eds, 110--127.Google Scholar
- G. Trigeorgis, F. Ringeval, R. Brueckner, E. Marchi, M.A. Nicolaou, B. Schuller, and S. Zafeiriou. 2016. Adieu features? end-to-end speech emotion recognition using a deep convolutional recurrent network. In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 5200--5204.Google Scholar
- S.E. Kahou, X. Bouthillier, P. Lamblin, C. Gulcehre, V. Michalski, V., Konda, et al. 2016. Emonets: Multimodal deep learning approaches for emotion recognition in video. Journal on Multimodal User Interfaces 10, 2, 99--111.Google ScholarCross Ref
- R. Jürgens, A. Grass, M. Drolet, and J. Fischer. 2015. Effect of acting experience on emotion expression and recognition in voice: Non-actors provide better stimuli than expected. Journal of nonverbal behavior 39, 3, 195--214.Google ScholarCross Ref
- N. Vryzas, A. Liatsou, R. Kotsakis, C. Dimoulas, and G. Kalliris. Speech Emotion Recognition for Performance Interaction. Journal of the Audio Engineering Society 66, 6, 457--467.Google Scholar
- P. Jackson, and S. Haq. 2014. Surrey Audio-Visual Expressed Emotion (SAVEE) Database, University of Surrey.Google Scholar
- P.N. Juslin, and D. Västfjäll, D. 2008. Emotional responses to music: The need to consider underlying mechanisms. Behavioral and brain sciences 31,5, 559--575.Google Scholar
- R. Plutchik. 2003. Emotions and life: Perspectives from psychology, biology, and evolution. American Psychological Association.Google Scholar
- M. Hemphill. 1996. A note on adults' color--emotion associations. The Journal of genetic psychology 157,3, 275--280.Google ScholarCross Ref
- N. Kaya, and H. Epps. 2004. Relationship between color and emotion: A study of college students. College Student J 38,3, 396.Google Scholar
- A.M. Lund. 2001. Measuring Usability with the USE Questionnaire. STC Usability SIG Newsletter 8,2.Google Scholar
Index Terms
- Subjective Evaluation of a Speech Emotion Recognition Interaction Framework
Recommendations
Emotion recognition from speech: a review
Emotion recognition from speech has emerged as an important research area in the recent past. In this regard, review of existing work on emotional speech processing is useful for carrying out further research. In this paper, the recent literature on ...
Application of Emotion Recognition and Modification for Emotional Telugu Speech Recognition
AbstractMajority of the automatic speech recognition systems (ASR) are trained with neutral speech and the performance of these systems are affected due to the presence of emotional content in the speech. The recognition of these emotions in human speech ...
Comments