Abstract
Emotion Recognition in Conversation (ERC) has become one of the most explored topics in Speech Technology. Unlike Emotion Recognition in a single utterance, context plays an important role in Emotion Recognition in Conversation. In literature complex architectures are proposed for ERC which makes the developement of real-time applications difficult. In this paper, we propose a simple architecture called Varta Rasa (in Sanskrit, Varta means conversation and Rasa means emotions) for ERC, which is a stacked ensemble model. Extensive experiments on the IEMOCAP dataset generated results comparable to state-of-the-art models while providing significantly lower complexity.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Busso, C., et al.: Iemocap: interactive emotional dyadic motion capture database. Lang. Resour. Eval. 42, 335–359 (2008). https://doi.org/10.1007/s10579-008-9076-6
Deepanway, G., Navonil, M., Soujanya, P., Niyati, C., Alexander, G.: DialogueGCN: a graph convolutional neural network for emotion recognition in conversation. arXiv preprint arXiv:1908.11540 (2019)
Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018)
Hazarika, D., Poria, S., Mihalcea, R., Cambria, E., Zimmermann, R.: Icon: interactive conversational memory network for multimodal emotion detection. In: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pp. 2594–2604 (2018)
Ke, G., et al.: LightGBM: a highly efficient gradient boosting decision tree. Adv. Neural. Inf. Process. Syst. 30, 3146–3154 (2017)
Li, W., Shao, W., Ji, S., Cambria, E.: BiERU: bidirectional emotional recurrent unit for conversational sentiment analysis. arXiv preprint arXiv:2006.00492 (2020)
Majumder, N., Poria, S., Hazarika, D., Mihalcea, R., Gelbukh, A., Cambria, E.: DialogueRNN: an attentive RNN for emotion detection in conversations. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 33 (2019)
Mao, Y., et al.: DialogueTRM: exploring the intra-and inter-modal emotional behaviors in the conversation. arXiv preprint arXiv:2010.07637 (2020)
Pennington, J., Socher, R., Manning, C.D.: GloVe: global vectors for word representation. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 1532–1543 (2014)
Poria, S., Cambria, E., Hazarika, D., Majumder, N., Zadeh, A., Morency, L.P.: Context-dependent sentiment analysis in user-generated videos. In: Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (volume 1: Long papers), pp. 873–883 (2017)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Ethics declarations
Disclaimer
This content is provided for general information purposes and is not intended to be used in place of consultation with our professional advisors. The Varta Rasa is the property of Accenture and its affiliates and Accenture be the holder of the copyright or any intellectual property over it. No part of this paper may be reproduced in any manner without the written permission of Accenture. Opinions expressed herein are subject to change without notice.
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Parida, R. et al. (2023). Varta Rasa - A Simple and Accurate System for Emotion Recognition in Conversations. In: Molla, A.R., Sharma, G., Kumar, P., Rawat, S. (eds) Distributed Computing and Intelligent Technology. ICDCIT 2023. Lecture Notes in Computer Science, vol 13776. Springer, Cham. https://doi.org/10.1007/978-3-031-24848-1_23
Download citation
DOI: https://doi.org/10.1007/978-3-031-24848-1_23
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-24847-4
Online ISBN: 978-3-031-24848-1
eBook Packages: Computer ScienceComputer Science (R0)