Abstract
Despite significant evidence linking mental health to almost every major development issue, individuals with mental disorders are among those most at risk of being excluded from development programs. We outline a novel task of detection of Cognitive Distortion and Emotion Cause extraction of associated emotions in conversations. Cognitive distortions are inaccurate thought patterns, beliefs, or perceptions that contribute to negative thinking, which subsequently elevates the chances of several mental illnesses. This work introduces a novel multi-modal mental health conversational corpus manually annotated with emotion, emotion causes, and the presence of cognitive distortion at the utterance level. We propose a multitasking framework that uses multi-modal information as inputs and uses both external commonsense knowledge and factual knowledge from the dataset to learn both tasks at the same time. This is because commonsense knowledge is a key part of understanding how and why emotions are implied. We achieve commendable performance gains on the cognitive distortion detection task (+3.91 F1%) and the emotion cause extraction task (+3 ROS points) when compared to the existing state-of-the-art model.
G. V. Singh and S. Ghosh—These authors contributed equally to this work and are joint first authors.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
- 2.
- 3.
mental health, psychiatric interview, psychotic, paranoia, hallucination, etc.
- 4.
Links to some sample videos: https://www.youtube.com/watch?v=P7qMfG-yNfA
- 5.
2 Ph.D. linguistics degree holders and 1 undergraduate student from the computer science discipline.
- 6.
- 7.
- 8.
- 9.
References
Aviezer, H., Trope, Y., Todorov, A.: Body cues, not facial expressions, discriminate between intense positive and negative emotions. Science 338(6111), 1225–1229 (2012)
Chen, S., Guo, X., Wu, T., Ju, X.: Exploring the online doctor-patient interaction on patient satisfaction based on text mining and empirical analysis. Inf. Process. Manage. 57(5), 102253 (2020)
Cheng, Z., Jiang, Z., Yin, Y., Yu, H., Gu, Q.: A symmetric local search network for emotion-cause pair extraction. In: Proceedings of the 28th International Conference on Computational Linguistics, pp. 139–149. International Committee on Computational Linguistics, Barcelona, Spain (Online), December 2020. https://doi.org/10.18653/v1/2020.coling-main.12, https://aclanthology.org/2020.coling-main.12
David, B., Burns, M.: Feeling good-the new mood therapy. NY: Signet Books. Chin, Richard. (1995) p. 3 (1980)
Deng, J., Guo, J., Xue, N., Zafeiriou, S.: Arcface: additive angular margin loss for deep face recognition. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 4690–4699 (2019)
Devlin, J., Chang, M., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2019, Minneapolis, MN, USA, 2–7 June 2019, Volume 1 (Long and Short Papers), pp. 4171–4186. Association for Computational Linguistics (2019). https://doi.org/10.18653/v1/n19-1423
Dheeraj, K., Ramakrishnudu, T.: Negative emotions detection on online mental-health related patients texts using the deep learning with MHA-BCNN model. Expert Syst. Appl. 182, 115265 (2021)
Dozois, D.J., Beck, A.T.: Cognitive schemas, beliefs and assumptions. Risk factors in Depression, pp. 119–143 (2008)
Ekman, P.: An argument for basic emotions. Cogn. Emot. 6(3–4), 169–200 (1992)
Eyben, F., Wöllmer, M., Schuller, B.W.: Opensmile: the munich versatile and fast open-source audio feature extractor. In: Proceedings of the 18th International Conference on Multimedia 2010, Firenze, Italy, October 25–29, 2010, pp. 1459–1462. ACM (2010). https://doi.org/10.1145/1873951.1874246
Ghosh, S., Ekbal, A., Bhattacharyya, P.: A multitask framework to detect depression, sentiment and multi-label emotion from suicide notes. Cogn. Comput. 14(1), 110–129 (2021). https://doi.org/10.1007/s12559-021-09828-7
Ghosh, S., Roy, S., Ekbal, A., Bhattacharyya, P.: CARES: CAuse recognition for emotion in suicide notes. In: Hagen, M., et al. (eds.) ECIR 2022. LNCS, vol. 13186, pp. 128–136. Springer, Cham (2022). https://doi.org/10.1007/978-3-030-99739-7_15
Gilpin, L.H., Bau, D., Yuan, B.Z., Bajwa, A., Specter, M., Kagal, L.: Explaining explanations: An overview of interpretability of machine learning. In: 2018 IEEE 5th International Conference on data science and advanced analytics (DSAA), pp. 80–89. IEEE (2018)
Gui, L., Wu, D., Xu, R., Lu, Q., Zhou, Y.: Event-driven emotion cause extraction with corpus construction. In: Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, pp. 1639–1649 (2016)
Gui, L., Wu, D., Xu, R., Lu, Q., Zhou, Y.: Event-driven emotion cause extraction with corpus construction. In: Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, Austin, Texas, pp. 1639–1649. Association for Computational Linguistics, November 2016. https://doi.org/10.18653/v1/D16-1170, https://aclanthology.org/D16-1170
Hammen, C.L., Shih, J.: Depression and interpersonal processes (2014)
Hara, K., Kataoka, H., Satoh, Y.: Can spatiotemporal 3D CNNs retrace the history of 2d CNNs and imagenet? In: 2018 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2018, Salt Lake City, UT, USA, 18–22 June 2018, pp. 6546–6555. Computer Vision Foundation/IEEE Computer Society (2018). https://doi.org/10.1109/CVPR.2018.00685, http://openaccess.thecvf.com/content_cvpr_2018/html/Hara_Can_Spatiotemporal_3D_CVPR_2018_paper.html
Hazarika, D., Poria, S., Zadeh, A., Cambria, E., Morency, L., Zimmermann, R.: Conversational memory network for emotion recognition in dyadic dialogue videos. In: Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2018, New Orleans, Louisiana, USA, 1–6 June 2018, Volume 1 (Long Papers), pp. 2122–2132. Association for Computational Linguistics (2018). https://doi.org/10.18653/v1/n18-1193
Joshi, M., Chen, D., Liu, Y., Weld, D.S., Zettlemoyer, L., Levy, O.: SpanBERT: improving pre-training by representing and predicting spans. Trans. Assoc. Comput. Linguist. 8, 64–77 (2020). https://doi.org/10.1162/tacl_a_00300, https://aclanthology.org/2020.tacl-1.5
Lee, S.Y.M., Chen, Y., Huang, C.R.: A text-driven rule-based system for emotion cause detection. In: Proceedings of the NAACL HLT 2010 Workshop on Computational Approaches to Analysis and Generation of Emotion in Text, pp. 45–53 (2010)
Lee-Thorp, J., Ainslie, J., Eckstein, I., Ontanon, S.: FNet: mixing tokens with fourier transforms. arXiv preprint arXiv:2105.03824 (2021)
Li, X., Song, K., Feng, S., Wang, D., Zhang, Y.: A co-attention neural network model for emotion cause analysis with emotional context awareness. In: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pp. 4752–4757. Association for Computational Linguistics, Brussels, Belgium, October–November 2018. https://doi.org/10.18653/v1/D18-1506, https://aclanthology.org/D18-1506
Liu, Y., et al.: Roberta: a robustly optimized BERT pretraining approach. CoRR abs/1907.11692 (2019). http://arxiv.org/abs/1907.11692
Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., Dean, J.: Distributed representations of words and phrases and their compositionality. In: Advances in Neural Information Processing Systems, vol. 26 (2013)
Organization, W.H., et al.: Depression: a global crisis. World Mental Health Day, October 10 2012. World Federation for Mental Health, Occoquan, VA, USA (2012)
Pantic, M., Sebe, N., Cohn, J.F., Huang, T.S.: Affective multimodal human-computer interaction. In: Proceedings of the 13th ACM International Conference on Multimedia, Singapore, 6–11 November 2005, pp. 669–676. ACM (2005). https://doi.org/10.1145/1101149.1101299
Peng, Y., Chen, Q., Lu, Z.: An empirical study of multi-task learning on BERT for biomedical text mining. In: Proceedings of the 19th SIGBioMed Workshop on Biomedical Language Processing, BioNLP 2020, Online, 9 July 2020, pp. 205–214. Association for Computational Linguistics (2020). https://doi.org/10.18653/v1/2020.bionlp-1.22, https://doi.org/10.18653/v1/2020.bionlp-1.22
Poria, S., et al.: Recognizing emotion cause in conversations. Cogn. Comput. 13(5), 1317–1332 (2021)
Singh, A., Hingane, S., Wani, S., Modi, A.: An end-to-end network for emotion-cause pair extraction. In: Proceedings of the Eleventh Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, pp. 84–91 (2021)
Spitzer, R.L., Cohen, J., Fleiss, J.L., Endicott, J.: Quantification of agreement in psychiatric diagnosis: a new approach. Arch. Gen. Psychiatry 17(1), 83–87 (1967)
Uban, A.S., Chulvi, B., Rosso, P.: Understanding patterns of anorexia manifestations in social media data with deep learning. In: Proceedings of the Seventh Workshop on Computational Linguistics and Clinical Psychology: Improving Access, pp. 224–236. Association for Computational Linguistics, Online, June 2021. https://doi.org/10.18653/v1/2021.clpsych-1.24, https://aclanthology.org/2021.clpsych-1.24
Vaswani, A., et al.: Attention is all you need. In: Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017(December), pp. 4–9, 2017. Long Beach, CA, USA, pp. 5998–6008 (2017). https://proceedings.neurips.cc/paper/2017/hash/3f5ee243547dee91fbd053c1c4a845aa-Abstract.html
Yüksel, A., Bahadir-Yilmaz, E.: Relationship between depression, anxiety, cognitive distortions, and psychological well-being among nursing students. Perspect. Psychiatr. Care 55(4), 690–696 (2019)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Ethics declarations
Ethical Consideration
This study has been evaluated and approved by our Institutional Review Board (IRB). The videos used to create the dataset for this study do not have any copyright clauses attached to them. Furthermore, the videos are shared via various channels for the main purpose of facilitating research and educational purposes.
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Singh, G.V., Ghosh, S., Ekbal, A., Bhattacharyya, P. (2023). DeCoDE: Detection of Cognitive Distortion and Emotion Cause Extraction in Clinical Conversations. In: Kamps, J., et al. Advances in Information Retrieval. ECIR 2023. Lecture Notes in Computer Science, vol 13981. Springer, Cham. https://doi.org/10.1007/978-3-031-28238-6_11
Download citation
DOI: https://doi.org/10.1007/978-3-031-28238-6_11
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-28237-9
Online ISBN: 978-3-031-28238-6
eBook Packages: Computer ScienceComputer Science (R0)