ABSTRACT
Depression disorder is one of the major causes of disability in the world that can lead to tragic outcomes. In this paper, we propose a method for using an approximation to a mental lexicon to model the communication process of depressed and non-depressed participants in spontaneous North American English clinical interviews. Our approach, inspired by the Lexical Availability theory, identifies the most relevant vocabulary of the interviewed participant, and use it as features in a classification process. We performed an in-depth evaluation on the DAIC-WOZ [20] and the E-DAIC [11] clinical datasets. Obtained results indicate that our approach can compete against recent contextual embeddings when modeling and identifying depression. We show the generalization capabilities of our algorithm using outside data, reaching a macro F1 = 0.83 and F1 = 0.80 in the DAIC-WOZ and E-DAIC datasets respectively. An analysis of our method’s interpretability allows understanding how the classifier is making its decisions. During this process, we observed strong connections between our obtained results and previous research from the psychological field.
- Mari Aguilera, Clara Paz, Victoria Compañ, Juan Carlos Medina, and Guillem Feixas. 2019. Cognitive rigidity in patients with depression and fibromyalgia. International Journal of Clinical and Health Psychology 19, 2(2019), 160–164.Google ScholarCross Ref
- Tuka Al Hanai, Mohammad M Ghassemi, and James R Glass. 2018. Detecting Depression with Audio/Text Sequence Modeling of Interviews.. In Interspeech. 1716–1720.Google Scholar
- Mohammed Al-Mosaiwi and Tom Johnstone. 2018. In an absolute state: Elevated use of absolutist words is a marker specific to anxiety, depression, and suicidal ideation. Clinical Psychological Science 6, 4 (2018), 529–542.Google ScholarCross Ref
- Nancy JC Andreasen and Bruce Pfohl. 1976. Linguistic analysis of speech in affective disorders. Archives of General Psychiatry 33, 11 (1976), 1361–1367.Google ScholarCross Ref
- Antonio Manuel Ávila Muñoz and José María Sánchez Sáez. 2011. La posición de los vocablos en el cálculo del índice de disponibilidad léxica: procesos de reentrada en las listas del léxico disponible de la ciudad de Málaga. (2011).Google Scholar
- Hennie Brugman and Albert Russel. 2004. Annotating Multi-media/Multi-modal Resources with ELAN. In Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC’04). European Language Resources Association (ELRA), Lisbon, Portugal.Google Scholar
- Sergio G Burdisso, Marcelo Errecalde, and Manuel Montes-y Gómez. 2019. A text classification framework for simple and effective early depression detection over social media streams. Expert Systems with Applications 133 (2019), 182–197.Google ScholarDigital Library
- Francisco J. Callealta Barroso and Diego J. Gallego Gallego. 2016. Medidas de disponibilidad léxica: comparabilidad y normalización (Measures of lexical availability: comparability and standardization). Boletín de filología 51, 1 (2016), 39–92.Google Scholar
- Munmun De Choudhury, Scott Counts, and Eric Horvitz. 2013. Social Media As a Measurement Tool of Depression in Populations. In Proceedings of the 5th Annual ACM Web Science Conference (Paris, France) (WebSci ’13). ACM, New York, NY, USA, 47–56. https://doi.org/doi.org/10.1145/2464464.2464480Google ScholarDigital Library
- Simon De Deyne, Steven Verheyen, and Gert Storms. 2016. Structure and organization of the mental lexicon: A network approach derived from syntactic dependency relations and word associations. In Towards a theoretical framework for analyzing complex linguistic networks. Springer, 47–79.Google Scholar
- David DeVault, Ron Artstein, Grace Benn, Teresa Dey, Ed Fast, Alesia Gainer, Kallirroi Georgila, Jon Gratch, Arno Hartholt, Margaux Lhommet, 2014. SimSensei Kiosk: A virtual human interviewer for healthcare decision support. In Proceedings of the 2014 international conference on Autonomous agents and multi-agent systems. 1061–1068.Google Scholar
- Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, Minneapolis, Minnesota, 4171–4186.Google Scholar
- S Pavankumar Dubagunta, Bogdan Vlasenko, and Mathew Magimai Doss. 2019. Learning Voice Source Related Information for Depression Detection. In ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 6525–6529.Google Scholar
- Thomas E Ellis and Billy Rutherford. 2008. Cognition and suicide: Two decades of progress. International Journal of Cognitive Therapy 1, 1 (2008), 47–68.Google ScholarCross Ref
- Maitena Etxebarria. 2014. Lexical variation and bilingual education in the Basque country. Spanish in Context 11, 1 (2014), 50–75.Google ScholarCross Ref
- Roberto Ferreira and Max S. Echeverría. 2010. Redes semánticas en el léxico disponible de inglés L1 e inglés LE. Onomázein (2010). https://www.redalyc.org/articulo.oa?id=134513546005Google Scholar
- Kathleen Kara Fitzpatrick, Alison Darcy, and Molly Vierhile. 2017. Delivering cognitive behavior therapy to young adults with symptoms of depression and anxiety using a fully automated conversational agent (Woebot): a randomized controlled trial. JMIR mental health 4, 2 (2017), e19.Google Scholar
- Edward A Fox and Joseph A Shaw. 1994. Combination of multiple searches. NIST special publication SP 243 (1994).Google Scholar
- Georges Gougenheim, R. Michéa, P. Rivenc, and A. Sauvageot. 1964. L’élaboration du français fondamental (1er degré): étude sur l’établissement d’un vocabulaire et d’une grammaire de base. Vol. 1. Chilton Books.Google Scholar
- Jonathan Gratch, Ron Artstein, Gale Lucas, Giota Stratou, Stefan Scherer, Angela Nazarian, Rachel Wood, Jill Boberg, David DeVault, Stacy Marsella, David Traum, Skip Rizzo, and Louis-Philippe Morency. 2014. The Distress Analysis Interview Corpus of human and computer interviews. In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC’14). European Language Resources Association (ELRA), Reykjavik, Iceland, 3123–3128.Google Scholar
- Natividad Hernández Muñoz. 2010. Social aspects of oral and lexical written production in Spanish. SKY Journal of Linguistics 23 (2010), 101–123.Google Scholar
- Natividad Hernández-Muñoz, Cristina Izura, and Carmela Tomé. 2014. Cognitive factors of lexical availability in a second language. In Lexical availability in English and Spanish as a second language. Springer, 169–186.Google Scholar
- Natividad Hernández Muñoz and Carmela Tomé Cornejo. 2017. Léxico disponible en primera y segunda lengua: bases cognitivas. In Palabras vocabulario léxico: la lexicología aplicada a la didáctica ya la diacronía. Edizioni Ca Foscari, 99–122.Google Scholar
- Becky Inkster, Shubhankar Sarda, and Vinod Subramanian. 2018. An empathy-driven, conversational artificial intelligence agent (Wysa) for digital mental well-being: real-world data evaluation mixed-methods study. JMIR mHealth and uHealth 6, 11 (2018), e12106.Google Scholar
- Rosa María Jiménez Catalán. 2013. Lexical availability in English and Spanish as a second language. Vol. 17. Springer.Google Scholar
- Kurt Kroenke, Tara W Strine, Robert L Spitzer, Janet BW Williams, Joyce T Berry, and Ali H Mokdad. 2009. The PHQ-8 as a measure of current depression in the general population. Journal of affective disorders 114, 1-3 (2009), 163–173.Google ScholarCross Ref
- Adrian Pastor López-Monroy, Fabio A. González, Manuel Montes, Hugo Jair Escalante, and Thamar Solorio. 2018. Early Text Classification Using Multi-Resolution Concept Representations. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers). Association for Computational Linguistics, New Orleans, Louisiana, 1216–1225. https://doi.org/10.18653/v1/N18-1110Google ScholarCross Ref
- David E Losada, Fabio Crestani, and Javier Parapar. 2018. Overview of eRisk: Early Risk Prediction on the Internet. In International Conference of the Cross-Language Evaluation Forum for European Languages. Springer, 343–361.Google ScholarCross Ref
- David E. Losada, Fabio Crestani, and Javier Parapar. 2020. eRisk 2020: Self-harm and Depression Challenges. In Advances in Information Retrieval, Joemon M. Jose, Emine Yilmaz, João Magalhães, Pablo Castells, Nicola Ferro, Mário J. Silva, and Flávio Martins (Eds.). Springer International Publishing, Cham, 557–563.Google Scholar
- Geral Eduardo Mateus-Ferro, Laura Marcela Castiblanco, and Pedro Augusto Álvarez Bermúdez. 2018. Mecanismos lógicos y analógicos en la producción del léxico disponible. Revista Folios (2018).Google Scholar
- Colin D Mathers and Dejan Loncar. 2006. Projections of global mortality and burden of disease from 2002 to 2030. PLoS medicine 3, 11 (2006), e442.Google Scholar
- Michelle Morales, Stefan Scherer, and Rivka Levitan. 2018. A Linguistically-Informed Fusion Approach for Multimodal Depression Detection. In Proceedings of the Fifth Workshop on Computational Linguistics and Clinical Psychology: From Keyboard to Clinic. ACL, New Orleans, LA, 13–24.Google ScholarCross Ref
- Heili Orav. 2006. Lexical Knowledge of Personality Traits. In Proceedings of the GWC 2006. 239–343.Google Scholar
- Michael J Paul and Mark Dredze. 2011. You are what you tweet: Analyzing twitter for public health. In Fifth International AAAI Conference on Weblogs and Social Media.Google Scholar
- Chris Payne. 2016. Lexical availability. English teaching professional102 (2016), 18–20.Google Scholar
- James W Pennebaker, Martha E Francis, and Roger J Booth. 2001. Linguistic inquiry and word count: LIWC 2001. Mahway: Lawrence Erlbaum Associates 71, 2001 (2001), 2001.Google Scholar
- James W Pennebaker, Matthias R Mehl, and Kate G Niederhoffer. 2003. Psychological aspects of natural language use: Our words, our selves. Annual review of psychology 54, 1 (2003), 547–577.Google Scholar
- Marco Antonio Perez Durán. 2019. Un estudio sobre los macrocentros de interés (McI). Una aportación teórica para los estudios actuales de disponibilidad léxica. Cuadernos de Lingüística de El Colegio de México 6 (2019).Google Scholar
- Marco Pérez Durán. 2020. Análisis del léxico disponible del centro de interés del insulto en estudiantes de secundaria de San Luis Potosí, México. Revista de Filología y Lingüística de la Universidad de Costa Rica 46, 1 (mar. 2020), 261–278.Google ScholarCross Ref
- Anupama Ray, Siddharth Kumar, Rutvik Reddy, Prerana Mukherjee, and Ritu Garg. 2019. Multi-Level Attention Network Using Text, Audio and Video for Depression Prediction. In Proceedings of the 9th International on Audio/Visual Emotion Challenge and Workshop (Nice, France) (AVEC ’19). Association for Computing Machinery, New York, NY, USA, 81–88.Google ScholarDigital Library
- Alex Rinaldi, Jean Fox Tree, and Snigdha Chaturvedi. 2020. Predicting Depression in Screening Interviews from Latent Categorization of Interview Prompts. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. ACL, Online, 7–18.Google ScholarCross Ref
- Fabien Ringeval, Björn Schuller, Michel Valstar, Nicholas Cummins, Roddy Cowie, Leili Tavabi, Maximilian Schmitt, Sina Alisamir, Shahin Amiriparian, Eva-Maria Messner, 2019. AVEC 2019 workshop and challenge: state-of-mind, detecting depression with AI, and cross-cultural affect recognition. In Proceedings of the 9th International on Audio/Visual Emotion Challenge and Workshop. 3–12.Google Scholar
- Stephanie Rude, Eva-Maria Gortner, and James Pennebaker. 2004. Language use of depressed and depression-vulnerable college students. Cognition & Emotion 18, 8 (2004), 1121–1133.Google ScholarCross Ref
- Gabriela Ríos González. 2008. Diferencias léxicas entre el hombre y la mujer en tres centros de interés: Saludos, Temas de conversación y Despedidas. Revista de Filología y Lingüística de la Universidad de Costa Rica 33, 1 (mar. 2008), 151–166.Google Scholar
- Chris Segrin. 1990. A meta-analytic review of social skill deficits in depression. Communications Monographs 57, 4 (1990), 292–308.Google ScholarCross Ref
- Marjana Šifrar Kalan. 2015. Lexical Availability and L2 Vocabulary Acquisition. Journal of Foreign Language Teaching and Applied Linguistics 2, 2(2015).Google ScholarCross Ref
- Urmas Sutrop. 2001. List Task and a Cognitive Salience Index. Field Methods 13, 3 (2001), 263–276. arXiv:https://doi.org/10.1177/1525822X0101300303Google ScholarCross Ref
- Allison M Tackman, David A Sbarra, Angela L Carey, M Brent Donnellan, Andrea B Horn, Nicholas S Holtzman, To’Meisha S Edwards, James W Pennebaker, and Matthias R Mehl. 2019. Depression, negative emotionality, and self-referential language: A multi-lab, multi-measure, and multi-language-task research synthesis.Journal of personality and social psychology 116, 5(2019), 817.Google Scholar
- Yla R Tausczik and James W Pennebaker. 2010. The psychological meaning of words: LIWC and computerized text analysis methods. Journal of language and social psychology 29, 1 (2010), 24–54.Google ScholarCross Ref
- John D Teasdale, Jan Scott, Richard G Moore, Hazel Hayhurst, Marie Pope, and Eugene S Paykel. 2001. How does cognitive therapy prevent relapse in residual depression? Evidence from a controlled trial.Journal of consulting and clinical psychology 69, 3(2001), 347.Google Scholar
- Ester Trigo, Manuel-Francisco Romero, and Inmaculada-Clotilde Santos-Díaz. 2019. Empirical approach from lexical availability to the influence of sociolinguistic factors on mastery of spelling / Aproximación empírica desde la disponibilidad léxica a la influencia de los factores sociolingüísticos en el dominio ortográfico. Culture and Education 31, 4 (2019), 814–844. https://doi.org/doi.org/10.1080/11356405.2019.1659007Google ScholarCross Ref
- Marcel Trotzek, Sven Koitka, and Christoph M Friedrich. 2018. Utilizing neural networks and linguistic metadata for early detection of depression indications in text sequences. IEEE Transactions on Knowledge and Data Engineering (2018).Google Scholar
- Michel Valstar, Jonathan Gratch, Björn Schuller, Fabien Ringeval, Denis Lalanne, Mercedes Torres Torres, Stefan Scherer, Giota Stratou, Roddy Cowie, and Maja Pantic. 2016. Avec 2016: Depression, mood, and emotion recognition workshop and challenge. In Proceedings of the 6th international workshop on audio/visual emotion challenge. ACM, 3–10.Google ScholarDigital Library
- Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, Łukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. In Advances in neural information processing systems. 5998–6008.Google Scholar
- Esaú Villatoro-Tello, Pavankumar Dubagunta, Julian Fritsch, Gabriela Ramírez-de-la Rosa, Petr Motlicek, and Mathew Magimai.-Doss. 2021. Late Fusion of the Available Lexicon and Raw Waveform-based Acoustic Modeling for Depression and Dementia Recognition. In Proceedings of the INTERSPEECH 2021. ISCA-International Speech Communication Association 2021.Google ScholarCross Ref
- Esaú Villatoro-Tello, Gabriela Ramírez-de-la Rosa, and Héctor Jiménez-Salazar. 2017. UAM’s Participation at CLEF eRisk 2017 task: Towards Modelling Depressed Blogers.. In CLEF (Working Notes).Google Scholar
- Charles Welch, Allison Lahnala, Veronica Perez-Rosas, Siqi Shen, Sarah Seraj, Larry An, Kenneth Resnicow, James Pennebaker, and Rada Mihalcea. 2020. Expressive Interviewing: A Conversational System for Coping with COVID-19. In Proceedings of the 1st Workshop on NLP for COVID-19 (Part 2) at EMNLP 2020.Google ScholarCross Ref
- World Health Organization. 2017. Depression and Other Common Mental Disorders Global Health Estimates. Retrieved May 11, 2021 from ht+tps://www.who.int/mental_health/management/depression/prevalence_global_health_estimates/en/Google Scholar
- Til Wykes, Jessica Lipshitz, and Stephen M Schueller. 2019. Towards the design of ethical standards related to digital mental health and all its applications. Current Treatment Options in Psychiatry 6, 3 (2019), 232–242.Google ScholarCross Ref
- Shweta Yadav, Jainish Chauhan, Joy Prakash Sain, Krishnaprasad Thirunarayan, Amit Sheth, and Jeremiah Schumm. 2020. Identifying Depressive Symptoms from Tweets: Figurative Language Enabled Multitask Learning Framework. In Proceedings of the 28th International Conference on Computational Linguistics. Barcelona, Spain (Online), 696–709.Google ScholarCross Ref
- Ziheng Zhang, Weizhe Lin, Mingyu Liu, and Marwa Mahmoud. 2020. Multimodal deep learning framework for mental disorder recognition. In 2020 15th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2020). IEEE.Google ScholarDigital Library
Recommendations
Depression detection for twitter users using sentiment analysis in English and Arabic tweets
AbstractSince depression often results in suicidal thoughts and leaves a person severely disabled daily, there is an elevated risk of premature mortality due to mental problems caused by depression. Therefore, it's crucial to identify the patient's ...
Highlights- More than two-thirds of suicides each year are caused by depression, which is the most common mental illness.
- Social media posts can be a useful tool for tracking a variety of mental health conditions, including depression.
- The ...
Harnessing emotions for depression detection
AbstractHuman emotions using textual cues, speech patterns, and facial expressions can give insight into their mental state. Although there are several uni-modal datasets for emotion recognition, there are very few labeled datasets for multi-modal ...
Multimodal Detection of Depression in Clinical Interviews
ICMI '15: Proceedings of the 2015 ACM on International Conference on Multimodal InteractionCurrent methods for depression assessment depend almost entirely on clinical interview or self-report ratings. Such measures lack systematic and efficient ways of incorporating behavioral observations that are strong indicators of psychological ...
Comments