Abstract
Mental disorders present one of the leading causes of worldwide disability and have become a major social concern, as the symptoms behind mental disorders are almost hidden. Most of the conventional approaches used for diagnosing and identifying mental disorders rely on oral conversations (through interviews) having a limited focus on write-ups. Therefore, in this study, we attempt to explore identifying different types of mental disorders among people through their write-ups. To do so, we collect a total of 6893 posts and discussions that appeared in different problem-specific Internet forums and utilize them to identify different types of mental disorders. Leveraging appropriate machine learning algorithms over the collected write-ups, our study can categorize Depression, Schizophrenia, Suicidal Intention, Anxiety, Post Traumatic Stress Disorder (PTSD), Borderline Personality Disorder (BPD), and Eating Disorder (ED). To achieve a balanced dataset in the process of our study, we apply a combined sampling approach and achieve up to 89% accuracy in the identification task. We perform varied exploration tasks in our study covering 5-fold cross-validation, 5-times repetition on the used dataset, etc. We explain our findings in terms of precision, recall, specificity, and Matthews correlation coefficient to demonstrate the capability of our proposed approach in identifying mental disorders based on write-ups.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
World Health Organization. https://www.who.int/news-room/fact-sheets/detail/mental-disorders. Accessed 10 July 2022
Mental health: lessons learned in 2020 for 2021 and forward. https://blogs.worldbank.org/health/mental-health-lessons-learned-2020-2021-and-forward. Accessed 10 July 2022
Recognizing Suicidal Behavior. https://www.webmd.com/mental-health/recognizing-suicidal-behavior. Accessed 10 July 2022
How heavy use of social media is linked to mental illness. https://www.economist.com/graphic-detail/2018/05/18/how-heavy-use-of-social-media-is-linked-to-mental-illness. Accessed 10 July 2022
Kernot, D., Bossomaier, T., Bradbury, R.: The stylometric impacts of ageing and life events on identity. J. Quant. Linguist. 26, 1–21 (2017)
Al-Mosaiwi, M., Johnstone, T.: In an absolute state: elevated use of absolutist words is a marker specific to anxiety, depression, and suicidal ideation. Clin. Psychol. Sci. 6, 529–542 (2018)
Jockers, M.L., Witten, D.M.: A comparative study of machine-learning methods for authorship attribution. Literary Linguist. Comput. 25(2), 215–223 (2010)
Halvani, O., Winter, C., Pflug, A.: Authorship verification for different languages, genres and topics. Digit. Investig. 16, 33–43 (2016)
Mir, E., Novas, C., Seymour, M.: Social Media and Adolescents’ and Young Adults’ Mental Health. National Center for Health Research. http://www.center4research.org/social-media-affects-mental-health/. Accessed 10 July 2022
Roffo, G., Cristani, M., Bazzani, L., Minh, H.Q., Murino, V.: Trusting skype: learning the way people chat for fast user recognition and verification. In: IEEE International Conference on Computer Vision Workshops (ICCVW), Sydney, Australia, pp. 748–754 (2013)
Brocardo, M.L., Traore, I.: Continuous authentication using micro-messages. In: 12th Annual International Conference on Privacy, Security, and Trust, Toronto, Canada (2014)
Kernot, D.: Can three pronouns discriminate identity in writing? In: Sarker, R., Abbass, H.A., Dunstall, S., Kilby, P., Davis, R., Young, L. (eds.) Data and Decision Sciences in Action. LNMIE, pp. 397–411. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-55914-8_29
López-Escobedo, F., Méndez-Cruz, C.F., Sierra, G., Solórzano-Soto, J.: Analysis of stylometric variables in long and short texts. In: International Conference on Corpus Linguistics (CILC 2013), pp. 604–611 (2013)
Burnap, P., Colombo, G., Amery, R., Hodorog, A., Scourfield, J.: Multi-class machine classification of suicide-related communication on Twitter. Online Soc. Netw. Media 2, 32–44 (2017)
Argamon, S., Koppel, M., Avneri, G.: Routing documents according to style. In: Proceedings of the 1st International Workshop on Innovative Information (1998)
Baayen, H., Halteren, H.V., Tweedie, F.: Outside the cave of shadows: using syntactic an-notation to enhance authorship attribution. Literary Linguist. Comput. 2, 110–120 (1996)
Hayne, S.C., Pollard, C.E., Rice, R.E.: Identification of comment authorship in anonymous group support systems. J. Manag. Inf. Syst. 20, 301–329 (2003)
Koppel, M., Schler, J.: Exploiting stylistic idiosyncrasies for authorship attribution. In: Proceedings of the IJCAI Workshop on Computational Approaches to Style Analysis and Synthesis, pp. 69–72 (2003)
Abbasi, A., Chen, H.: Writeprints: a stylometric approach to identity-level identification and similarity detection in cyberspace. ACM Trans. Inf. Syst. 26(7), 1–29 (2008)
Panicheva, P., Ledovaya, Y., Bogolyubova, O.: Morphological and semantic correlates of the dark triad personality traits in Russian Fackbook tests. In: Proceedings of the IEEE Artificial Intelligence and Natural Language Conference (AINL) Fruct Conference, Saint-Petersburg, Russia (2016)
Thieme, A., Belgrave, D., Doherty, G.: Machine learning in mental health: a systematic review of HCI literature to support the development of effective and implementable ML systems. ACM Trans. Comput.-Hum. Interact. 27(5), 1–53 (2020)
Calvo, R.A., Milne, D.N., Hussain, S., Christensen, H.: Natural language processing in mental health application using non-clinical tests. Nat. Lang. Eng. 23(5), 649–685 (2017)
Kaur, P., Sharma, M.: Diagnosis of human psychological disorders using supervised learning and nature-inspired counting techniques: a meta-analysis. J. Med. Syst. 43(7), 1–30 (2019)
Kim, J., Lee, D., Park, E.: Machine learning for mental health in social media: biblio-metric study. J. Med. Internet Res. 23(3), e24870 (2021)
Low, D.M., Rumker, L., Talkar, T., Torous, J., Cecchi, G., Ghosh, S.: Natural language processing reveals vulnerable mental health support groups and heightened health anxiety on reddit during COVID-19: an observational study. J. Med. Internet Res. 22(10), e22635 (2020)
Nobles, A.L., Glenn, J.J., Kowsari, K., Teachman, B.A., Barnes, L.E.: Identification of imminent suicide risk among young adults using text messages. In: Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems, pp. 413–435 (2018). https://doi.org/10.1145/3173574.3173987
Du, J., et al.: Extracting psychiatric stressors for suicide from social media using deep learning. BMC Med. Inform. Decis. Making 18(43), 77–87 (2018)
Havigerová, J.M., Haviger, J., Kučera, D., Hoffmannová, P.: Text-based detection of the risk of depression. Front. Psychol. 10, 513 (2019)
Roy, A., Nikolitch, K., McGinn, R., Jinah, S., Klement, W., Kaminsky, J.A.: A machine learning approach predicts future risk to suicidal ideation from social media data. NPJ Digit. Med. 3(1), 78 (2020)
Supplemental Material (2022). https://doi.org/10.6084/m9.figshare.4743547.v1. Accessed 10 July 2022
More Than 2000 of The Most Common Text Abbreviations. https://dexatel.com/blog/text-abbreviations/. Accessed 10 July 2022
The Complete List of 1697 Common Text Abbreviations & Acronyms. https://www.webopedia.com/reference/text-abbreviations/. Accessed 10 July 2022
The Free Dictionary by Farlex. https://www.thefreedictionary.com/List-of-pronouns.html. Accessed 10 July 2022
Fernández, A., García, S., Galar, M., Prati, R.C., Krawczyk, B., Herrera, F.: Imbalanced classification for big data. In: Fernández, A., García, S., Galar, M., Prati, R.C., Krawczyk, B., Herrera, F. (eds.) Learning from Imbalanced Data Sets, pp. 327–349. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-98074-4_13
Shamsudin, H., Yusof, U.K., Jayalakshmi, A., Khalid, M.N.A.: Combining oversampling and undersampling techniques for imbalanced classification: a comparative study using credit card fraudulent transaction dataset. In: 6th International Conference on Control & Automation (ICCA), pp. 803–808 (2020)
Precision-Recall. https://scikit-learn.org/stable/auto_examples/model_selection/-plot_precision_recall. Accessed 10 July 2022
Acknowledgement
The work has been conducted at and supported by the Bangladesh University of Engineering and Technology (BUET), Dhaka, Bangladesh.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Nipa, T.H., Al Islam, A.B.M.A. (2023). Revealing Mental Disorders Through Stylometric Features in Write-Ups. In: Longfei, S., Bodhi, P. (eds) Mobile and Ubiquitous Systems: Computing, Networking and Services. MobiQuitous 2022. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 492. Springer, Cham. https://doi.org/10.1007/978-3-031-34776-4_14
Download citation
DOI: https://doi.org/10.1007/978-3-031-34776-4_14
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-34775-7
Online ISBN: 978-3-031-34776-4
eBook Packages: Computer ScienceComputer Science (R0)