skip to main content
research-article

Contextual Gaps in Machine Learning for Mental Illness Prediction: The Case of Diagnostic Disclosures

Published:04 October 2023Publication History
Skip Abstract Section

Abstract

Getting training data for machine learning (ML) prediction of mental illness on social media data is labor intensive. To work around this, ML teams will extrapolate proxy signals, or alternative signs from data to evaluate illness status and create training datasets. However, these signals' validity has not been determined, whether signals align with important contextual factors, and how proxy quality impacts downstream model integrity. We use ML and qualitative methods to evaluate whether a popular proxy signal, diagnostic self-disclosure, produces a conceptually sound ML model of mental illness. Our findings identify major conceptual errors only seen through a qualitative investigation -- training data built from diagnostic disclosures encodes a narrow vision of diagnosis experiences that propagates into paradoxes in the downstream ML model. This gap is obscured by strong performance of the ML classifier (F1 = 0.91). We discuss the implications of conceptual gaps in creating training data for human-centered models, and make suggestions for improving research methods.

References

  1. Carlos Aguirre and Mark Dredze. 2021. Qualitative Analysis of Depression Models by Demographics. In Proceedings of the Seventh Workshop on Computational Linguistics and Clinical Psychology: Improving Access. 169--180.Google ScholarGoogle ScholarCross RefCross Ref
  2. Carlos Aguirre, Keith Harrigian, and Mark Dredze. 2021. Gender and Racial Fairness in Depression Research using Social Media. In Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume. 2932--2949.Google ScholarGoogle ScholarCross RefCross Ref
  3. Leah Ajmani, Stevie Chancellor, Bijal Mehta, Casey Fiesler, Michael Zimmer, and Munmun De Choudhury. 2023. A Systematic Review of Ethics Disclosures in Predictive Mental Health Research. In Forthcoming at FAccT 2023. 1--15.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Saleema Amershi, Maya Cakmak, William Bradley Knox, and Todd Kulesza. 2014. Power to the people: The role of humans in interactive machine learning. Ai Magazine, Vol. 35, 4 (2014), 105--120.Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Saleema Amershi, Max Chickering, Steven M Drucker, Bongshin Lee, Patrice Simard, and Jina Suh. 2015. Modeltracker: Redesigning performance analysis tools for machine learning. In Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems. 337--346.Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. John W Ayers, Theodore L Caputi, Camille Nebeker, and Mark Dredze. 2018. Don't quote me: reverse identification of research participants in social media studies. NPJ digital medicine, Vol. 1, 1 (2018), 1--2.Google ScholarGoogle Scholar
  7. Eric PS Baumer, David Mimno, Shion Guha, Emily Quan, and Geri K Gay. 2017. Comparing grounded theory and topic modeling: Extreme divergence or unlikely convergence? Journal of the Association for Information Science and Technology, Vol. 68, 6 (2017), 1397--1410.Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Victoria Bellotti and Keith Edwards. 2001. Intelligibility and accountability: human considerations in context-aware systems. Human-Computer Interaction, Vol. 16, 2--4 (2001), 193--212.Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Adrian Benton, Margaret Mitchell, and Dirk Hovy. 2017. Multi-task learning for mental health using social media text. arXiv preprint arXiv:1712.03538 (2017).Google ScholarGoogle Scholar
  10. Michael L Birnbaum, Sindhu Kiranmai Ernala, Asra F Rizvi, Munmun De Choudhury, and John M Kane. 2017. A collaborative approach to identifying social media markers of schizophrenia by employing machine learning and clinical appraisals. Journal of medical Internet research, Vol. 19, 8 (2017), e7956.Google ScholarGoogle ScholarCross RefCross Ref
  11. Lindsay Blackwell, Jill Dimond, Sarita Schoenebeck, and Cliff Lampe. 2017. Classification and its consequences for online harassment: Design insights from heartmob. Proceedings of the ACM on Human-Computer Interaction, Vol. 1, CSCW (2017), 1--19.Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Geoffrey C Bowker, Karen Baker, Florence Millerand, and David Ribes. 2009. Toward information infrastructure studies: Ways of knowing in a networked environment. In International handbook of internet research. Springer, 97--117.Google ScholarGoogle Scholar
  13. Geoffrey C Bowker and Susan Leigh Star. 2000. Sorting things out: Classification and its consequences. MIT press.Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Duy Duc An Bui and Qing Zeng-Treitler. 2014. Learning regular expressions for clinical text classification. Journal of the American Medical Informatics Association, Vol. 21, 5 (2014), 850--857.Google ScholarGoogle ScholarCross RefCross Ref
  15. Joy Buolamwini and Timnit Gebru. 2018. Gender shades: Intersectional accuracy disparities in commercial gender classification. In Conference on fairness, accountability and transparency. PMLR, 77--91.Google ScholarGoogle Scholar
  16. Stevie Chancellor. 2022. Towards Practices for Human-Centered Machine Learning. arXiv preprint arXiv:2203.00432 (2022).Google ScholarGoogle Scholar
  17. Stevie Chancellor, Michael L Birnbaum, Eric D Caine, Vincent MB Silenzio, and Munmun De Choudhury. 2019. A taxonomy of ethical tensions in inferring mental health states from social media. In Proceedings of the conference on fairness, accountability, and transparency. 79--88.Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Stevie Chancellor and Munmun De Choudhury. 2020. Methods in predictive techniques for mental health status on social media: a critical review. NPJ digital medicine, Vol. 3, 1 (2020), 1--11.Google ScholarGoogle Scholar
  19. Stevie Chancellor, Andrea Hu, and Munmun De Choudhury. 2018. Norms matter: contrasting social support around behavior change in online weight loss communities. In Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems. 1--14.Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Stevie Chancellor, Yannis Kalantidis, Jessica A Pater, Munmun De Choudhury, and David A Shamma. 2017. Multimodal classification of moderated online pro-eating disorder content. In Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems. 3213--3226.Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Stevie Chancellor, Jessica Annette Pater, Trustin Clear, Eric Gilbert, and Munmun De Choudhury. 2016. # thyghgapp: Instagram content moderation and lexical variation in pro-eating disorder communities. In Proceedings of the 19th ACM conference on computer-supported cooperative work & social computing. 1201--1213.Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Eshwar Chandrasekharan, Mattia Samory, Anirudh Srinivasan, and Eric Gilbert. 2017. The bag of communities: Identifying abusive behavior online with preexisting internet data. In Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems. 3175--3187.Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Eli Clare. 2017. Brilliant imperfection: Grappling with cure. Duke University Press.Google ScholarGoogle Scholar
  24. Glen Coppersmith, Mark Dredze, and Craig Harman. 2014a. Quantifying mental health signals in Twitter. In Proceedings of the workshop on computational linguistics and clinical psychology: From linguistic signal to clinical reality. 51--60.Google ScholarGoogle ScholarCross RefCross Ref
  25. Glen Coppersmith, Mark Dredze, Craig Harman, and Kristy Hollingshead. 2015. From ADHD to SAD: Analyzing the language of mental health on Twitter through self-reported diagnoses. In Proceedings of the 2nd workshop on computational linguistics and clinical psychology: from linguistic signal to clinical reality. 1--10.Google ScholarGoogle ScholarCross RefCross Ref
  26. Glen Coppersmith, Craig Harman, and Mark Dredze. 2014b. Measuring post traumatic stress disorder in Twitter. In Eighth international AAAI conference on weblogs and social media.Google ScholarGoogle ScholarCross RefCross Ref
  27. Glen Coppersmith, Kim Ngo, Ryan Leary, and Anthony Wood. 2016. Exploratory analysis of social media prior to a suicide attempt. In Proceedings of the third workshop on computational linguistics and clinical psychology. 106--117.Google ScholarGoogle ScholarCross RefCross Ref
  28. Patrick W. Corrigan, Fred E. Markowitz, and Amy C. Watson. 2004. Structural levels of mental illness stigma and discrimination. Schizophrenia bulletin, Vol. 30, 3 (2004), 481--491.Google ScholarGoogle Scholar
  29. Kaitlin L Costello and Diana Floegel. 2020. ?Predictive ads are not doctors": Mental health tracking and technology companies. Proceedings of the Association for Information Science and Technology, Vol. 57, 1 (2020), e250.Google ScholarGoogle ScholarCross RefCross Ref
  30. Vedant Das Swain, Victor Chen, Shrija Mishra, Stephen M Mattingly, Gregory D Abowd, and Munmun De Choudhury. 2022. Semantic Gap in Predicting Mental Wellbeing through Passive Sensing. In CHI Conference on Human Factors in Computing Systems. 1--16.Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. Norberto Nuno Gomes de Andrade, Dave Pawson, Dan Muriello, Lizzy Donahue, and Jennifer Guadagno. 2018. Ethics and artificial intelligence: suicide prevention on Facebook. Philosophy & Technology, Vol. 31, 4 (2018), 669--684.Google ScholarGoogle ScholarCross RefCross Ref
  32. Munmun De Choudhury. 2015. Anorexia on tumblr: A characterization study. In Proceedings of the 5th international conference on digital health 2015. 43--50.Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. Munmun De Choudhury, Scott Counts, Eric J Horvitz, and Aaron Hoff. 2014. Characterizing and predicting postpartum depression from shared facebook data. In Proceedings of the 17th ACM conference on Computer supported cooperative work & social computing. 626--638.Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. Munmun De Choudhury, Michael Gamon, Scott Counts, and Eric Horvitz. 2013. Predicting depression via social media. In Seventh international AAAI conference on weblogs and social media.Google ScholarGoogle Scholar
  35. Alex J DeGrave, Joseph D Janizek, and Su-In Lee. 2021. AI for radiographic COVID-19 detection selects shortcuts over signal. Nature Machine Intelligence, Vol. 3, 7 (2021), 610--619.Google ScholarGoogle ScholarCross RefCross Ref
  36. Emily Denton, Alex Hanna, Razvan Amironesei, Andrew Smart, Hilary Nicole, and Morgan Klaus Scheuerman. 2020. Bringing the people back in: Contesting benchmark machine learning datasets. arXiv preprint arXiv:2007.07399 (2020).Google ScholarGoogle Scholar
  37. Robert F DeVellis and Carolyn T Thorpe. 2021. Scale development: Theory and applications. Sage publications.Google ScholarGoogle Scholar
  38. Fifth Edition et al. 2013. Diagnostic and statistical manual of mental disorders. Am Psychiatric Assoc, Vol. 21, 21 (2013), 591--643.Google ScholarGoogle Scholar
  39. Sindhu Kiranmai Ernala, Michael L Birnbaum, Kristin A Candan, Asra F Rizvi, William A Sterling, John M Kane, and Munmun De Choudhury. 2019. Methodological gaps in predicting mental health states from social media: triangulating diagnostic signals. In Proceedings of the 2019 chi conference on human factors in computing systems. 1--16.Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. Sindhu Kiranmai Ernala, Tristan Labetoulle, Fred Bane, Michael L Birnbaum, Asra F Rizvi, John M Kane, and Munmun De Choudhury. 2018. Characterizing audience engagement and assessing its impact on social media disclosures of mental illnesses. In Twelfth international AAAI conference on web and social media.Google ScholarGoogle ScholarCross RefCross Ref
  41. Jerry Alan Fails and Dan R Olsen Jr. 2003. Interactive machine learning. In Proceedings of the 8th international conference on Intelligent user interfaces. 39--45.Google ScholarGoogle ScholarDigital LibraryDigital Library
  42. Jessica L Feuston, Michael Ann DeVito, Morgan Klaus Scheuerman, Katy Weathington, Marianna Benitez, Bianca Z Perez, Lucy Sondheim, and Jed R Brubaker. 2022. " Do You Ladies Relate?": Experiences of Gender Diverse People in Online Eating Disorder Communities. Proceedings of the ACM on Human-Computer Interaction, Vol. 6, CSCW2 (2022), 1--32.Google ScholarGoogle ScholarDigital LibraryDigital Library
  43. Jessica L Feuston and Anne Marie Piper. 2018. Beyond the coded gaze: Analyzing expression of mental health and illness on instagram. Proceedings of the ACM on Human-Computer Interaction, Vol. 2, CSCW (2018), 1--21.Google ScholarGoogle ScholarDigital LibraryDigital Library
  44. Jessica L Feuston and Anne Marie Piper. 2019. Everyday experiences: small stories and mental illness on Instagram. In Proceedings of the 2019 CHI conference on human factors in computing systems. 1--14.Google ScholarGoogle ScholarDigital LibraryDigital Library
  45. Jessica L Feuston, Alex S Taylor, and Anne Marie Piper. 2020. Conformity of Eating Disorders through Content Moderation. Proceedings of the ACM on Human-Computer Interaction, Vol. 4, CSCW1 (2020), 1--28.Google ScholarGoogle ScholarDigital LibraryDigital Library
  46. Casey Fiesler and Nicholas Proferes. 2018. ?Participant" perceptions of Twitter research ethics. Social Media Society, Vol. 4, 1 (2018), 2056305118763366.Google ScholarGoogle ScholarCross RefCross Ref
  47. Michel Foucault. 2013. History of madness. Routledge.Google ScholarGoogle Scholar
  48. Arthur W Frank. 2013. The wounded storyteller: Body, illness, and ethics. University of Chicago Press.Google ScholarGoogle Scholar
  49. R Stuart Geiger, Kevin Yu, Yanlai Yang, Mindy Dai, Jie Qiu, Rebekah Tang, and Jenny Huang. 2020. Garbage in, garbage out? Do machine learning application papers in social computing report where human-labeled training data comes from?. In Proceedings of the 2020 Conference on Fairness, Accountability, and Transparency. 325--336.Google ScholarGoogle ScholarDigital LibraryDigital Library
  50. Robert Geirhos, Jörn-Henrik Jacobsen, Claudio Michaelis, Richard Zemel, Wieland Brendel, Matthias Bethge, and Felix A Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence, Vol. 2, 11 (2020), 665--673.Google ScholarGoogle ScholarCross RefCross Ref
  51. Ysabel Gerrard. 2018. Beyond the hashtag: Circumventing content moderation on social media. New Media & Society, Vol. 20, 12 (2018), 4492--4511.Google ScholarGoogle ScholarCross RefCross Ref
  52. David C Giles and Julie Newbold. 2011. Self-and other-diagnosis in user-led mental health online communities. Qualitative Health Research, Vol. 21, 3 (2011), 419--428.Google ScholarGoogle ScholarCross RefCross Ref
  53. Aaron Halfaker and R Stuart Geiger. 2020. Ores: Lowering barriers with participatory machine learning in wikipedia. Proceedings of the ACM on Human-Computer Interaction, Vol. 4, CSCW2 (2020), 1--37.Google ScholarGoogle ScholarDigital LibraryDigital Library
  54. Keith Harrigian, Carlos Aguirre, and Mark Dredze. 2020. Do Models of Mental Health Based on Social Media Data Generalize?. In Proceedings of the 2020 conference on empirical methods in natural language processing: findings. 3774--3788.Google ScholarGoogle ScholarCross RefCross Ref
  55. Christina Harrington, Sheena Erete, and Anne Marie Piper. 2019. Deconstructing community-based collaborative design: Towards more equitable participatory design engagements. Proceedings of the ACM on Human-Computer Interaction, Vol. 3, CSCW (2019), 1--25.Google ScholarGoogle ScholarDigital LibraryDigital Library
  56. Kenneth Holstein, Jennifer Wortman Vaughan, Hal Daumé III, Miro Dudik, and Hanna Wallach. 2019. Improving fairness in machine learning systems: What do industry practitioners need?. In Proceedings of the 2019 CHI conference on human factors in computing systems. 1--16.Google ScholarGoogle ScholarDigital LibraryDigital Library
  57. Amina Hussain, Mishal Dar, and Kyle T Ganson. 2022. A post-structural feminist analysis of eating disorders intervention research. Affilia, Vol. 37, 3 (2022), 505--519.Google ScholarGoogle ScholarCross RefCross Ref
  58. Maia Jacobs, James Clawson, and Elizabeth D Mynatt. 2014. Cancer navigation: opportunities and challenges for facilitating the breast cancer journey. In Proceedings of the 17th ACM conference on Computer supported cooperative work & social computing. 1467--1478.Google ScholarGoogle ScholarDigital LibraryDigital Library
  59. Jialun Aaron Jiang, Kandrea Wade, Casey Fiesler, and Jed R Brubaker. 2021. Supporting serendipity: Opportunities and challenges for Human-AI Collaboration in qualitative analysis. Proceedings of the ACM on Human-Computer Interaction, Vol. 5, CSCW1 (2021), 1--23.Google ScholarGoogle ScholarDigital LibraryDigital Library
  60. Dan Jurafsky. 2000. Speech & language processing. Pearson Education India.Google ScholarGoogle Scholar
  61. Harmanpreet Kaur, Eytan Adar, Eric Gilbert, and Cliff Lampe. 2022a. Sensible AI: Re-imagining Interpretability and Explainability using Sensemaking Theory. arXiv preprint arXiv:2205.05057 (2022).Google ScholarGoogle Scholar
  62. Harmanpreet Kaur, Daniel McDuff, Alex C Williams, Jaime Teevan, and Shamsi T Iqbal. 2022b. ?I Didn't Know I Looked Angry": Characterizing Observed Emotion and Reported Affect at Work. In CHI Conference on Human Factors in Computing Systems. 1--18.Google ScholarGoogle ScholarDigital LibraryDigital Library
  63. Harmanpreet Kaur, Harsha Nori, Samuel Jenkins, Rich Caruana, Hanna Wallach, and Jennifer Wortman Vaughan. 2020. Interpreting interpretability: understanding data scientists' use of interpretability tools for machine learning. In Proceedings of the 2020 CHI conference on human factors in computing systems. 1--14.Google ScholarGoogle ScholarDigital LibraryDigital Library
  64. Arthur Kleinman. 2020. The illness narratives: Suffering, healing, and the human condition. Basic books.Google ScholarGoogle Scholar
  65. Kate Loveys, Patrick Crutchley, Emily Wyatt, and Glen Coppersmith. 2017. Small but mighty: affective micropatterns for quantifying mental health from social media language. In Proceedings of the fourth workshop on computational linguistics and clinical Psychology-From linguistic signal to clinical reality. 85--95.Google ScholarGoogle ScholarCross RefCross Ref
  66. Gianluca Maguolo and Loris Nanni. 2021. A critic evaluation of methods for COVID-19 automatic detection from X-ray images. Information Fusion, Vol. 76 (2021), 1--7.Google ScholarGoogle ScholarDigital LibraryDigital Library
  67. Gabriela Marcu, Nadia Dowshen, Shuvadittya Saha, Ressa Reneth Sarreal, and Nazanin Andalibi. 2016. TreatYoSelf: Empathy-driven behavioral intervention for marginalized youth living with HIV. In Proceedings of the 10th EAI International Conference on Pervasive Computing Technologies for Healthcare. 69--76.Google ScholarGoogle ScholarCross RefCross Ref
  68. Alice E Marwick and Danah Boyd. 2011. I tweet honestly, I tweet passionately: Twitter users, context collapse, and the imagined audience. New media & society, Vol. 13, 1 (2011), 114--133.Google ScholarGoogle Scholar
  69. Nora McDonald, Sarita Schoenebeck, and Andrea Forte. 2019. Reliability and inter-rater reliability in qualitative research: Norms and guidelines for CSCW and HCI practice. Proceedings of the ACM on human-computer interaction, Vol. 3, CSCW (2019), 1--23.Google ScholarGoogle ScholarDigital LibraryDigital Library
  70. Paul E Meehl. 1999. Clarifications about taxometric method. Applied and Preventive Psychology, Vol. 8, 3 (1999), 165--174.Google ScholarGoogle ScholarCross RefCross Ref
  71. Margaret Mitchell, Kristy Hollingshead, and Glen Coppersmith. 2015. Quantifying the language of schizophrenia in social media. In Proceedings of the 2nd workshop on Computational linguistics and clinical psychology: From linguistic signal to clinical reality. 11--20.Google ScholarGoogle ScholarCross RefCross Ref
  72. Alexandra Olteanu, Carlos Castillo, Fernando Diaz, and Emre Kiciman. 2019. Social data: Biases, methodological pitfalls, and ethical boundaries. Frontiers in Big Data, Vol. 2 (2019), 13.Google ScholarGoogle ScholarCross RefCross Ref
  73. Kayur Patel, James Fogarty, James A Landay, and Beverly L Harrison. 2008. Examining Difficulties Software Developers Encounter in the Adoption of Statistical Machine Learning.. In AAAI. 1563--1566.Google ScholarGoogle Scholar
  74. Jessica A Pater, Oliver L Haimson, Nazanin Andalibi, and Elizabeth D Mynatt. 2016. ?Hunger Hurts but Starving Works" Characterizing the Presentation of Eating Disorders Online. In Proceedings of the 19th ACM Conference on Computer-Supported Cooperative Work & Social Computing. 1185--1200.Google ScholarGoogle Scholar
  75. Jessica A Pater, Lauren E Reining, Andrew D Miller, Tammy Toscos, and Elizabeth D Mynatt. 2019. " Notjustgirls" Exploring Male-related Eating Disordered Content across Social Media Platforms. In Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems. 1--13.Google ScholarGoogle ScholarDigital LibraryDigital Library
  76. V'ictor M Prieto, Sergio Matos, Manuel Alvarez, Fidel Cacheda, and José Lu'is Oliveira. 2014. Twitter: a good place to detect health conditions. PloS one, Vol. 9, 1 (2014), e86191.Google ScholarGoogle ScholarCross RefCross Ref
  77. Inioluwa Deborah Raji, I Elizabeth Kumar, Aaron Horowitz, and Andrew Selbst. 2022. The Fallacy of AI Functionality. In 2022 ACM Conference on Fairness, Accountability, and Transparency. 959--972.Google ScholarGoogle Scholar
  78. Donghao Ren, Saleema Amershi, Bongshin Lee, Jina Suh, and Jason D Williams. 2016. Squares: Supporting interactive performance analysis for multiclass classifiers. IEEE transactions on visualization and computer graphics, Vol. 23, 1 (2016), 61--70.Google ScholarGoogle ScholarDigital LibraryDigital Library
  79. Paul Röttger, Bertie Vidgen, Dirk Hovy, and Janet B Pierrehumbert. 2021. Two Contrasting Data Annotation Paradigms for Subjective NLP Tasks. arXiv preprint arXiv:2112.07475 (2021).Google ScholarGoogle Scholar
  80. Shiori Sagawa, Aditi Raghunathan, Pang Wei Koh, and Percy Liang. 2020. An investigation of why overparameterization exacerbates spurious correlations. In International Conference on Machine Learning. PMLR, 8346--8356.Google ScholarGoogle Scholar
  81. Johnny Salda na. 2009. The coding manual for qualitative researchers. SAGE Publications.Google ScholarGoogle Scholar
  82. Nithya Sambasivan, Shivani Kapania, Hannah Highfill, Diana Akrong, Praveen Paritosh, and Lora M Aroyo. 2021. ?Everyone wants to do the model work, not the data work": Data Cascades in High-Stakes AI. In proceedings of the 2021 CHI Conference on Human Factors in Computing Systems. 1--15.Google ScholarGoogle ScholarDigital LibraryDigital Library
  83. Morgan Klaus Scheuerman, Alex Hanna, and Emily Denton. 2021. Do datasets have politics? Disciplinary values in computer vision dataset development. Proceedings of the ACM on Human-Computer Interaction, Vol. 5, CSCW2 (2021), 1--37.Google ScholarGoogle ScholarDigital LibraryDigital Library
  84. Morgan Klaus Scheuerman, Jacob M Paul, and Jed R Brubaker. 2019. How computers see gender: An evaluation of gender classification in commercial facial analysis services. Proceedings of the ACM on Human-Computer Interaction, Vol. 3, CSCW (2019), 1--33.Google ScholarGoogle ScholarDigital LibraryDigital Library
  85. Andrew Scull. 2015. Madness in Civilization: A Cultural History of Insanity, from the Bible to Freud, from the Madhouse to Modern Medicine. Princeton University Press. 432 pages.Google ScholarGoogle Scholar
  86. Indira Sen, Fabian Flöck, Katrin Weller, Bernd Weiß, and Claudia Wagner. 2021. A total error framework for digital traces of human behavior on online platforms. Public Opinion Quarterly, Vol. 85, S1 (2021), 399--422.Google ScholarGoogle ScholarCross RefCross Ref
  87. C Estelle Smith, Bowen Yu, Anjali Srivastava, Aaron Halfaker, Loren Terveen, and Haiyi Zhu. 2020. Keeping community in the loop: Understanding wikipedia stakeholder values for machine learning-based systems. In Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems. 1--14.Google ScholarGoogle ScholarDigital LibraryDigital Library
  88. Ian Stewart, Stevie Chancellor, Munmun De Choudhury, and Jacob Eisenstein. 2017. # anorexia,# anarexia,# anarexyia: Characterizing online community practices with orthographic variation. In 2017 IEEE International Conference on Big Data (Big Data). IEEE, 4353--4361.Google ScholarGoogle ScholarCross RefCross Ref
  89. TS Szasz. 1974. The myth of mental illness: Foundations of a theory of personal conduct. HarperPerennial.Google ScholarGoogle Scholar
  90. Anja Thieme, Danielle Belgrave, and Gavin Doherty. 2020. Machine learning in mental health: A systematic review of the HCI literature to support the development of effective and implementable ML systems. ACM Transactions on Computer-Human Interaction (TOCHI), Vol. 27, 5 (2020), 1--53.Google ScholarGoogle ScholarDigital LibraryDigital Library
  91. Philip S Wang, Patricia A Berglund, Mark Olfson, and Ronald C Kessler. 2004. Delays in initial treatment contact after first onset of a mental disorder. Health services research, Vol. 39, 2 (2004), 393--416.Google ScholarGoogle Scholar
  92. Tao Wang, Markus Brede, Antonella Ianni, and Emmanouil Mentzakis. 2017. Detecting and characterizing eating-disorder communities on social media. In Proceedings of the Tenth ACM International conference on web search and data mining. 91--100.Google ScholarGoogle ScholarDigital LibraryDigital Library
  93. Tongshuang Wu, Marco Tulio Ribeiro, Jeffrey Heer, and Daniel S Weld. 2019. Errudite: Scalable, reproducible, and testable error analysis. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. 747--763.Google ScholarGoogle ScholarCross RefCross Ref
  94. Andrew Yates, Arman Cohan, and Nazli Goharian. 2017. Depression and self-harm risk assessment in online forums. arXiv preprint arXiv:1709.01848 (2017).Google ScholarGoogle Scholar
  95. Anon Ymous, Katta Spiel, Os Keyes, Rua M Williams, Judith Good, Eva Hornecker, and Cynthia L Bennett. 2020. " I am just terrified of my future"-Epistemic Violence in Disability Related Technology Research. In Extended Abstracts of the 2020 CHI Conference on Human Factors in Computing Systems. 1--16.Google ScholarGoogle ScholarDigital LibraryDigital Library
  96. Jun Yuan, Jesse Vig, and Nazneen Rajani. 2022. iSEA: An Interactive Pipeline for Semantic Error Analysis of NLP Models. In 27th International Conference on Intelligent User Interfaces. 878--888.Google ScholarGoogle ScholarDigital LibraryDigital Library
  97. Haiyi Zhu, Bowen Yu, Aaron Halfaker, and Loren Terveen. 2018. Value-sensitive algorithm design: Method, case study, and lessons. Proceedings of the ACM on Human-Computer Interaction, Vol. 2, CSCW (2018), 1--23.Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Contextual Gaps in Machine Learning for Mental Illness Prediction: The Case of Diagnostic Disclosures

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in

        Full Access

        • Published in

          cover image Proceedings of the ACM on Human-Computer Interaction
          Proceedings of the ACM on Human-Computer Interaction  Volume 7, Issue CSCW2
          CSCW
          October 2023
          4055 pages
          EISSN:2573-0142
          DOI:10.1145/3626953
          Issue’s Table of Contents

          Copyright © 2023 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 4 October 2023
          Published in pacmhci Volume 7, Issue CSCW2

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article
        • Article Metrics

          • Downloads (Last 12 months)130
          • Downloads (Last 6 weeks)12

          Other Metrics

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader