Abstract
Capturing changes in an individual’s language is an important aspect of personalised mental health monitoring. A key component is modelling the influence of time, as contextual information both in the recent or distant past/future carries varying semantic weight. We capture and contrast this information by identifying neural, time-sensitive, bi-directional representations of individuals – modelling time-intervals in their social-media posts inspired by the Hawkes process. We demonstrate that our approach helps identify whether an individual’s mood is changing drastically, or smoothly on two social media datasets – yielding superior performance compared to time-insensitive baselines and outperforming the state-of-the-art on the CLPsych 2022 shared task.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Alhamed, F., Ive, J., Specia, L.: Predicting moments of mood changes overtime from imbalanced social media data. In: Proceedings of the Eighth Workshop on Computational Linguistics and Clinical Psychology, pp. 239–244. Association for Computational Linguistics, Seattle, USA, July 2022. https://doi.org/10.18653/v1/2022.clpsych-1.23. https://aclanthology.org/2022.clpsych-1.23
Amir, S., Coppersmith, G., Carvalho, P., Silva, M.J., Wallace, B.C.: Quantifying mental health from social media with neural user embeddings. In: Machine Learning for Healthcare Conference, pp. 306–321. PMLR (2017)
Azim, T., Gyanendro Singh, L., Middleton, S.E.: Detecting moments of change and suicidal risks in longitudinal user texts using multi-task learning. In: Proceedings of the Eighth Workshop on Computational Linguistics and Clinical Psychology, pp. 213–218. Association for Computational Linguistics, Seattle, USA, July 2022. https://doi.org/10.18653/v1/2022.clpsych-1.19. https://aclanthology.org/2022.clpsych-1.19
Bagroy, S., Kumaraguru, P., De Choudhury, M.: A social media based index of mental well-being in college campuses. In: Proceedings of the 2017 CHI Conference on Human factors in Computing Systems, pp. 1634–1646 (2017)
Bayram, U., Benhiba, L.: Emotionally-informed models for detecting moments of change and suicide risk levels in longitudinal social media data. In: Proceedings of the Eighth Workshop on Computational Linguistics and Clinical Psychology, pp. 219–225. Association for Computational Linguistics, Seattle, USA, July 2022. https://doi.org/10.18653/v1/2022.clpsych-1.20. https://aclanthology.org/2022.clpsych-1.20
Boinepelli, S., Subramanian, S., Singam, A., Raha, T., Varma, V.: Towards capturing changes in mood and identifying suicidality risk. In: Proceedings of the Eighth Workshop on Computational Linguistics and Clinical Psychology, pp. 245–250 (2022)
Bucur, A.M., Jang, H., Liza, F.F.: Capturing changes in mood over time in longitudinal data using ensemble methodologies (2022)
Coppersmith, G., Dredze, M., Harman, C., Hollingshead, K., Mitchell, M.: CLPsych 2015 shared task: Depression and PTSD on Twitter. In: Proceedings of the 2nd Workshop on Computational Linguistics and Clinical Psychology: From Linguistic Signal to Clinical Reality, pp. 31–39. Association for Computational Linguistics, Denver, Colorado, 5 June 2015. https://doi.org/10.3115/v1/W15-1204. https://aclanthology.org/W15-1204
Coppersmith, G., Ngo, K., Leary, R., Wood, A.: Exploratory analysis of social media prior to a suicide attempt. In: Proceedings of the Third Workshop on Computational Linguistics and Clinical Psychology, pp. 106–117 (2016)
Culnan, J., Diaz, D.R., Bethard, S.: Exploring transformers and time lag features for predicting changes in mood over time. In: Proceedings of the Eighth Workshop on Computational Linguistics and Clinical Psychology, pp. 226–231 (2022)
Daley, D.J., Vere-Jones, D.: An Introduction to the Theory of Point Processes. Volume II: General Theory and Structure. Springer (2008)
Daley, D.J., Vere-Jones, D., et al.: An introduction to the theory of point processes: volume I: elementary theory and methods. Springer (2003)
Daly, M., Sutin, A.R., Robinson, E.: Longitudinal changes in mental health and the COVID-19 pandemic: evidence from the UK Household Longitudinal Study. Psychological Medicine, pp. 1–10, November 2020. https://doi.org/10.1017/S0033291720004432. https://www.cambridge.org/core/product/identifier/S0033291720004432/type/journal_article
De Choudhury, M., Gamon, M., Counts, S., Horvitz, E.: Predicting depression via social media. In: Proceedings of the International AAAI Conference on Web and Social Media, vol. 7 (2013)
Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: BERT: pre-training of Deep Bidirectional Transformers for Language Understanding, pp. 4171–4186, June 2019. https://doi.org/10.18653/v1/N19-1423. https://www.aclweb.org/anthology/N19-1423
Du, N., Dai, H., Trivedi, R., Upadhyay, U., Gomez-Rodriguez, M., Song, L.: Recurrent marked temporal point processes: embedding event history to vector. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1555–1564 (2016)
Ernala, S.K., et al.: Methodological gaps in predicting mental health states from social media: triangulating diagnostic signals. In: Proceedings of the 2019 chi Conference on Human Factors in Computing Systems, pp. 1–16 (2019)
Gamaarachchige, P.K., Orabi, A.H., Orabi, M.H., Inkpen, D.: Multi-task learning to capture changes in mood over time. In: Proceedings of the Eighth Workshop on Computational Linguistics and Clinical Psychology, pp. 232–238 (2022)
Ganesan, A.V., et al.: Wwbp-sqt-lite: multi-level models and difference embeddings for moments of change identification in mental health forums. In: Proceedings of the Eighth Workshop on Computational Linguistics and Clinical Psychology, pp. 251–258 (2022)
Gkotsis, G., et al.: Characterisation of mental health conditions in social media using informed deep learning. Sci. Rep. 7(1), 1–11 (2017)
Hawkes, A.G.: Point spectra of some mutually exciting point processes. J. Roy. Stat. Soc.: Ser. B (Methodol.) 33(3), 438–443 (1971)
Lin, T.Y., Goyal, P., Girshick, R., He, K., Dollar, P.: Focal Loss for Dense Object Detection, pp. 2980–2988 (2017). https://openaccess.thecvf.com/content_iccv_2017/html/Lin_Focal_Loss_for_ICCV_2017_paper.html
Loveys, K., Crutchley, P., Wyatt, E., Coppersmith, G.: Small but mighty: affective micropatterns for quantifying mental health from social media language. In: Proceedings of the Fourth Workshop on Computational Linguistics and Clinical Psychology-From Linguistic Signal to Clinical Reality, pp. 85–95 (2017)
Marcos, H.F., et al.: Approximate nearest neighbour extraction techniques and neural networks for suicide risk prediction in the clpsych 2022 shared task. In: Proceedings of the Eighth Workshop on Computational Linguistics and Clinical Psychology, pp. 199–204 (2022)
Masuda, N., Kurahashi, I., Onari, H.: Suicide ideation of individuals in online social networks. PLoS ONE 8(4), e62262 (2013)
Preoţiuc-Pietro, D., et al.: The role of personality, age, and gender in tweeting about mental illness. In: Proceedings of the 2nd Workshop on Computational Linguistics and Clinical Psychology: From Linguistic Signal to Clinical Reality, pp. 21–30 (2015)
Rizoiu, M.A., Lee, Y., Mishra, S., Xie, L.: A tutorial on hawkes processes for events in social media. arXiv preprint arXiv:1708.06401 (2017)
Sawhney, R., Agarwal, S., Wadhwa, A., Shah, R.: Exploring the scale-free nature of stock markets: hyperbolic graph learning for algorithmic trading. In: Proceedings of the Web Conference 2021, pp. 11–22. ACM, Ljubljana Slovenia, April 2021. https://doi.org/10.1145/3442381.3450095. https://dl.acm.org/doi/10.1145/3442381.3450095
Sawhney, R., Joshi, H., Shah, R.R., Flek, L.: Suicide ideation detection via social and temporal user representations using hyperbolic learning. In: Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 2176–2190. Association for Computational Linguistics, Online (2021). https://doi.org/10.18653/v1/2021.naacl-main.176. https://www.aclweb.org/anthology/2021.naacl-main.176
Shchur, O., Türkmen, A.C., Januschowski, T., Günnemann, S.: Neural temporal point processes: a review. arXiv preprint arXiv:2104.03528 (2021)
Shing, H.C., Nair, S., Zirikly, A., Friedenberg, M., Daumé III, H., Resnik, P.: Expert, crowdsourced, and machine assessment of suicide risk via online postings. In: Proceedings of the Fifth Workshop on Computational Linguistics and Clinical Psychology: From Keyboard to Clinic, pp. 25–36 (2018)
Tamire, M., Anumasa, S., Srijith, P.K.: Bi-directional recurrent neural ordinary differential equations for social media text classification. In: Proceedings of the 2nd Workshop on Deriving Insights from User-Generated Text, pp. 20–24. Association for Computational Linguistics, (Hybrid) Dublin, Ireland, and Virtual, May 2022. https://doi.org/10.18653/v1/2022.wit-1.3. https://aclanthology.org/2022.wit-1.3
Tsakalidis, A., et al.: Overview of the CLPsych 2022 shared task: Capturing moments of change in longitudinal user posts. In: Proceedings of the Eighth Workshop on Computational Linguistics and Clinical Psychology, pp. 184–198. Association for Computational Linguistics, Seattle, USA, July 2022. https://doi.org/10.18653/v1/2022.clpsych-1.16. https://aclanthology.org/2022.clpsych-1.16
Tsakalidis, A., Nanni, F., Hills, A., Chim, J., Song, J., Liakata, M.: Identifying moments of change from longitudinal user text. In: Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 4647–4660. Association for Computational Linguistics, Dublin, Ireland, May 2022. https://doi.org/10.18653/v1/2022.acl-long.318. https://aclanthology.org/2022.acl-long.318
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Ethics declarations
Ethics Statement
Ethics IRB approval was obtained from the IRB Committee of the lead University prior to engaging in this research study. Our work involves ethical considerations around the analysis of user generated content shared on social media (TalkLife and Reddit). A license was obtained to work with the user data from TalkLife and a project proposal was submitted to them in order to embark on the project. Potential risks from the application of our work in being able to identify moments of change in individuals’ timelines are akin to the identification of those in earlier work on personal event identification from social media and the detection of suicidal ideation. Potential mitigation strategies include restricting and regulating access to the code base and annotation labels used for evaluation.
Appendices
A Annotation Guidelines
The two datasets of longitudinal user posts annotated for MoCs that we make use of in this paper were sourced by [33, 34] for TalkLife and Reddit respectively. Both datasets were annotated using the same annotation guidelines and annotation interface proposed [34].
Annotators were provided with timelines to view, containing chronologically ordered posts by users, along with their associated comments and timestamps. They were then asked to label posts for MoCs.
The first type of label, “Switch” was defined in the guidelines as a “drastic change in mood, in comparison with the recent past”. Annotators were also tasked to label how long the Switch in mood persists (i.e. label its beginning and end). The second type of label “Escalation” was defined in their guidelines as a “gradual change in mood, which should last for a few posts”. Similarly, annotators were also instructed to label the associated range of posts for how long this change persists: where a peak of the escalation must be labelled, and the beginning and end of the gradual mood change also provided. Finally, a label of “None” was provided by default where no mood change was identified for that given post.
B Hyper-parameters Searched
We perform a grid-search over the HEAT parameters (Eq. 1): both \(\beta \) (decay rate) and \(\alpha \) (self-excitation) in the range [0.00001, 0.001, 0.1] for both datasets. All models are searched with learning rates in the range [0.0001, 0.001, 0.01] on Reddit and [0.001, 0.01] for TalkLife. All models are trained with 100 epochs with early stopping using a patience of 5 for all models and both datasets. For the BiLSTM module, we perform a grid-search over all layers using output dimensions of [128, 256, 512] and [128, 256] for Reddit and Talklife respectively. All models were implemented with PyTorch, and were trained using K-Fold cross validation over 5 folds using training, validation, and testing sizes of 60%, 20%, 20% respectively.
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Hills, A., Tsakalidis, A., Liakata, M. (2023). Time-Aware Predictions of Moments of Change in Longitudinal User Posts on Social Media. In: Ifrim, G., et al. Advanced Analytics and Learning on Temporal Data. AALTD 2023. Lecture Notes in Computer Science(), vol 14343. Springer, Cham. https://doi.org/10.1007/978-3-031-49896-1_19
Download citation
DOI: https://doi.org/10.1007/978-3-031-49896-1_19
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-49895-4
Online ISBN: 978-3-031-49896-1
eBook Packages: Computer ScienceComputer Science (R0)