skip to main content
research-article

Supporting Personalized Health Care With Social Media Analytics: An Application to Hypothyroidism

Published: 15 October 2021 Publication History

Abstract

Social media analytics can considerably contribute to understanding health conditions beyond clinical practice, by capturing patients’ discussions and feelings about their quality of life in relation to disease treatments. In this article, we propose a methodology to support a detailed analysis of the therapeutic experience in patients affected by a specific disease, as it emerges from health forums. As a use case to test the proposed methodology, we analyze the experience of patients affected by hypothyroidism and their reactions to standard therapies. Our approach is based on a data extraction and filtering pipeline, a novel topic detection model named Generative Text Compression with Agglomerative Clustering Summarization (GTCACS), and an in-depth data analytic process. We advance the state of the art on automated detection of adverse drug reactions (ADRs) since, rather than simply detecting and classifying positive or negative reactions to a therapy, we are capable of providing a fine characterization of patients along different dimensions, such as co-morbidities, symptoms, and emotional states.

References

[1]
Hassan Alhuzali and Sophia Ananiadou. 2019. Improving classification of adverse drug reactions through using sentiment analysis and transfer learning. In Proceedings of the 18th BioNLP Workshop and Shared Task. Association for Computational Linguistics, 339–347. https://doi.org/10.18653/v1/W19-5036
[2]
Mohammed Bahja and Mark Lycett. 2016. Identifying patient experience from online resources via sentiment analysis and topic modelling. In Proceedings of the 3rd IEEE/ACM International Conference on Big Data Computing, Applications and Technologies (BDCAT’16). Association for Computing Machinery, New York, NY, 9499. https://doi.org/10.1145/3006299.3006335
[3]
Maksim Belousov, Nikola Milosevic, William Dixon, and Goran Nenadic. 2017. Extracting adverse drug reactions and their context using sequence labelling ensembles in TAC2017. In Proceedings of the 2017 Text Analysis Conference (TAC’17). NIST, Gaithersburg, Maryland.
[4]
James Bergstra, Rémi Bardenet, Yoshua Bengio, and Balázs Kégl. 2011. Algorithms for hyper-parameter optimization. In Proceedings of the 24th International Conference on Neural Information Processing Systems (NIPS ’11). Curran Associates Inc., Red Hook, NY, 25462554.
[5]
David M. Blei, Andrew Y. Ng, and Michael I. Jordan. 2003. Latent Dirichlet allocation. Journal of Machine Learning Research 3, null (March 2003), 9931022.
[6]
Piotr Bojanowski, Edouard Grave, Armand Joulin, and Tomas Mikolov. 2017. Enriching word vectors with subword information. Transactions of the Association for Computational Linguistics 5 (2017), 135–146. https://doi.org/10.1162/tacl_a_00051
[7]
Gerlof Bouma. 2009. Normalized (pointwise) mutual information in collocation extraction, From Form to Meaning: Processing Texts Automatically. In Proceedings of the Biennial GSCL Conference 2009, 31–40.
[8]
Tadeusz Caliński and Joachim Harabasz. 1974. A dendrite method for cluster analysis. Communications in Statistics—Theory and Methods 3, 1 (1974), 1–27. https://doi.org/10.1080/03610927408827101
[9]
Layal Chaker, Antonio Bianco, Jacqueline Jonklaas, and Robin Peeters. 2017. Hypothyroidism. The Lancet 390 (March 2017). https://doi.org/10.1016/S0140-6736(17)30703-1
[10]
Ying Chen, Chu-Ren Huang, and Sophia Lee. 2011. Automatic recognition of emotion based on a cognitively motivated emotion annotation system. Journal of Cognitive Science 12 (Dec. 2011), 279–296. https://doi.org/10.17791/jcs.2011.12.3.279
[11]
Rion Correia, Ian Wood, Johan Bollen, and Luis Rocha. 2020. Mining social media data for biomedical signals and health-related behavior. Annual Review of Biomedical Data Science 3 (July 2020), 433–458. https://doi.org/10.1146/annurev-biodatasci-030320-040844
[12]
Oriol Cunillera, Ricard Tresserras, Luis Rajmil, Gemma Vilagut, Pilar Brugulat, Mike Herdman, Mompart Anna, Antonia Medina, Yolanda Pardo, Jordi Alonso, John Brazier, and Montse Ferrer. 2010. Discriminative capacity of the EQ-5D, SF-6D, and SF-12 as measures of health status in population health survey. Quality of Life Research: An International Journal of Quality of Life Aspects of Treatment, Care and Rehabilitation 19 (March 2010), 853–864. https://doi.org/10.1007/s11136-010-9639-z
[13]
David L. Davies and Donald W. Bouldin. 1979. A cluster separation measure. IEEE Transactions on Pattern Analysis and Machine Intelligence 1, 2 (Feb. 1979), 224227. https://doi.org/10.1109/TPAMI.1979.4766909
[14]
Richard Dawkins. 1982. The Extended Phenotype: The Gene as the Unit of Selection. Freeman, Oxford. 81009889https://books.google.it/books?id=uJCUAQAACAAJ.
[15]
Sara Della Rosa and Falguni Sen. 2019. Health topics on facebook groups: Content analysis of posts in multiple sclerosis communities. Interactive Journal of Medical Research 8, 1 (Feb. 2019), e10146. https://doi.org/10.2196/10146
[16]
Mies van Eenbergen, Lonneke Poll-Franse, Emiel Krahmer, Suzan Verberne, and Floortje Mols. 2018. Analysis of content shared in online cancer communities: Systematic review. JMIR Cancer 4, 1 (April 2018), e6. https://doi.org/10.2196/cancer.7926
[17]
Paul Ekman. 1992. An argument for basic emotions. Cognition and Emotion 6, 3-4 (1992), 169–200. https://doi.org/10.1080/02699939208411068
[18]
Ane Garmendia Madariaga, Silvia Santos Palacios, Francisco Guillén-Grima, and Juan C. Galofré. 2014. The incidence and prevalence of thyroid dysfunction in europe: A meta-analysis. The Journal of Clinical Endocrinology & Metabolism 99, 3 (March 2014), 923–931. https://doi.org/10.1210/jc.2013-2409arXiv:https://academic.oup.com/jcem/article-pdf/99/3/923/11157991/jcem09 23.pdf.
[19]
John Glover. 2016. Modeling documents with generative adversarial networks. CoRR abs/1612.09122. arxiv:1612.09122http://arxiv.org/abs/1612.09122.
[20]
Ian Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. 2014. Generative adversarial networks. Advances in Neural Information Processing Systems 3 (June 2014). https://doi.org/10.1145/3422622
[21]
Giorgio Grani, D. Tumino, V. Ramundo, Laura Ciotti, Cristiano Lomonaco, M. Armillotta, Rosa Falcone, P. Lucia, Marianna Maranghi, S. Filetti, and C. Durante. 2019. Changes in TSH levels in athyreotic patients with differentiated thyroid cancer during levothyroxine therapy: Influence on dose adjustments. Journal of Endocrinological Investigation 42 (June 2019). https://doi.org/10.1007/s40618-019-01074-x
[22]
Felix Greaves, Daniel Ramirez-Cano, Christopher Millett, Ara Darzi, and Liam Donaldson. 2013. Harnessing the cloud of patient experience: Using social media to detect poor quality healthcare. BMJ Quality & Safety 22 (Jan. 2013). https://doi.org/10.1136/bmjqs-2012-001527
[23]
Maria Halkidi, Yannis Batistakis, and Michalis Vazirgiannis. 2002. Cluster validity methods: Part I. SIGMOD Rec. 31, 2 (June 2002), 4045. https://doi.org/10.1145/565117.565124
[24]
Ahmed Hamed, Tamer Fandy, Xindong Wu, and Robert Erickson. 2015. Twitter K-H networks in action: Advancing biomedical literature for drug search. Journal of Biomedical Informatics 56 (2015), 157–168. https://doi.org/10.1016/j.jbi.2015.05.015
[25]
Henry Hexmoor. 2014. Computational Network Science: An Algorithmic Approach (1st ed.). Morgan Kaufmann Publishers, Inc., San Francisco, CA.
[26]
Frank Hutter, Holger H. Hoos, and Kevin Leyton-Brown. 2011. Sequential model-based optimization for general algorithm configuration. In Proceedings of the 5th International Conference on Learning and Intelligent Optimization (LION’05). Springer-Verlag, Berlin, 507523. https://doi.org/10.1007/978-3-642-25566-3_40
[27]
Sachin Jain, Brian Powers, Jared Hawkins, and John Brownstein. 2015. The digital phenotype. Nature Biotechnology 33 (May 2015), 462–463. https://doi.org/10.1038/nbt.3223
[28]
Rose McCabe and Patrick G. T. Healey. 2018. “Miscommunication in Doctor-Patient Communication.”Topics in Cognitive Science 10 (Feb. 2018), 409–424.
[29]
Stephen Oppong, Dominic Asamoah, Emmanuel Oppong, and Derrick Lamptey. 2019. Business decision support system based on sentiment analysis. International Journal of Information Engineering and Electronic Business 11 (Jan. 2019), 36–49. https://doi.org/10.5815/ijieeb.2019.01.05
[30]
Pentti Paatero and Unto Tapper. 1994. Positive matrix factorization: A non-negative factor model with optimal utilization of error estimates of data values. Environmetrics 5, 2 (1994), 111–126.
[31]
Wullianallur Raghupathi and Viju Raghupathi. 2014. Big data analytics in healthcare: Promise and potential. Health Information Science and Systems 2 (Feb. 2014), 3. https://doi.org/10.1186/2047-2501-2-3
[32]
Bhavani Raskutti and Christopher Leckie. 1999. An evaluation of criteria for measuring the quality of clusters. In Proceedings of the 16th International Joint Conference on Artificial Intelligence (IJCAI’99). Morgan Kaufmann Publishers, Inc., San Francisco, CA, 905910.
[33]
Peter Rousseeuw. 1987. Rousseeuw, P.J.: Silhouettes: A Graphical aid to the interpretation and validation of cluster analysis. Journal of Computational and Applied Mathematics 20 (Nov. 1987), 53–65. https://doi.org/10.1016/0377-0427(87)90125-7
[34]
Andrew S. Gallan. 2014. Evaluation and measurement of patient experience. Patient Experience Journal 1 (April 2014), 5.
[35]
Hariprasad Sampathkumar, Xue-Wen Chen, and Bo Luo. 2014. Mining adverse drug reactions from online healthcare forums using hidden Markov model.BMC Medical Informatics and Decision Making 14 (Oct. 2014), 91. https://doi.org/10.1186/1472-6947-14-91
[36]
Emanuel Schegloff. 2000. When ‘others’ initiate repair. Applied Linguistics 21 (June 2000). https://doi.org/10.1093/applin/21.2.205
[37]
Edin Smailhodzic, Wyanda Hooijsma, Albert Boonstra, and David J. Langley. 2016. Social media use in healthcare: A systematic review of effects on patients and on their relationship with healthcare professionals. BMC Health Services Research 16, 1 (Aug. 2016), 442.
[38]
Sven Stegemann, Robert L. Ternik, Graziano Onder, Mansoor Khan, and Diana van Riet-Nales. 2016. Defining patient centric pharmaceutical drug product design. The AAPS Journal 18 (June 2016). https://doi.org/10.1208/s12248-016-9938-6
[39]
Giovanni Stilo, Moreno De Vincenzi, Alberto E. Tozzi, and Paola Velardi. 2013. Automated learning of everyday patients’ language for medical blogs analytics. In Recent Advances in Natural Language Processing (RANLP’13). INCOMA Ltd. Shoumen, BULGARIA, Hissar, Bulgaria, 640–648. https://www.aclweb.org/anthology/R13-1084.
[40]
Joe H. Ward. 1963. Hierarchical grouping to optimize an objective function. Journal of the American Statistical Assocication 58, 301 (1963), 236–244. https://doi.org/10.1080/01621459.1963.10500845
[41]
Ziqiao Weng. 2019. From conventional machine learning to AutoML. Journal of Physics: Conference Series 1207 (April 2019), 012015.
[42]
Ryen W. White, P. Murali Doraiswamy, and Eric Horvitz. 2018. Detecting neurodegenerative disorders from web search signals. npj Digital Medicine 1 (Dec. 2018). https://doi.org/10.1038/s41746-018-0016-6
[43]
David Wolpert. 1992. Stacked generalization. Neural Networks 5 (Dec. 1992), 241–259. https://doi.org/10.1016/S0893-6080(05)80023-1
[44]
D. H. Wolpert and W. G. Macready. 1997. No free lunch theorems for optimization. IEEE Transactions on Evolutionary Computation 1, 1 (April 1997), 6782. https://doi.org/10.1109/4235.585893
[45]
Christopher C. Yang, Haodong Yang, Ling Jiang, and Mi Zhang. 2012. Social media mining for drug safety signal detection. In Proceedings of the 2012 International Workshop on Smart Health and Wellbeing (SHB’12). Association for Computing Machinery, New York, NY, 3340. https://doi.org/10.1145/2389707.2389714.
[46]
Yang Yu, Zhiqiang Gong, Ping Zhong, and Jiaxin Shan. 2017. Unsupervised representation learning with deep convolutional neural network for remote sensing images. In Image and Graphics, Yao Zhao, Xiangwei Kong, and David Taubman (Eds.). Springer International Publishing, Cham, 97–108.
[47]
Amy X. Zhang, Jilin Chen, Wei Chai, Jinjun Xu, Lichan Hong, and Ed CHI. 2018. Evaluation and refinement of clustered search results with the crowd. ACM Transactions on Interactive Intelligent Systems 8, 2 (June 2018), Article 14, 28 pages. https://doi.org/10.1145/3158226
[48]
Junbo Zhao, Michael Mathieu, and Yann Lecun. 2016. Energy-based generative adversarial network. CoRR abs/1609.03126 (Sept. 2016). arxiv:1609.03126. http://arxiv.org/abs/1609.03126.

Cited By

View all
  • (2024)A Systematic Review of Natural Language Processing Methods and Applications in ThyroidologyMayo Clinic Proceedings: Digital Health10.1016/j.mcpdig.2024.03.0072:2(270-279)Online publication date: Jun-2024
  • (2024)Meta-Learning on Clinical Data for Diagnosis Support Systems: A Systematic ReviewResearch and Innovation Forum 202310.1007/978-3-031-44721-1_57(751-759)Online publication date: 1-Jan-2024

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Computing for Healthcare
ACM Transactions on Computing for Healthcare  Volume 3, Issue 1
January 2022
255 pages
EISSN:2637-8051
DOI:10.1145/3485154
Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 15 October 2021
Accepted: 01 May 2021
Revised: 01 April 2021
Received: 01 July 2020
Published in HEALTH Volume 3, Issue 1

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Personalized health care
  2. adverse drug reactions
  3. hypothyroid patients
  4. social analytics
  5. generative adversarial networks
  6. meta-learning

Qualifiers

  • Research-article
  • Refereed

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)65
  • Downloads (Last 6 weeks)1
Reflects downloads up to 18 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2024)A Systematic Review of Natural Language Processing Methods and Applications in ThyroidologyMayo Clinic Proceedings: Digital Health10.1016/j.mcpdig.2024.03.0072:2(270-279)Online publication date: Jun-2024
  • (2024)Meta-Learning on Clinical Data for Diagnosis Support Systems: A Systematic ReviewResearch and Innovation Forum 202310.1007/978-3-031-44721-1_57(751-759)Online publication date: 1-Jan-2024

View Options

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Full Text

View this article in Full Text.

Full Text

HTML Format

View this article in HTML Format.

HTML Format

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media