skip to main content
10.1145/3469830.3470903acmotherconferencesArticle/Chapter ViewAbstractPublication PagessstdConference Proceedingsconference-collections
research-article

Clustering of Adverse Events of Post-Market Approved Drugs

Published: 23 August 2021 Publication History

Abstract

Adverse side effects of a drug may vary over space and time due to different populations, environments, and drug quality. Discovering all side effects during the development process is impossible. Once a drug is approved, observed adverse effects are reported by doctors and patients and made available in the Adverse Event Reporting System provided by the U.S. Food and Drug Administration . Mining such records of reported adverse effects, this study proposes a spatial clustering approach to identify regions that exhibit similar adverse effects. We apply a topic modeling approach on textual representations of reported adverse effects using Latent Dirichlet Allocation. By describing a spatial region as a mixture of the resulting latent topics, we find clusters of regions that exhibit similar (topics of) adverse events for the same drug using Hierarchical Agglomerative Clustering. We investigate the resulting clusters for spatial autocorrelation to test the hypothesis that certain (topics of) adverse effects may occur only in certain spatial regions using Moran’s I measure of spatial autocorrelation.
Our experimental evaluation exemplary applies our proposed framework to a number of blood-thinning drugs, showing that some drugs exhibit more coherent textual topics among their reported adverse effects than other drugs, but showing no significant spatial autocorrelation of these topics. Our approach can be applied to other drugs or vaccines to study if spatially localized adverse effects may justify further investigation.

References

[1]
European Medicines Agency. 2018. Eliquis. https://www.ema.europa.eu/en/medicines/human/EPAR/eliquis
[2]
European Medicines Agency. 2018. Pradaxa. https://www.ema.europa.eu/en/medicines/human/EPAR/pradaxa
[3]
European Medicines Agency. 2018. Xarelto. https://www.ema.europa.eu/en/medicines/human/EPAR/xarelto
[4]
Avi Baehr, Juliet C Peña, and Dale J Hu. 2015. Racial and ethnic disparities in adverse drug events: a systematic review of the literature. Journal of racial and ethnic health disparities 2, 4(2015), 527–536.
[5]
David M Blei, Andrew Y Ng, and Michael I Jordan. 2003. Latent dirichlet allocation. Journal of machine Learning research 3, Jan (2003), 993–1022.
[6]
Elliot G Brown, Louise Wood, and Sue Wood. 1999. The medical dictionary for regulatory activities (MedDRA). Drug safety 20, 2 (1999), 109–117.
[7]
Alina Chircu, Eldar Sultanow, and Satya Prakash Saraswat. 2014. Healthcare RFID in Germany: an integrated pharmaceutical supply chain perspective. Journal of Applied Business Research (JABR) 30, 3 (2014), 737–752.
[8]
William HE Day and Herbert Edelsbrunner. 1984. Efficient algorithms for agglomerative hierarchical clustering methods. Journal of classification 1, 1 (1984), 7–24.
[9]
Wilfrid J Dixon and Frank J Massey Jr. 1951. Introduction to statistical analysis.McGraw-Hill.
[10]
Hesha J Duggirala, Joseph M Tonning, Ella Smith, Roselie A Bright, Khaled Bouri, 2016. Use of data mining at the Food and Drug Administration. Journal of the American Medical Informatics Association 23, 2(2016), 428–434.
[11]
Charles W Dunnett. 1955. A multiple comparison procedure for comparing several treatments with a control. J. Amer. Statist. Assoc. 50, 272 (1955), 1096–1121.
[12]
Michael Edelstein, Lisa M Lee, Asha Herten-Crabb, David L Heymann, and David R Harper. 2018. Strengthening global public health surveillance through data and benefit sharing. Emerging infectious diseases 24, 7 (2018), 1324.
[13]
US Food, Drug Administration, 2018. FDA adverse event reporting system (FAERS) public dashboard. US Food and Drug Administration(2018).
[14]
Alan M Hochberg, Stephanie J Reisinger, Ronald K Pearson, Donald J O’Hara, and Kevin Hall. 2007. Using data mining to predict safety actions from FDA adverse event reporting system data. Drug information journal: DIJ/Drug Information Association 41, 5(2007), 633–643.
[15]
G Jeetu and G Anusha. 2010. Pharmacovigilance: a worldwide master key for drug safety monitoring. Journal of Young Pharmacists 2, 3 (2010), 315–320.
[16]
K Jordahl. 2014. GeoPandas: Python tools for geographic data. URL: https://github. com/geopandas/geopandas (2014).
[17]
Jeong-Hee Kang, Chul-Woung Kim, and Sang-Yi Lee. 2014. Nurse-perceived patient adverse events and nursing practice environment. Journal of Preventive Medicine and Public Health 47, 5 (2014), 273.
[18]
David A Kessler 1993. Introducing MEDWatch: a new approach to reporting medication and device adverse effects and product problems. Jama 269, 21 (1993), 2765–2768.
[19]
Lada Leyens, Matthias Reumann, Nuria Malats, and Angela Brand. 2017. Use of big data for drug development and for public and personal health and care. Genetic epidemiology 41, 1 (2017), 51–60.
[20]
Aristidis Likas, Nikos Vlassis, and Jakob J Verbeek. 2003. The global k-means clustering algorithm. Pattern recognition 36, 2 (2003), 451–461.
[21]
S Lupkin. 2017. Nearly 1 in 3 recent FDA drug approvals followed by major safety action. Scientific American (2017).
[22]
Rasmus E Madsen, David Kauchak, and Charles Elkan. 2005. Modeling word burstiness using the Dirichlet distribution. In Proceedings of the 22nd international conference on Machine learning. 545–552.
[23]
Qiaozhu Mei, Xuehua Shen, and ChengXiang Zhai. 2007. Automatic labeling of multinomial topic models. In Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining. 490–499.
[24]
Patrick AP Moran. 1950. Notes on continuous stochastic phenomena. Biometrika 37, 1/2 (1950), 17–23.
[25]
Patricia Mozzicato. 2009. MedDRA. Pharmaceutical Medicine 23, 2 (2009), 65–75.
[26]
Peter A Noseworthy, Xiaoxi Yao, Neena S Abraham, Lindsey R Sangaralingham, Robert D McBane, and Nilay D Shah. 2016. Direct comparison of dabigatran, rivaroxaban, and apixaban for effectiveness and safety in nonvalvular atrial fibrillation. Chest 150, 6 (2016), 1302–1312.
[27]
Juliet Siena Okoroh, Erika Flores Uribe, and Saul Weingart. 2017. Racial and ethnic disparities in patient safety. Journal of patient safety 13, 3 (2017), 153–161.
[28]
Francisco Gilberto Fernandes Pereira, Márcia Barroso Camilo de Ataíde, Ricardo Leal Silva, Eugenie Desirèe Rabelo Néri, Gerdane Celene Nunes Carvalho, and Joselany Áfio Caetano. 2018. Environmental variables and errors in the preparation and administration of medicines. Revista brasileira de enfermagem 71, 3 (2018), 1046–1054.
[29]
Carlotta Piccardi, Jens Detollenaere, Pierre Vanden Bussche, and Sara Willems. 2018. Social disparities in patient safety in primary care: a systematic review. International Journal for Equity in Health 17, 1 (2018), 114.
[30]
Sergio J Rey and Luc Anselin. 2010. PySAL: A Python library of spatial analytical methods. In Handbook of applied spatial analysis. Springer, 175–193.
[31]
Ole-Christian W Rutherford, Christian Jonasson, Waleed Ghanima, Fabian Söderdahl, and Sigrun Halvorsen. 2020. Comparison of dabigatran, rivaroxaban, and apixaban for effectiveness and safety in atrial fibrillation: a nationwide cohort study. European Heart Journal-Cardiovascular Pharmacotherapy 6, 2(2020), 75–85.
[32]
Erich Schubert, Jörg Sander, Martin Ester, Hans Peter Kriegel, and Xiaowei Xu. 2017. DBSCAN revisited, revisited: why and how you should (still) use DBSCAN. ACM Transactions on Database Systems (TODS) 42, 3 (2017), 1–21.
[33]
Xu Teng, Jingchao Yang, Joon-Seok Kim, Goce Trajcevski, Andreas Züfle, and Mario A Nascimento. 2019. Fine-grained diversification of proximity constrained queries on road networks. In Proceedings of the 16th International Symposium on Spatial and Temporal Databases. 51–60.
[34]
U.S. Food and Drug Administration (FDA). 2020. FDA Adverse Event Reporting System (FAERS) Quarterly Data Extract Files (url:https://fis.fda.gov/extensions/FPD-QDE-FAERS/FPD-QDE-FAERS.html).
[35]
U.S. Food and Drug Administration (FDA). 2020. Open Data API openFDA (url:https://open.fda.gov/).
[36]
Yefeng Wang, Divya R Gunashekar, Terrence J Adam, and Rui Zhang. 2017. Mining adverse events of dietary supplements from product labels by topic modeling. Studies in health technology and informatics 245 (2017), 614.
[37]
Wayne Xin Zhao, Jing Jiang, Jianshu Weng, Jing He, Ee-Peng Lim, Hongfei Yan, and Xiaoming Li. 2011. Comparing twitter and traditional media using topic models. In European conference on information retrieval. Springer, 338–349.
[38]
Marinka Zitnik, Monica Agrawal, and Jure Leskovec. 2018. Modeling polypharmacy side effects with graph convolutional networks. Bioinformatics 34, 13 (2018), i457–i466.

Cited By

View all
  • (2024)A Clustering Ensemble Method for Drug Safety Signal Detection in Post-Marketing SurveillanceTherapeutic Innovation & Regulatory Science10.1007/s43441-024-00705-759:1(89-101)Online publication date: 20-Oct-2024

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
SSTD '21: Proceedings of the 17th International Symposium on Spatial and Temporal Databases
August 2021
173 pages
ISBN:9781450384254
DOI:10.1145/3469830
© 2021 Association for Computing Machinery. ACM acknowledges that this contribution was authored or co-authored by an employee, contractor or affiliate of the United States government. As such, the United States Government retains a nonexclusive, royalty-free right to publish or reproduce this article, or to allow others to do so, for Government purposes only.

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 23 August 2021

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Adverse Events
  2. Clustering
  3. Health Geography
  4. Pharmacovigilance
  5. Spatial Data Mining
  6. Topic Modeling

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

SSTD '21

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)20
  • Downloads (Last 6 weeks)1
Reflects downloads up to 16 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2024)A Clustering Ensemble Method for Drug Safety Signal Detection in Post-Marketing SurveillanceTherapeutic Innovation & Regulatory Science10.1007/s43441-024-00705-759:1(89-101)Online publication date: 20-Oct-2024

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media