Abstract
We study the similarity of adverse effects of COVID-19 vaccines across different states in the United States. We use data of 300,000 COVID-19 vaccine adverse event reports obtained from the Vaccine Adverse Event Reporting System (VAERS). We extract latent topics from the reported adverse events using a topic modeling approach based on Latent Dirichlet allocation (LDA). This approach allows us to represent each U.S. state as a low-dimensional distribution over topics. Using Moran’s index of spatial autocorrelation we show that some of the topics of adverse events exhibit significant spatial autocorrelation, indicating that there exist spatial clusters of nearby states that exhibit similar adverse events. Using Anselin’s local indicator of spatial association we discover and report these clusters. Our results show that adverse events of COVID-19 vaccines vary across states which justifies further research to understand the underlying causality to better understand adverse effects and to reduce vaccine hesitancy.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Anselin, L.: Local indicators of spatial association—lisa. Geogr. Anal. 27(2), 93–115 (1995)
Askar, A., Züfle, A.: Spatio-temporal clustering of adverse events of post-market approved drugs using latent dirichlet allocation. In: Proceedings of the 17th International Symposium on Spatial and Temporal Databases (2021)
Baehr, A., Peña, J.C., Hu, D.J.: Racial and ethnic disparities in adverse drug events: a systematic review of the literature. J. Racial Ethnic Health Disparities 2(4), 527–536 (2015)
Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)
Brown, E.G., Wood, L., Wood, S.: The medical dictionary for regulatory activities (meddra). Drug Saf. 20(2), 109–117 (1999)
Centers for disease control and prevention: different COVID-19 vaccines. (https://www.cdc.gov/coronavirus/2019-ncov/vaccines/different-vaccines.html)
Chircu, A., Sultanow, E., Saraswat, S.P.: Healthcare RFID in germany: an integrated pharmaceutical supply chain perspective. J. Appl. Bus. Res. (JABR) 30(3), 737–752 (2014)
Day, W.H., Edelsbrunner, H.: Efficient algorithms for agglomerative hierarchical clustering methods. J. Classif. 1(1), 7–24 (1984)
Dixon, W.J., Massey Jr, F.J.: Introduction to statistical analysis (1951)
Dong, E., Du, H., Gardner, L.: An interactive web-based dashboard to track COVID-19 in real time. Lancet Infect. Dis. 20(5), 533-534 (2020). (https://coronavirus.jhu.edu/map.html)
Dror, A.A., et al.: Vaccine hesitancy: the next challenge in the fight against Covid-19. Eur. J. Epidemiol. 35(8), 775–779 (2020)
Dushoff, J., et al.: Vaccinating to protect a vulnerable subpopulation. PLoS Med 4(5), e174 (2007)
Ester, M., Kriegel, H.P., Sander, J., Xu, X., et al.: A density-based algorithm for discovering clusters in large spatial databases with noise. In: Kdd, pp. 226–231 (1996)
FDA, CDC: vaccine adverse event reporting system. vaers.hhs.gov (2021)
Fotheringham, A.S., Brunsdon, C., Charlton, M.: Geographically Weighted Regression: The Analysis of Spatially Varying Relationships. John Wiley & Sons, Hoboken (2003)
Glaser, R., Kiecolt-Glaser, J.K., Malarkey, W.B., Sheridan, J.F.: The influence of psychological stress on the immune response to vaccines. Ann. NY Acad. Sci. 840(1), 649–655 (1998)
Griffiths, T.L., Steyvers, M.: Finding scientific topics. Proc. Natl. Acad. Sci. 101(suppl 1), 5228–5235 (2004)
Jeetu, G., Anusha, G.: Pharmacovigilance: a worldwide master key for drug safety monitoring. J. Young Pharmacists 2(3), 315–320 (2010)
Jordahl, K.: Geopandas: python tools for geographic data (2014). https://github.com/geopandas/geopandas
Kang, J.H., Kim, C.W., Lee, S.Y.: Nurse-perceived patient adverse events and nursing practice environment. J. Prev. Med. Public Health 47(5), 273 (2014)
Leyens, L., Reumann, M., Malats, N., Brand, A.: Use of big data for drug development and for public and personal health and care. Genet. Epidemiol. 41(1), 51–60 (2017)
Likas, A., Vlassis, N., Verbeek, J.J.: The global k-means clustering algorithm. Pattern Recogn. 36(2), 451–461 (2003)
Madsen, R.E., Kauchak, D., Elkan, C.: Modeling word burstiness using the dirichlet distribution. In: Proceedings of the 22nd international Conference on Machine Learning, pp. 545–552 (2005)
Moran, P.A.: Notes on continuous stochastic phenomena. Biometrika 37(1/2), 17–23 (1950)
Okoroh, J.S., Uribe, E.F., Weingart, S.: Racial and ethnic disparities in patient safety. J. Patient Saf. 13(3), 153–161 (2017)
Pereira, F.G.F., Ataíde, M.B.C.D., Silva, R.L., Néri, E.D.R., Carvalho, G.C.N., Caetano, J.Á.: Environmental variables and errors in the preparation and administration of medicines. Rev. Bra. Enfermagem 71(3), 1046–1054 (2018)
Piccardi, C., Detollenaere, J., Bussche, P.V., Willems, S.: Social disparities in patient safety in primary care: a systematic review. Int. J. Equity Health 17(1), 114 (2018)
PolitiFact, The poynter institute: federal VAERS database is a critical tool for researchers, but a breeding ground for misinformation. (https://www.politifact.com/article/2021/may/03/vaers-governments-vaccine-safety-database-critical/)
Ramage, D., Hall, D., Nallapati, R., Manning, C.D.: Labeled LDA: a supervised topic model for credit attribution in multi-labeled corpora. In: Proceedings of the 2009 Conference on eempirical Methods in Natural Language Processing, pp. 248–256 (2009)
Rey S.J., Anselin L.: PySAL: A python library of spatial analytical methods. In: Fischer M., Getis A. (eds) Handbook of Applied Spatial Analysis. Springer Heidelberg (2010). https://doi.org/10.1007/978-3-642-03647-7_11
Sallam, M.: Covid-19 vaccine hesitancy worldwide: a concise systematic review of vaccine acceptance rates. Vaccines 9(2), 160 (2021)
Schubert, E., Sander, J., Ester, M., Kriegel, H.P., Xu, X.: DBSCAN revisited, revisited: why and how you should (still) use DBSCAN. ACM Trans. Database Syst. (TODS) 42(3), 1–21 (2017)
Shaffer, J.P.: Multiple hypothesis testing. Ann. Rev. Psychol. 46(1), 561–584 (1995)
Stratton, K., Ford, A., Rusch, E., Clayton, E.C., et al.: Adverse Effects of Vaccines: Evidence and ccausality. Committee to Review Adverse Effects of Vaccines (2011)
Wang, Y., Gunashekar, D.R., Adam, T.J., Zhang, R.: Mining adverse events of dietary supplements from product labels by topic modeling. Stud. Health Technol. Inf. 245, 614 (2017)
Weisstein, E.W.: Bonferroni correction (2004). https://mathworld.wolfram.com/
Acknowledgements
This research was prepared or accomplished by Ahmed Askar in his personal capacity. The opinions expressed in this article are the author’s own and do not reflect the view of the U.S. Food and Drug Administration, the Department of Health and Human Services, or the United States government. This research received no external funding.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Askar, A., Züfle, A. (2021). Clustering Adverse Events of COVID-19 Vaccines Across the United States. In: Reyes, N., et al. Similarity Search and Applications. SISAP 2021. Lecture Notes in Computer Science(), vol 13058. Springer, Cham. https://doi.org/10.1007/978-3-030-89657-7_23
Download citation
DOI: https://doi.org/10.1007/978-3-030-89657-7_23
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-89656-0
Online ISBN: 978-3-030-89657-7
eBook Packages: Computer ScienceComputer Science (R0)