Abstract
Seasonality plays a significant role in the prevalence of infectious diseases. We evaluate the performance of different approaches used to deal with seasonality in clinical prediction models, including a new proposal based on sliding windows. Class imbalance, high dimensionality and interpretable models are also considered since they are common traits of clinical datasets.
We tested these approaches with four datasets: two created synthetically and two extracted from the MIMIC-III database. Our results corroborate that clinical prediction models for infections can be improved by considering the effect of seasonality. However, the techniques employed to obtain the best results are highly dependent on the dataset.
This work was partially funded by the SITSUS project (Ref: RTI2018-094832-B-I00), given by MCIU/AEI/FEDER, UE.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Cánovas-Segura, B., et al.: Exploring antimicrobial resistance prediction using post-hoc interpretable methods. In: Marcos, M., Juarez, J.M., Lenz, R., Nalepa, G.J., Nowaczyk, S., Peleg, M., Stefanowski, J., Stiglic, G. (eds.) KR4HC/TEAAM -2019. LNCS (LNAI), vol. 11979, pp. 93–107. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-37446-4_8
Christiansen, C., Pedersen, L., Sørensen, H., Rothman, K.: Methods to assess seasonal effects in epidemiological studies of infectious diseases-exemplified by application to the occurrence of meningococcal disease. Clin. Microbiol. Infect. 18(10), 963–969 (2012)
Dowell, S.F.: Seasonal variation in host susceptibility and cycles of certain infectious diseases. Emerg. Infect. Dis. 7(3), 369–374 (2001)
Friedman, J., Hastie, T., Tibshirani, R.: Regularization paths for generalized linear models via coordinate descent. J. Stat. Softw. 33(1) 1 (2010)
Godahewa, R., Yann, T., Bergmeir, C., Petitjean, F.: Seasonal averaged one-dependence estimators: a novel algorithm to address seasonal concept drift in high-dimensional stream classification. In: Proceedings of the International Joint Conference on Neural Networks (2020)
Hastie, T., Qian, J.: Glmnet ignette. Technical report. https://web.stanford.edu/~hastie/Papers/Glmnet_Vignette.pdf. Accessed 24 Jan 2021
Johnson, A.E., et al.: MIMIC-III, a freely accessible critical care database. Sci. Data 3(1) (2016)
Novoselova, N., Wang, J., Pessler, F., Klawonn, F.: Biocomb: Feature Selection and Classification with the Embedded Validation Procedures for Biomedical Data Analysis. https://cran.r-project.org/web/packages/Biocomb/index.html. Accessed 24 Jan 2021
Williams, D.J., et al.: Predicting severe pneumonia outcomes in children. Pediatrics 138(4), e20161019 (2016)
Yu, L., Liu, H.: Efficient feature selection via analysis of relevance and redundancy. J. Mach. Learn. Res. 5, 1205–1224 (2004)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Cánovas-Segura, B., Morales, A., Juárez, J.M., Campos, M. (2021). Seasonality in Infection Predictions Using Interpretable Models for High Dimensional Imbalanced Datasets. In: Tucker, A., Henriques Abreu, P., Cardoso, J., Pereira Rodrigues, P., Riaño, D. (eds) Artificial Intelligence in Medicine. AIME 2021. Lecture Notes in Computer Science(), vol 12721. Springer, Cham. https://doi.org/10.1007/978-3-030-77211-6_16
Download citation
DOI: https://doi.org/10.1007/978-3-030-77211-6_16
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-77210-9
Online ISBN: 978-3-030-77211-6
eBook Packages: Computer ScienceComputer Science (R0)