The Effects of Sampling Methods on Machine Learning Models for Predicting Long-term Length of Stay: A Case Study of Rhode Island Hospitals


Son Nguyen, Alicia T. Lamere, Alan Olinsky, John Quinn
Copyright: © 2019 | Volume: 6 | Issue: 3 | Pages: 17
ISSN: 2334-4598 | EISSN: 2334-4601 | EISBN13: 9781522568469 | DOI: 10.4018/IJRSDA.2019070103
Cite Article

MLA

Nguyen, Son, et al. "The Effects of Sampling Methods on Machine Learning Models for Predicting Long-term Length of Stay: A Case Study of Rhode Island Hospitals." IJRSDA vol.6, no.3 2019: pp.32-48. http://doi.org/10.4018/IJRSDA.2019070103

APA

Nguyen, S., Lamere, A. T., Olinsky, A., & Quinn, J. (2019). The Effects of Sampling Methods on Machine Learning Models for Predicting Long-term Length of Stay: A Case Study of Rhode Island Hospitals. International Journal of Rough Sets and Data Analysis (IJRSDA), 6(3), 32-48. http://doi.org/10.4018/IJRSDA.2019070103

Chicago

Nguyen, Son, et al. "The Effects of Sampling Methods on Machine Learning Models for Predicting Long-term Length of Stay: A Case Study of Rhode Island Hospitals," International Journal of Rough Sets and Data Analysis (IJRSDA) 6, no.3: 32-48. http://doi.org/10.4018/IJRSDA.2019070103


Abstract

The ability to predict which patients will have a long-term length of stay (LOS) can aid a hospital's admission management, help maintain effective resource utilization, and support a high quality of inpatient care. Hospital discharge data from the Rhode Island Department of Health for 2010 to 2013 reveal that inpatients with long-term stays, i.e., two weeks or more, cost about six times more than those with short stays while accounting for only 4.7% of inpatients. Because long-stay patients are so heavily outnumbered by short-stay patients, predicting long-term LOS becomes an imbalanced classification problem. Sampling methods, which balance the data before a traditional classification model is fitted to it, offer a simple approach to the problem. In this work, the authors propose a new resampling method called RUBIES, which provides superior predictive ability when compared to other commonly used sampling techniques.
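
To make the idea of balancing data before model fitting concrete, here is a minimal sketch of plain random undersampling, one of the commonly used sampling techniques. It is not RUBIES, whose details are not given in this abstract. The function name `random_undersample` and the synthetic data are assumptions for illustration only; the 47-in-1,000 class ratio simply mirrors the 4.7% figure above.

```python
import numpy as np

def random_undersample(X, y, seed=0):
    """Randomly drop rows until every class has as many rows as the rarest class.

    Hypothetical helper for illustration; this is generic random
    undersampling, not the RUBIES method proposed in the article.
    """
    rng = np.random.default_rng(seed)
    classes, counts = np.unique(y, return_counts=True)
    n_min = counts.min()
    # Sample n_min row indices from each class without replacement.
    keep = np.concatenate([
        rng.choice(np.flatnonzero(y == c), size=n_min, replace=False)
        for c in classes
    ])
    keep.sort()  # preserve the original row order
    return X[keep], y[keep]

# Synthetic stand-in data: 1,000 stays, 47 (4.7%) labelled long-term (1).
rng = np.random.default_rng(42)
y = np.zeros(1000, dtype=int)
y[:47] = 1
X = rng.normal(size=(1000, 3))

X_bal, y_bal = random_undersample(X, y)
print(X_bal.shape, np.bincount(y_bal))
```

The balanced arrays `X_bal` and `y_bal` would then be passed to any standard classifier. Undersampling discards most of the majority class, which is one reason alternative resampling schemes are studied.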