skip to main content
10.1145/3218585.3218588acmotherconferencesArticle/Chapter ViewAbstractPublication PagesdsaiConference Proceedingsconference-collections
research-article

Predicting length of stay in hospitalized patients using SSL algorithms

Published: 20 June 2018 Publication History

Abstract

Length of stay in hospitalized patients is acknowledged as a critical factor for healthcare policy planning that consequently affects the available human, technical and financial resources as well as facilities occupation. Over recent years, data mining and machine learning led to the development of several efficient and accurate models for predicting of how long a patient will stay in the hospital and support healthcare policy planning. As an alternative to traditional classification methods, semi-supervised learning algorithms have become a hot topic of significant research which exhibit remarkable performance over labeled data but lack the ability to be applied on large amounts of unlabeled data. In this work, we evaluate the performance of semi-supervised methods in predicting the length of stay of hospitalized patients. Our reported experimental results illustrate that a good predictive accuracy can be achieved using few labeled data in comparison to well known supervised learning algorithms.

References

[1]
D. Aha. 1997. Lazy Learning. Dordrecht: Kluwer Academic Publishers.
[2]
J. Bai, A. Fügener, J. Schoenfelder, and J.O. Brunner. 2018. Operations research in intensive care unit management: a literature review. Health care management science 21, 1 (2018), 1--24.
[3]
A. Belderrar and A. Hazzab. 2017. Hierarchical Genetic Algorithm and Fuzzy Radial Basis Function Networks for Factors Influencing Hospital Length of Stay Outliers. Healthcare Informatics Research 23, 3 (2017), 226--232.
[4]
A. Blum and T. Mitchell. 1998. Combining labeled and unlabeled data with co-training. In 11th annual conference on Computational learning theory. ACM, 92--100.
[5]
P. Domingos and M. Pazzani. 1997. On the optimality of the simple Bayesian classifier under zero-one loss. Machine Learning 29 (1997), 103--130.
[6]
J. Du, C.X. Ling, and Z.H. Zhou. 2011. When does co-training work in real data? IEEE Transactions on Knowledge and Data Engineering 23, 5 (2011), 788--799.
[7]
H. Finner. 1993. On a monotonicity problem in step-down multiple test procedures. J. Amer. Statist. Assoc. 88, 423 (1993), 920--923.
[8]
E. Frank and I.H. Witten. 1998. Generating Accurate Rule Sets Without Global Optimization. In 15th International Conference on Machine Learning. 144--151.
[9]
T. Guo and G. Li. 2012. Improved tri-training with unlabeled data. Software Engineering and Knowledge Engineering: Theory and Practice (2012), 139--147.
[10]
D.H. Gustafson. 1968. Length of Stay: Prediction and Explanation. Health Services Research 3, 1 (1968), 12--34.
[11]
P.R. Hachesu, M. Ahmadi, S. Alizadeh, and F. Sadoughi. 2013. Use of data mining techniques to determine and predict length of stay of cardiac patients. Healthcare informatics research 19, 2 (2013), 121--129.
[12]
M. Hall, E. Frank, G. Holmes, B. Pfahringer, P. Reutemann, and I. Witten. 2009. The WEKA data mining software: An update. SIGKDD Explorations Newsletters 11 (2009), 10--18. Issue 1.
[13]
J.L. Hodges and E.L. Lehmann. 1962. Rank methods for combination of independent experiments in analysis of variance. The Annals of Mathematical Statistics 33, 2 (1962), 482--497.
[14]
D.J. Huang, L.Z. Xie, and Y. Qiu. 2016. Analysis of factors affecting the length of hospital stay for patients with diabetes. Experimental and Clinical Endocrinology & Diabetes 124, 1 (2016), 5--10.
[15]
X. Jiang, X. Qu, and L.B. Davis. 2010. Using Data Mining to Analyze Patient Discharge Data for an Urban Hospital. In Proceedings of International Conference on Data Mining. 139--144.
[16]
N. Khajehali and S. Alizadeh. 2017. Extract critical factors affecting the length of hospital stay of pneumonia patient by data mining (case study: an Iranian hospital). Artificial Intelligence in Medicine 83 (2017), 2--13.
[17]
I.E. Livieris, K. Drakopoulou, V. Tampakas, T. Mikropoulos, and P. Pintelas. 2018. Predicting secondary school students' performance utilizing a semi-supervised learning approach. Journal of Educational Computing Research (2018).
[18]
L. Luigi, A. di Giorgio, and A.F. Dragoni. 2015. Length of Stay Prediction and Analysis through a Growing Neural Gas Model. In Proceedings of the 4th International Workshop on Artificial Intelligence and Assistive Medicine. 11--21.
[19]
A. Morton, E. Marzban, G. Giannoulis, A. Patel, R. Aparasu, and LA. Kakadiaris. 2014. A Comparison of Supervised Machine Learning Techniques for Predicting Short-Term In-Hospital Length of Stay among Diabetic Patients. In 13th International Conference on Machine Learning and Applications. 428--431.
[20]
V. Ng and C. Cardie. 2003. Weakly supervised natural language learning without redundant views. In Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology-Volume 1. Association for Computational Linguistics, 94--101.
[21]
I. Nouaouri, A. Samet, and H. Allaoui. 2015. Evidential data mining for length of stay (LOS) prediction problem. In IEEE International Conference on Automation Science and Engineering 2015. IEEE, 1415--1420.
[22]
V.U. Panchami and N. Radhika. 2014. A novel approach for predicting the length of hospital stay with DBSCAN and supervised classification algorithms. In 5th International Conference in Applications of Digital Information and Web Technologies. 207--212.
[23]
J. Platt. 1999. Using sparseness and analytic QP to speed training of support vector machines. In Advances in neural information processing systems, M.S. Kearns, S.A. Solla, and DA. Cohn (Eds.). MA: MIT Press, 557--563.
[24]
J.R. Quinlan. 1993. C4.5: Programs for machine learning. Morgan Kaufmann, San Francisco.
[25]
D.E. Rumelhart, G.E. Hinton, and R.J. Williams. 1986. Learning internal representations by error propagation. In Parallel Distributed Processing: Explorations in the Microstructure of Cognition, D. Rumelhart and J. McClelland (Eds.). Cambridge, Massachusetts, 318--362.
[26]
M. Sokolova, N. Japkowicz, and S. Szpakowicz. 2006. Beyond accuracy, F-score and ROC: A family of discriminant measures for performance evaluation. In Australian conference on artificial intelligence, Vol. 4304. 1015--1021.
[27]
S. Sun and F. Jin. 2011. Robust co-training. International Journal of Pattern Recognition and Artificial Intelligence 25, 07 (2011), 1113--1126.
[28]
P.F. Tsai, P.C. Chen, Y.Y. Chen, H.Y. Song, H.M. Lin, F.M. Lin, and Q.P. Huang. 2016. Length of Hospital Stay Prediction at the Admission Stage for Cardiology Patients Using Artificial Neural Network. Journal of Healthcare Engineering (2016), 1--11.
[29]
I.W.M. Verburg, A. Atashi, S. Eslami, R. Holman, A. Abu-Hanna, E. de Jonge, N. Peek, and N.F. de Keizer. 2017. Which models can I use to predict adult ICU length of stay? A systematic review. Critical care medicine 45, 2 (2017), 222--231.
[30]
I.W.M. Verburg, N.F. de Keizer, E. de Jonge, and N. Peek. 2014. Comparison of Regression Methods for Modeling Intensive Care Length of Stay. PLoS One 9, 10 (2014).
[31]
S. Walczak, R.J. Scorpio, and W.E. Pofahl. 1998. Predicting Hospital Length of Stay with Neural Networks. In Proceedings of the 11th International FLAIRS Conference. 333--337.
[32]
World Health Organization. 2004. International statistical classification of diseases and related health problems. Vol. 1. World Health Organization.
[33]
X. Wu, V Kumar, J.R. Quinlan, J. Ghosh, Q. Yang, H. Motoda, G.J. McLachlan, A.F.M. Ng, B. Liu, P.S. Yu, Z.H. Zhou, M. Steinbach, D.J. Hand, and D. Steinberg. 2008. Top 10 algorithms in data mining. Knowledge and information systems 14, 1 (2008), 1--37.
[34]
C.S. Yang, C.P. Wei, C.C. Yuan, and J.Y. Schoung. 2010. Predicting the length of hospital stay of burn patients: Comparisons of prediction accuracy among different clinical stages. Decision Support Systems 50, 1 (2010), 325--335.
[35]
Z.H. Zhou and M. Li. 2005. Tri-training: Exploiting unlabeled data using three classifiers. IEEE Transactions on knowledge and Data Engineering 17, 11 (2005), 1529--1541.
[36]
X. Zhu and A.B. Goldberg. 2009. Introduction to semi-supervised learning. Synthesis lectures on artificial intelligence and machine learning 3, 1 (2009), 1--130.

Cited By

View all
  • (2023)Mining Themes in Clinical Notes to Identify Phenotypes and to Predict Length of Stay in Patients admitted with Heart Failure2023 IEEE International Conference on Digital Health (ICDH)10.1109/ICDH60066.2023.00038(208-216)Online publication date: Jul-2023
  • (2022)Big Data Applications in Healthcare AdministrationResearch Anthology on Big Data Analytics, Architectures, and Applications10.4018/978-1-6684-3662-2.ch048(1003-1034)Online publication date: 2022
  • (2022)Predicting Patient Length of Stay in Australian Emergency Departments Using Data MiningSensors10.3390/s2213496822:13(4968)Online publication date: 30-Jun-2022
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
DSAI '18: Proceedings of the 8th International Conference on Software Development and Technologies for Enhancing Accessibility and Fighting Info-exclusion
June 2018
365 pages
ISBN:9781450364676
DOI:10.1145/3218585
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 20 June 2018

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Length of stay
  2. classification
  3. co-training
  4. data mining
  5. self-training
  6. semi-supervised learning
  7. tri-training

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

DSAI 2018

Acceptance Rates

DSAI '18 Paper Acceptance Rate 17 of 23 submissions, 74%;
Overall Acceptance Rate 17 of 23 submissions, 74%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)8
  • Downloads (Last 6 weeks)0
Reflects downloads up to 05 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2023)Mining Themes in Clinical Notes to Identify Phenotypes and to Predict Length of Stay in Patients admitted with Heart Failure2023 IEEE International Conference on Digital Health (ICDH)10.1109/ICDH60066.2023.00038(208-216)Online publication date: Jul-2023
  • (2022)Big Data Applications in Healthcare AdministrationResearch Anthology on Big Data Analytics, Architectures, and Applications10.4018/978-1-6684-3662-2.ch048(1003-1034)Online publication date: 2022
  • (2022)Predicting Patient Length of Stay in Australian Emergency Departments Using Data MiningSensors10.3390/s2213496822:13(4968)Online publication date: 30-Jun-2022
  • (2022)A systematic review of the prediction of hospital length of stay: Towards a unified frameworkPLOS Digital Health10.1371/journal.pdig.00000171:4(e0000017)Online publication date: 14-Apr-2022
  • (2021)Machine learning in the prediction of medical inpatient length of stayInternal Medicine Journal10.1111/imj.1496252:2(176-185)Online publication date: 27-Oct-2021
  • (2020)Hospital Length of Stay Prediction using Regression Models2020 IEEE International Conference for Innovation in Technology (INOCON)10.1109/INOCON50539.2020.9298294(1-5)Online publication date: 6-Nov-2020
  • (2018)Decision Support Software for Forecasting Patient’s Length of StayAlgorithms10.3390/a1112019911:12(199)Online publication date: 6-Dec-2018

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media