research-article

Efficient healthcare service based on Stacking Ensemble

Authors:
Yunsang Joo, Department of Computer Engineering, Gachon University, Republic of Korea
Seungwon Lee, Department of Computer Engineering, Gachon University, Republic of Korea
Hyoungju Kim, Department of Computer Engineering, Chosun University, Republic of Korea
Pankoo Kim, Department of Computer Engineering, Chosun University, Republic of Korea
Seongoun Hwang, Department of Computer Engineering, Gachon University, Republic of Korea
Chang Choi, Department of Computer Engineering, Gachon University, Republic of Korea

ACM ICEA '20: Proceedings of the 2020 ACM International Conference on Intelligent Computing and its Emerging Applications
December 2020, Article No.: 28, Pages 1–5
https://doi.org/10.1145/3440943.3444727

Published: 27 September 2021


ABSTRACT

Research that uses medical big data to identify patients with a high probability of disease has recently attracted considerable attention. Advances in artificial intelligence make it possible to predict diseases from numerical data alone and to prevent them before they occur, so continued research in this area is essential, and machine learning and deep learning studies that use medical big data for disease prediction are actively under way. Because diseases are rare in medical data, such studies tend to rely on oversampling or undersampling, which can distort the information in the data. In addition, most machine learning-based studies rely on a particular predictive model, so the predictions themselves risk reflecting that model's bias. Generalizing the training data or adjusting the model's bias can therefore yield better predictions. In this paper, we apply several individual classifiers to diabetes, heart disease, and breast cancer data to obtain predicted values, and we use those values as training data for a single meta-model that produces the final predictions. That is, we construct a stacking ensemble model to predict the presence or absence of a disease and analyse its performance through experiments. Because this model trains multiple classifiers on the same data, it may overfit. We therefore compare models whose classifiers are trained with and without cross-validation. In the experiments, the model trained with cross-validation performed on average 1.4% better than the individual single models, whereas the meta-model without cross-validation performed worse than the individual single models. In other words, when constructing a stacking ensemble model, high performance was obtained only when the individual classifiers were cross-validated.
Performing one final prediction on the predicted values of high-performing individual models yields more stable and reliable predictions. The cross-validation-based stacking ensemble model proposed in this paper predicts the presence or absence of a disease and can be used for medical service development and disease prevention.
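
The comparison described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the base classifiers (a random forest and an SVM), the logistic-regression meta-model, the 5-fold setting, the 70/30 split, and the use of scikit-learn's bundled breast cancer data are all assumptions chosen for illustration, and the helper names are hypothetical. With `use_cv=True` the meta-model is trained on out-of-fold predictions; with `use_cv=False` it is trained on predictions the base models made for the same rows they were fitted on, which is the setting the abstract associates with overfitting.

```python
import numpy as np
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_predict, train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC


def make_base_models(seed=0):
    """Illustrative base classifiers (an assumption, not the paper's exact set)."""
    return [
        RandomForestClassifier(n_estimators=50, random_state=seed),
        make_pipeline(StandardScaler(), SVC(probability=True, random_state=seed)),
    ]


def meta_features(models, X_train, y_train, X_test, use_cv, n_folds=5):
    """Build the meta-model's inputs: one probability column per base model."""
    train_cols, test_cols = [], []
    for model in models:
        if use_cv:
            # Out-of-fold probabilities: each training row is predicted by a
            # copy of the model that never saw that row during fitting.
            train_col = cross_val_predict(
                model, X_train, y_train, cv=n_folds, method="predict_proba"
            )[:, 1]
            model.fit(X_train, y_train)
        else:
            # No cross-validation: the model predicts the rows it was fitted on.
            model.fit(X_train, y_train)
            train_col = model.predict_proba(X_train)[:, 1]
        train_cols.append(train_col)
        test_cols.append(model.predict_proba(X_test)[:, 1])
    return np.column_stack(train_cols), np.column_stack(test_cols)


def stacking_accuracy(use_cv, seed=0):
    """Train the stacked model and return its accuracy on a held-out test split."""
    X, y = load_breast_cancer(return_X_y=True)
    X_tr, X_te, y_tr, y_te = train_test_split(
        X, y, test_size=0.3, stratify=y, random_state=seed
    )
    Z_tr, Z_te = meta_features(make_base_models(seed), X_tr, y_tr, X_te, use_cv)
    meta_model = LogisticRegression().fit(Z_tr, y_tr)
    return meta_model.score(Z_te, y_te)


if __name__ == "__main__":
    print("stacking with cross-validation:   ", stacking_accuracy(use_cv=True))
    print("stacking without cross-validation:", stacking_accuracy(use_cv=False))
```

The two accuracies depend on the data split and the chosen models, so this sketch should not be expected to reproduce the 1.4% figure reported above; it only shows where cross-validation enters the construction of the meta-model's training data.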


Index Terms

Efficient healthcare service based on Stacking Ensemble
1. Computing methodologies
  1. Machine learning
    1. Machine learning algorithms
      1. Ensemble methods


Published in

ACM ICEA '20: Proceedings of the 2020 ACM International Conference on Intelligent Computing and its Emerging Applications
December 2020
219 pages
ISBN: 9781450383042
DOI: 10.1145/3440943

Copyright © 2020 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
Ensemble Learning
Health care
Overfitting
Qualifiers
- research-article
- Research
- Refereed limited
