skip to main content
10.1145/3573942.3573949acmotherconferencesArticle/Chapter ViewAbstractPublication PagesaiprConference Proceedingsconference-collections
research-article

A Prediction Model of Diabetes Based on Ensemble Learning

Published: 16 May 2023 Publication History

Abstract

Abstract: Diabetes is a common disease that seriously endangers human health, mostly in the middle-aged and the elderly. Predicting the incidence rate of diabetes enables doctors to make a scientific treatment plan in advance, which will significantly improve the cure rate and reduce the incidence rate. Based on this situation, this paper proposes a diabetes prediction model based on ensemble learning, which integrates some classical machine learning algorithms, including Logisticregression, Kneigbors, Decisiontree, GaussianNB, and support vector machine (SVM) The first four low correlation algorithms are constructed as basic learners, and then integrated into meta learner SVM to build an integrated learning model. The advantages of the comprehensive model are evaluated from the following aspects: accuracy, precision, recall rate, AUC, and other evaluation indicators. The experiment was carried out on the Pima Indian diabetes data set (PIDD) published by UCI. First, the XGboost algorithm was used to select the optimal features, and then an integrated learning model was constructed to predict. The experimental results show that the accuracy rate of the integrated learning model is 81.63%, the precision rate is 80%, the recall rate is 80%, and the AUC is 84%. The advantages of the model in accuracy, precision, recall, and AUC are verified. The model will effectively help doctors make more accurate diagnoses and predictions of patients' physical conditions and implement more scientific treatment.

References

[1]
Shao wei, Zhang Linshan, Cai Qingqing, Clinical analysis of 106 cases of type 2 diabetes complicated with insulin autoimmune syndrome[J].Chinese Journal of Endocrinology and metabolism,2020(04):304-308.
[2]
Lu Yanan, Li Wei, Li Yuxiu, Two cases of insulin autoimmune syndrome in patients with type 2 diabetes mediated by lipoic acid[J].Journal of Chinese Academy of Medical Sciences, 2020,42(05):702-705.
[3]
Zhang Zhichao.Study on the age distribution characteristics of diabetes and the related factors of younger onset [J].Basic Medical Forum,2018,22(10):1304-1305.
[4]
Kuang X Y,Hou H R. Current status of risk prediction models based on electronic medical record data in clinical care[J]. Journal of Nursing,2020,27(16):21-24.
[5]
Wraith S M,Aikins J S,Buchanan B G,et al. Computerized consultation system for selection of antimicrobial therapy[J].American Journal of Hospital Pharmacy,1976,33(12):1304-1308.
[6]
Zhao M D,Sun J A.Research progress of machine learning in cardiovascular disease diagnosis[J].Beijing Biomedical Engineering, 2020,39(2): 208-214.
[7]
Santhanam T,Padmavathi MS.Application of K-means and genetic algorithms for dimension reduction by integrating SVM for diabetes diagnosis[J].Procedia Computer Science,2015,47:76-83.
[8]
Xu Weifeng,Zhang Jianxin,Zhang Qiang,et al.Risk prediction of type II diabetes based on random forest model[C]//3rd Internation Conference on Advances in Electrical.IEEE,2017:382-386.
[9]
Vijayan VV, Anjali C.Prediction and diagnosis of diabetes mellitus-A machine learning approach[C]//Intelligent Computational Systems.IEEE,2016:122-127.
[10]
Miao F H, Li Y, Gao C,et al.Diabetes prediction method based on CatBoost algorithm[J]. Computer System Applications, 2019,28(9):215-218
[11]
Nikhil G, Priya C, Shravani P. Heart disease prediction using classification techniques:a comparative study[J]. International Journal of Management,IT and Engineering,2019,9(7):77-89.
[12]
Asuero A G,Sayago A, Gonzalez A G. The correlation coefficient:An overview[J]. Critical Reviews in Analytical Chemistry,2006,36(1):41- 59.
[13]
Breiman, L. Bagging predictors. Machine Learning, 1996,24(2):123–140.
[14]
Freund, Y, & Schapire, R. Experiments with a new boosting algorithm. InProceedings of the thirteenth international conference on machine learning, Bari, Italy 1996.pp,148–156.
[15]
Wolpert, D. H. Stacked generalization. Neural Networks,1992, 5(2), 241–259.

Cited By

View all
  • (2024)An Improved Ensemble Machine Learning Approach for Diabetes DiagnosisPertanika Journal of Science and Technology10.47836/pjst.32.3.1932:3(1335-1350)Online publication date: 4-Apr-2024
  • (2024)Development of Machine Learning Based Diabetes Mellitus Survival Prognostic Model2024 International Conference on Science, Engineering and Business for Driving Sustainable Development Goals (SEB4SDG)10.1109/SEB4SDG60871.2024.10629763(1-6)Online publication date: 2-Apr-2024

Index Terms

  1. A Prediction Model of Diabetes Based on Ensemble Learning

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    AIPR '22: Proceedings of the 2022 5th International Conference on Artificial Intelligence and Pattern Recognition
    September 2022
    1221 pages
    ISBN:9781450396899
    DOI:10.1145/3573942
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 16 May 2023

    Permissions

    Request permissions for this article.

    Check for updates

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    AIPR 2022

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)14
    • Downloads (Last 6 weeks)2
    Reflects downloads up to 01 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)An Improved Ensemble Machine Learning Approach for Diabetes DiagnosisPertanika Journal of Science and Technology10.47836/pjst.32.3.1932:3(1335-1350)Online publication date: 4-Apr-2024
    • (2024)Development of Machine Learning Based Diabetes Mellitus Survival Prognostic Model2024 International Conference on Science, Engineering and Business for Driving Sustainable Development Goals (SEB4SDG)10.1109/SEB4SDG60871.2024.10629763(1-6)Online publication date: 2-Apr-2024

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format.

    HTML Format

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media