skip to main content
10.1145/3326172.3326213acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicbetConference Proceedingsconference-collections
research-article

Predictive Analytics in Healthcare for Diabetes Prediction

Authors Info & Claims
Published:28 March 2019Publication History

ABSTRACT

Diabetes mellitus type 2 is a chronic disease which poses a serious challenge to human health worldwide. Globally, about 8.3% of the population is diagnosed with the disease. The applications of predictive analytics in diagnosis of diabetes are gaining significant momentum in medical research. The aim of this research paper is to aid medical professionals in the early detection and efficient diagnosis of Type 2 diabetes. We utilize bioinformatics theory and supervised machine learning techniques for improving the accuracy in predicting diabetes, based on 8 clinical measurements existing in the widely used PIMA dataset. We outline our methodology and highlight the implementation steps, while reviewing prominent past work in the field. Moreover, this paper fully exploits known machine learning algorithms and provides a detailed comparison of the results obtained from each method. The gradient boosting algorithm with parameter tuning proves to be the most successful, having an F1 Score of 0.853 and out of sample accuracy of 89.94%. Our prediction model focuses on computing the probability of the onset of diabetes in an individual based on their clinical data. The most crucial results of using this research within the healthcare sector are its cost-effectiveness and yielding of instant diagnosis. With this work, we intend to improve the process of diagnosing Type 2 diabetes and inspire other researchers to use machine learning based techniques for further inquiry into diabetes prediction.

References

  1. N.H. Choa, J.E. Shaw, S. Karuranga, Y. Huang, J.D. da Rocha Fernandes, A.W. Ohlrogge, B. Malanda. 2018. IDF Diabetes Atlas: Global estimates of diabetes prevalence for 2017 and projections for 2045. Diabetes Research and Clinical Practice Volume 138, April 2018, Pages 271--281Google ScholarGoogle Scholar
  2. SA, Meo & Zia, Inam & Bukhari, Ishfaq & Arain, Shoukat. 2016. Type 2 diabetes mellitus in Pakistan: Current prevalence and future forecast. JPMA. The Journal of the Pakistan Medical Association. 66. 1637--1642.Google ScholarGoogle Scholar
  3. Stephanie D. Zaugg, Godwin Dogbey, Karen Collins, Sharon Reynolds, Carter Batista, Grace Brannan, Jay H. Shubrook. 2014. Diabetes Numeracy and Blood Glucose Control: Association With Type of Diabetes and Source of Care. Clinical Diabetes Oct 2014, 32 (4) 152--157.Google ScholarGoogle Scholar
  4. Aiswarya Iyer, S. Jeyalatha, Ronak Sumbaly. 2015. Diagnosis of diabetes using classification mining techniques. International Journal of Data Mining & Knowledge Management Process (IJDKP), Vol.5, No.1, January 2015, pp. 1--14.Google ScholarGoogle ScholarCross RefCross Ref
  5. Andreas Mayr, Harald Binder, Olaf Gefeller, Matthias Schmid. 2014. The Evolution of Boosting Algorithms - From Machine Learning to Statistical Modelling. Methods Inf Med 2014; 53(6): 419--427.Google ScholarGoogle Scholar
  6. E. K. Hashi, M. S. U. Zaman and M. R. Hasan. 2017. An expert clinical decision support system to predict disease using classification techniques. 2017 International Conference on Electrical, Computer and Communication Engineering (ECCE), Cox's Bazar, pp. 396--400.Google ScholarGoogle Scholar
  7. Alghamdi M, Al-Mallah M, Keteyian S, Brawner C, Ehrman J, Sakr S. 2017. Predicting diabetes mellitus using SMOTE and ensemble machine learning approach: The Henry Ford ExercIse Testing (FIT) project. PLoS ONE 12(7): e0179805.Google ScholarGoogle ScholarCross RefCross Ref
  8. Kavakiotis, Ioannis and Tsave, Olga and Salifoglou, Athanasios and Maglaveras, N and Vlahavas, I and Chouvarda, Ioanna. 2017. Machine Learning and Data Mining Methods in Diabetes Research. Computational and Structural Biotechnology Journal. 15.Google ScholarGoogle Scholar
  9. G. D. Kalyankar, S. R. Poojara and N. V. Dharwadkar. 2017. Predictive analysis of diabetic patient data using machine learning and Hadoop. International Conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud) (I-SMAC), Palladam, pp. 619--624.Google ScholarGoogle Scholar
  10. A. Anand and D. Shakti. 2015. Prediction of diabetes based on personal lifestyle indicators. 1st International Conference on Next Generation Computing Technologies (NGCT), Dehradun, pp. 673--676.Google ScholarGoogle Scholar
  11. Dua, D. and Karra Taniskidou, E. 2017. UCI Machine Learning Repository. Irvine, CA: University of California, School of Information and Computer Science. DOI= http://archive.ics.uci.edu/ml.Google ScholarGoogle Scholar
  12. Keating BJ. 2015. Advances in risk prediction of type 2 diabetes: integrating genetic scores with Framingham risk models. Diabetes 64(5):1495--7.Google ScholarGoogle ScholarCross RefCross Ref
  13. M. Fathi Ganji and M. Saniee Abadeh. 2010. Using fuzzy ant colony optimization for diagnosis of diabetes disease. 18th Iranian Conference on Electrical Engineering, Isfahan, 2010, pp. 501--505.Google ScholarGoogle Scholar
  14. Mohamed, Ehab and Linder, Roland and Perriello, Gabriele and Daniele, Nicola and Pöppl, Siegfried and De Lorenzo, Antonino. 2002. Predicting Type 2 diabetes using an electronic nose-based artificial neural network analysis. Diabetes, nutrition & metabolism. 15. 215--21.Google ScholarGoogle Scholar
  15. R Chandvaniya, Jitendra and Aluvalu, Rajanikanth. 2014. Ranking with Distance based Outlier Detection Techniques: A Survey. International Journal of Computer Applications. 89.Google ScholarGoogle Scholar
  16. S. Vijayarani and S. Dhayanand. 2015. Data Mining Classification Algorithms for Kidney Disease Prediction. International Journal on Cybernetics & Informatics. 4. 13--25.Google ScholarGoogle Scholar
  17. T. Jayalakshmi and A. Santhakumaran. 2010. A Novel Classification Method for Diagnosis of Diabetes Mellitus Using Artificial Neural Networks. International Conference on Data Storage and Data Engineering, Bangalore, 2010, pp. 159--163. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Gorunescu, Florin. 2011. Data Mining: Concepts, models and techniques. Intelligent Systems Reference Library. Berlin Heidelberg, Springer-Verlag. p. 256--60Google ScholarGoogle Scholar
  19. Nirmala Devi, M., Balamurugan, A., & Reshma Kris, M. 2016. Developing a Modified Logistic Regression Model for Diabetes Mellitus and Identifying the Important Factors of Type II Dm. Indian Journal Of Science And Technology, 9(4).Google ScholarGoogle Scholar
  20. Shankaracharya, and Odedra, Devang and Mallick, Medhavi and Shukla, Prateek and Samanta, Subir and Vidyarthi, Ambarish. 2011. Java-Based Diabetes Type 2 Prediction Tool for Better Diagnosis. Diabetes technology & therapeutics. 14. 251--6.Google ScholarGoogle Scholar

Index Terms

  1. Predictive Analytics in Healthcare for Diabetes Prediction

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Other conferences
        ICBET '19: Proceedings of the 2019 9th International Conference on Biomedical Engineering and Technology
        March 2019
        327 pages
        ISBN:9781450361309
        DOI:10.1145/3326172

        Copyright © 2019 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 28 March 2019

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article
        • Research
        • Refereed limited

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader