Abstract
Nowadays, medical data classification plays an important role in healthcare informatics applications such as disease prediction, classification, etc. The recently developed machine learning and deep learning models are commonly employed for effective medical data classification, which can be applied for disease diagnosis. The existing techniques make few shortcomings in terms of computational complexity, higher-dimensional features, higher execution time, etc. To tackle these issues, this paper develops a new classification model with metaheuristic algorithm based optimal feature selection for Chronic Kidney Disease diagnosis. Initially, the data with missing values were evacuated in the pre-processing stage. Then, the best subset of features selected by a metaheuristic algorithm named Oppositional based FireFly Optimization algorithm, which helps in the prediction or classification of the disease more accurately. The incorporation of oppositional based learning concept helps to improve the convergence rate of FireFly algorithm. For classification, Deep Neural Network was proposed to diagnose the existence of CKD. The effectiveness of the proposed feature selection-based classifier was tested on the dataset as measures of sensitivity, specificity, and accuracy. The results concluded that the proposed algorithm achieved a high accuracy rate when compared to the algorithms of existing classifier models.
Similar content being viewed by others
Availability of data and materials
We thank the UCI Machine Learning Repository for publishing attribute information of our own and original chronic kidney disease dataset: L.Jerlin Rubini, P.Eswaran & Dr.P.Soundarapandian, M.D., D.M (Senior Consultant Nephrologist). https://archive.ics.uci.edu/ml/datasets/Chronic_Kidney_Disease
References
Akben SB (2018) Early stage chronic kidney disease diagnosis by applying data mining methods to urinalysis blood analysis and disease history. IRBM 39(5):353–358. https://doi.org/10.1016/j.irbm.2018.09.004
Almansour NA, Syed HF, Khayat NR, Altheeb RK, Juri RE, Alhiyafi J, Alrashed S, Olatunji SO (2019) Neural network and support vector machine for the prediction of chronic kidney disease: a comparative study. Comput Biol Med 109:101–111. https://doi.org/10.1016/j.compbiomed.2019.04.017
AlMuhaideb S, Menai MEB (2016) An individualized preprocessing for medical data classification. Procedia Comput Sci 82:35–42. https://doi.org/10.1016/j.procs.2016.04.006
Brezočnik L, Fister I, Podgorelec V (2018) Swarm intelligence algorithms for feature selection: a review. Appl Sci 8(1521):1–31. https://doi.org/10.3390/app8091521
Cigdem O, Demirel H (2018) Performance analysis of different classification algorithms using different feature selection methods on Parkinson’s disease detection. J Neurosci Methods 309:81–90. https://doi.org/10.1016/j.jneumeth.2018.08.017
Dash R (2018) An adaptive harmony search approach for gene selection and classification of high dimensional medical data. J King Saud Univ-Comput Inf Sci 33:195–207. https://doi.org/10.1016/j.jksuci.2018.02.013
Eddy S, Mariani LH, Kretzler M (2020) Integrated multi-omics approaches to improve classification of chronic kidney disease. Nat Rev Nephrol 16(11):657–668. https://doi.org/10.1038/s41581-020-0286-5
Harb HM, Desuky AS (2014) Feature selection on classification of medical datasets based on particle swarm optimization. Int J Comput Appl 104(5):14–17. https://doi.org/10.5120/18197-9118
Harimoorthy K, Thangavelu M (2020) Multi-disease prediction model using improved SVM-radial bias technique in healthcare monitoring system. J Ambient Intell Human Comput 12:3715–3723. https://doi.org/10.1007/s12652-019-01652-0
Jain D, Singh V (2018) Feature selection and classification systems for chronic disease prediction: a review. Egypt Inform J 19(3):179–189. https://doi.org/10.1016/j.eij.2018.03.002
Jerlin RL, Eswaran P (2020a) Hybrid kernel support vector machine classifier and grey wolf optimization algorithm based intelligent classification algorithm for chronic kidney disease. J Med Imaging Health Inform 10(10):2297–2307. https://doi.org/10.1166/jmihi.2020.3177
Jerlin RL, Eswaran P (2020b) An effect of machine learning based classification algorithms on chronic kidney disease. Int J Inno Tech Explo Eng 9:2249–2256. https://doi.org/10.35940/ijitee.C9012.019320
Jerlin RL, Eswaran P (2020c) Efficient classification of chronic kidney disease by using multi-kernel support vector machine and fruit fly optimization algorithm. Int J Imag Syst Tech 30(3):660–673. https://doi.org/10.1002/ima.22406
Jerlin RL, Eswaran P (2020d) Optimal feature selection methods for chronic kidney disease classification using intelligent optimization algorithms. Recent Adv Comput Sci Commun 14(9):2894–2906. https://doi.org/10.2174/2666255813999200818131835
Jerlin RL, Eswaran P, Soundarapandian P (2015) UCI chronic kidney disease dataset. School of Information and Computer Sciences. University of California Irvine. CA. USA. https://archive.Ics.uci.edu/ml/datasets/Chronic_Kidney_Disease
Kasthurirathne SN, Dixon BE, Gichoya J, Xu H, Xia Y, Mamlin B, Grannis SJ (2017) Toward better public health reporting using existing off the shelf approaches: the value of medical dictionaries in automated cancer detection using plaintext medical data. J Biomed Inform 69:160–176. https://doi.org/10.1016/j.jbi.2017.04.008
Lakshmi KS, Vadivu G (2019) A novel approach for disease comorbidity prediction using weighted association rule mining. J Ambient Intell Human Comput. https://doi.org/10.1007/s12652-019-01217-1
Levey AS, de Jong PE, Coresh J et al (2011) The definition, classification, and prognosis of chronic kidney disease: a KDIGO controversies conference report. Kidney Int 80(1):17–28. https://doi.org/10.1038/ki.2010.483
Lynch CM, Abdollahi B, Fuqua JD, de Carlo AR, Bartholomai JA, Balgemann RN, van Berkel VH, Frieboes HB (2017) Prediction of lung cancer patient survival via supervised machine learning classification techniques. Int J Med Inform 108:1–8. https://doi.org/10.1016/j.ijmedinf.2017.09.013
Maniruzzaman Md, Nishith K, Menhazul AM, Shaykhul IM, Harman SS, Ayman SE, Jasjit SS (2017) Comparative approaches for classification of diabetes mellitus data: machine learning paradigm. Comput Meth Prog Bio 152:23–34. https://doi.org/10.1016/j.cmpb.2017.09.004
Mohapatra P, Chakravarty S, Dash PK (2015) An improved cuckoo search based extreme learning machine for medical data classification. Swarm Evolut Comput 24:25–49. https://doi.org/10.1016/j.swevo.2015.05.003
Mohapatra P, Chakravarty S, Dash PK (2016) Microarray medical data classification using kernel ridge regression and modified cat swarm optimization based gene selection system. Swarm Evolut Comput 28:144–160. https://doi.org/10.1016/j.swevo.2016.02.002
Moreira MW, Rodrigues JJ, Carvalho FH, Chilamkurti N, Al-Muhtadi J, Denisov V (2019) Biomedical data analytics in mobile-health environments for high-risk pregnancy outcome prediction. J Ambient Intell Human Comput 10(10):4121–4134. https://doi.org/10.1007/s12652-019-01230-4
Naganjaneyulu S, Rao BS (2018) A novel feature selection based classification algorithm for real-time medical disease prediction. In: international conference on cognitive informatics & cognitive computing. IEEE, pp. 275–282. https://doi.org/10.1109/ICCI-CC.2018.8482045
Rady EA, Anwar AS (2019) Prediction of kidney disease stages using data mining algorithms. Inform Med Unlocked 15:100178. https://doi.org/10.1016/j.imu.2019.100178
Rubini LJ, Eswaran P (2017) Optimal fuzzy min-max neural network for medical data classification using group search optimiser algorithm. Int J Mob Netw Des Innov 7(3/4):140–149. https://doi.org/10.1504/IJMNDI.2017.089285
Salekin A, Stankovic J (2016) Detection of chronic kidney disease and selecting important predictive attributes. In: international conference on healthcare informatics. IEEE, pp 262–270. https://doi.org/10.1109/ICHI.2016.36
Seera M, Lim CP (2014) A hybrid intelligent system for medical data classification. Expert Syst Appl 41(5):2239–2249. https://doi.org/10.1016/j.eswa.2013.09.022
Shen L, Chen H, Yu Z, Kang W, Zhang B, Li H, Yang B, Liu D (2016) Evolving support vector machines using fruit fly optimization for medical data classification. Knowl-Based Syst 96:61–75. https://doi.org/10.1016/j.knosys.2016.01.002
Tomcikova D, Felsoci M, Spinar J et al (2013) Risk of in-hospital mortality identified according to the typology of patients with acute heart failure: classification tree analysis on data from the acute heart failure database–main registry. J Crit Care 28(3):250–258. https://doi.org/10.1016/j.jcrc.2012.09.014
Yang S, Guo JZ, Jin JW (2018) An improved Id3 algorithm for medical data classification. Comput Electr Eng 65:474–487. https://doi.org/10.1016/j.compeleceng.2017.08.005
Yelipe U, Porika S, Golla M (2018) An efficient approach for imputation and classification of medical data values using class-based clustering of medical records. Comput Electr Eng 66(c):487–504. https://doi.org/10.1016/j.compeleceng.2017.11.030
Zhao J, Gu S, McDermaid A (2019) Predicting outcomes of chronic kidney disease from EMR data based on random forest regression. Math Biosci 310:24–30. https://doi.org/10.1016/j.mbs.2019.02.001
Acknowledgements
We thank Dr.P.Soundarapandian, M.D., D.M (Senior Consultant Nephrologist) and Apollo Hospital, Karaikudi, India for their support and facility at the time of CKD data collection and preliminary study.
Funding
This article has been written with the financial support of RUSA–Phase 2.0 grant sanctioned vide Letter No. F. 24-51/2014-U, Policy (TNMulti-Gen), Dept. of Edn. Govt. of India, Dt. 09.10.2018.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors of the present research have no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Lambert, J.R., Perumal, E. Oppositional firefly optimization based optimal feature selection in chronic kidney disease classification using deep neural network. J Ambient Intell Human Comput 13, 1799–1810 (2022). https://doi.org/10.1007/s12652-021-03477-2
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12652-021-03477-2