Abstract
Heart diseases are an essential research topic in healthcare institutions around the world. Therefore, using machine learning and optimization algorithms attracts attention as an important method in detecting heart diseases. Additionally, the factors that affect heart disease are a matter of current debate. In this study, an effective DFD method is proposed using optimization techniques for classifying heart diseases and examining the factors affecting the disease. Initially, the study employs classical machine learning and ensemble algorithms for classification. Subsequently, feature selection is performed using BEO, BSPO, GA, and GFO methods, and the importance levels of features are determined utilizing the DFD approach. The results indicate that the ensemble model achieved an accuracy of 86.34% without optimization methods, whereas the proposed DFD method, when applied in conjunction with ensemble models, increased the accuracy to 99.08%. Therefore, it is observed that ensemble models yield the highest results when used in conjunction with optimization algorithms. The outcomes identified using the DFD method, which are clinically significant, are believed to hold great importance in reducing the number of heart patients and enhancing treatment.






Similar content being viewed by others
Data availability
The data that support the findings of this study are available from the corresponding author upon reasonable request.
References
"Cardiovascular diseases." https://www.who.int/health-topics/cardiovascular-diseases/#tab=tab_1 (accessed Feb. 13, 2023).
Bharti, R., Khamparia, A., Shabaz, M., Dhiman, G., Pande, S., Singh, P.: Prediction of heart disease using a combination of machine learning and deep learning, Comput. Intell. Neurosci., 2021, (2021).
FJ Wolters 2018 Coronary heart disease, heart failure, and the risk of dementia: A systematic review and meta-analysis Alzheimer's Dement. 14 11 1493 1504 https://doi.org/10.1016/J.JALZ.2018.01.007
MV Dogan IM Grumbach JJ Michaelson RA Philibert 2018 Integrated genetic and epigenetic prediction of coronary heart disease in the Framingham Heart Study PLoS ONE 13 1 e0190549 https://doi.org/10.1371/JOURNAL.PONE.0190549
K Sabanci MF Aslan E Yigit A Kayabasi A Toktas H Duysak 2017 A comparative classification of wheat grains for artificial neural network and extreme learning machine Int. J. Environ. Trends 1 14 21
MO Kaya 2021 Performance evaluation of multi-layer perceptron artificial neural network model in the classification of heart failure J. Cogn. Syst. 6 1 35 38
K Shameer KW Johnson BS Glicksberg JT Dudley PP Sengupta 2018 Machine learning in cardiovascular medicine: Are we there yet? Heart 104 14 1156 1164 https://doi.org/10.1136/heartjnl-2017-311198
M Öğrenme Y Dayali K Kalp H Erken T Rüstem Yilmaz FH Yağın 2022 Early detection of coronary heart disease based on machine learning methods Med. Rec. 4 1 1 6 https://doi.org/10.37990/MEDR.1011924
SM Birjandi SH Khasteh 2021 A survey on data mining techniques used in medicine J. Diabetes Metab. Disord. 20 2 2055 2071 https://doi.org/10.1007/S40200-021-00884-2/TABLES/1
A Basati MM Faghih 2022 PDAE: Efficient network intrusion detection in IoT using parallel deep auto-encoders Inf. Sci. (Ny) 598 57 74 https://doi.org/10.1016/J.INS.2022.03.065
İB Çiçek Z Küçükakçalı C Çolak 2020 Assocıatıve classıfıcatıon approach can predıct prostate cancer based on the extracted assocıatıon rules J. Cogn. Syst. 5 2 51 54
Z Tunç İB Çiçek E Güldoğan 2020 Performance evaluatıon of the deep learnıng models ın the classıfıcatıon of heart attack and determınatıon of related factors J. Cogn. Syst. 5 2 99 103
Dun, B., Wang, E., Majumder, S.: Heart Disease Diagnosis on Medical Data Using Ensemble Learning.
F Yaghouby A Ayatollahi R Soleimani 2009 Classification of cardiac abnormalities using reduced features of heart rate variability signal World Appl. Sci. J. 6 11 1547 1554
BM Asl SK Setarehdan M Mohebbi 2008 Support vector machine-based arrhythmia classification using reduced features of heart rate variability signal Artif. Intell. Med. 44 1 51 64 https://doi.org/10.1016/J.ARTMED.2008.04.007
Hussain, L., Awan, I. A., Aziz, W., Saeed, S., Ali, A., Zeeshan, F., & Kwak, K. S.: Detecting congestive heart failure by extracting multimodal features and employing machine learning techniques. BioMed. Res. Int. (2020). https://doi.org/10.1155/2020/4281243
Alim, M. A., Habib, S., Farooq, Y., Rafay, A.: Robust Heart Disease Prediction: A Novel Approach based on Significant Feature and Ensemble learning Model, In: 2020 3rd International Conference on Computer Mathematics Engineering Technology. Idea to Innov. Build. Knowl. Econ. iCoMET 2020, (2020), doi: https://doi.org/10.1109/ICOMET48670.2020.9074135.
R Kannan V Vasanthi 2019 "Machine learning algorithms with ROC curve for predicting and diagnosing the heart disease," SpringerBriefs Appl Sci. Technol. https://doi.org/10.1007/978-981-13-0059-2_8/COVER
Atallah, R., Al-Mousa, A.: Heart disease detection using machine learning majority voting ensemble method, In: 2019 2nd International Conference on New Trends Computer Science ICTCS 2019 - Proc., (2019), doi: https://doi.org/10.1109/ICTCS.2019.8923053
P Kiran A Swathi M Sindhu Y Manikanta K Mahesh Babu 2022 Effective heart disease prediction using hybrid machine learning technique South Asian J. Eng. Technol. 12 3 123 130 https://doi.org/10.26524/SAJET.2022.12.49
Nishat, M. M., Faisal, F., Hasan Udoy, M.: Performance Evaluation and Comparative Analysis of Different Machine Learning Algorithms in Predicting Cardiovascular Disease, (2021), Accessed: Mar. 06, 2023. [Online]. Available: https://www.researchgate.net/publication/351706115
I Tougui A Jilbab J Mhamdi El 2020 Heart disease classification using data mining tools and machine learning techniques Health Technol. (Berl) 10 5 1137 1144 https://doi.org/10.1007/S12553-020-00438-1/FIGURES/2
R Katarya SK Meena 2021 Machine learning techniques for heart disease prediction: A comparative study and analysis Health Technol. (Berl) 11 1 87 97 https://doi.org/10.1007/S12553-020-00505-7/TABLES/3
V Chang VR Bhavani AQ Xu M Hossain 2022 An artificial intelligence model for heart disease detection using machine learning algorithms Healthc. Anal. 2 100016https://doi.org/10.1016/J.HEALTH.2022.100016
MA Kadhim AM Radhi 2023 Heart disease classification using optimized Machine learning algorithms Iraqi J. Comput. Sci. Math. 4 2 31 42 https://doi.org/10.52866/ijcsm.2023.02.02.004
X Wei C Rao X Xiao L Chen M Goh 2023 Risk assessment of cardiovascular disease based on SOLSSA-CatBoost model Expert Syst. Appl. 219 February 119648 https://doi.org/10.1016/j.eswa.2023.119648
J Saputra C Lawrencya JM Saini S Suharjito 2023 Hyperparameter optimization for cardiovascular disease data-driven prognostic system Vis Comput. Ind. Biomed. Art https://doi.org/10.1186/s42492-023-00143-6
A Faramarzi S Mirjalili M Heidarinejad 2022 Binary equilibrium optimizer: Theory and application in building optimal control problems Energy Build. 277 112503 https://doi.org/10.1016/J.ENBUILD.2022.112503
Kennedy, J., Eberhart, R. C.: A discrete binary version of the particle swarm algorithm, İn: Proceedings of the 1997 IEEE International Conference on Systems, Man, and Cybernetics. Computational Cybernetics and Simulation, IEEE, Orlando, FL, USA, (1997).
Guo, S.-S., Wang, J.-S., Guo, M.-W.: Z-Shaped Transfer Functions for Binary Particle Swarm Optimization Algorithm, (2020), doi: https://doi.org/10.1155/2020/6502807.
H Nezamabadi-Pour M Rostami-Shahrbabaki MM Farsangi 2008 Binary particle swarm optimization: Challenges and new solutions J. Comput. Soc. Iran Comput. Sci. Eng. 6 21 32
A El-Maleh A Sheikh S Sait 2013 A soft computing, and undefined, Binary particle swarm optimization (BPSO) based state assignment for area minimization of sequential circuits Appl. Soft Comput. 13 12 4832 4840
Jh, H.: Adaptation in natural and artificial systems, Ann Arbor (1975).
A Konak DW Coit AE Smith 2006 Multi-objective optimization using genetic algorithms: A tutorial Reliab. Eng. Syst. Saf. 91 9 992 1007 https://doi.org/10.1016/J.RESS.2005.11.018
H Faris I Aljarah MA Al-Betar S Mirjalili 2018 Grey wolf optimizer: a review of recent variants and applications Neural Comput. Appl. 30 2 413 435 https://doi.org/10.1007/S00521-017-3272-5/TABLES/4
S Mirjalili A Lewis 2014 Grey Wolf Optimizer Adv. Eng. Softw. 69 46 61
Koc, I., Baykan, O., Dergisi, I. B.-P.L.: Gri kurt optimizasyon algoritmasına dayanan çok seviyeli imge eşik seçimi, dergipark.org.trI Koc, OK Baykan, I BabaogluPoliteknik Dergisi, 2018 dergipark.org.tr, (2018) doi: https://doi.org/10.2339/politeknik.389613.
Rymarczyk, T., Kozłowski, E., Kłosowski, G., Niderla K.: Sensors, and undefined 2019, "Logistic regression for machine learning in process tomography," mdpi.comT Rymarczyk, E Kozłowski, G Kłosowski, K NiderlaSensors, 2019 mdpi.com, (2019), doi: https://doi.org/10.3390/s19153400.
B Ning W Junwei H Feng 2019 Spam message classification based on the Na{"\i}ve Bayes classification algorithm IAENG Int. J. Comput. Sci. 46 1 46 53
Asfi -Ar-, M., Asif, R., Nishat, M. M., Faisal, F., Hasan Udoy, M.: Performance Evaluation and Comparative Analysis of Different Machine Learning Algorithms in Predicting Cardiovascular Disease Machine learning for healthcare applications View project DC motor braking system View project, (2021), Accessed: Mar. 27, 2023. [Online]. Available: https://www.researchgate.net/publication/351706115
C-C Le P Prasad A Alsadoon L Pham A Elchouemi 2019 Text classification: Naive Bayes classifier with sentiment lexicon IAENG Int. J. Comput. Sci. 46 2 141 148
Li, Y., Yang, Y., Che, J., Zhang, L.: Predicting the Number of Nearest Neighbor for kNN Classifier.
Arafat, M., Hoque, S., Xu, S., Farid, D.: Machine learning for mining imbalanced data, (2019).
Reund, Y. F., Mason, L.: The alternating decision tree learning algorithm, [Accessed on 27 Mar 2023]. [Online]. Available: http://www.rulequest.com/.
S Alam SK Sonbhadra S Agarwal P Nagabhushan 2020 One-class support vector classifiers: A survey Knowledge-Based Systems 196 105754
V Vapnik 1999 The nature of statistical learning theory Springer Science & Business Media New York
M Skurichina RPW Duin 2002 Bagging, boosting and the random subspace method for linear classifiers Pattern Anal. Appl. 5 2 121 135 https://doi.org/10.1007/S100440200011/METRICS
T Yan SL Shen A Zhou X Chen 2022 Prediction of geological characteristics from shield operational parameters by integrating grid search and K-fold cross validation into stacking classification algorithm J. Rock Mech. Geotech. Eng. 14 4 1292 1303 https://doi.org/10.1016/J.JRMGE.2022.03.002
N Mahendran 2019 Sensor-Assisted Weighted Average Ensemble Model for Detecting Major Depressive Disorder Sensors. 19 22 4822 https://doi.org/10.3390/S19224822
LI Kuncheva JJ Rodríguez 2014 A weighted voting framework for classifiers ensembles Knowl. Inf. Syst. 38 2 259 275 https://doi.org/10.1007/S10115-012-0586-6/FIGURES/7.profitsectors
Acknowledgements
The author gratefully acknowledges the partial support of the Faculties of Engineering at Kırıkkale University.
Funding
This research received no external funding.
Author information
Authors and Affiliations
Contributions
Fuat Türk (FT) carried out the most of implementations and simulations for this manuscript. FT provided core concepts and drafted the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Conflict of interest
The authors declare no conflict of interest.
Ethics approval and consent to participate
Not applicable.
Informed consent
The author agreed on the final approval of the version to be published.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Türk, F. Investigation of machine learning algorithms on heart disease through dominant feature detection and feature selection. SIViP 18, 3943–3955 (2024). https://doi.org/10.1007/s11760-024-03060-0
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11760-024-03060-0