Abstract
Classification of medical data is largely dependent on the effective identification of key features of the data that can be used to aid in the diagnosis of related diseases. This goal can be achieved through feature selection methods that endeavor to get rid of redundant and irrelevant features to ameliorate classification accuracy. This is the aim of this work where a new meta-heuristic, referred to as snake optimizer, was adopted for the purpose of boosting the performance of existing feature selection methods. This optimizer may smoothly fall into local optimal solutions, which may present weak search performance and slow convergence speeds in handling feature selection problems. On this basis, this paper presents three improved adaptive versions of this optimizer, each of which has increased search performance over the basic optimizer. This optimizer was improved using three mathematical models named exponential, power, and delayed S-shaped, to create three methods, referred to as exponential, power, and delayed S-shaped snake optimizers, respectively. These proposed versions were also matured to have more balance between exploration and exploitation aspects. Then, binary variants of these optimizers were evolved to solve feature selection problems using the k-nearest neighbor classifier. To verify the efficacy of these binary optimizers, 24 datasets were used, and then compared with other feature selection optimizers. The experimental results obviously manifested the efficiency of the proposed optimizers in realizing the optimal feature set by achieving utmost accuracy and minimal number of features in the majority of the studied datasets. The proposed binary power snake optimizer outperformed all other competitors in 13, 10, 8, 8, and 12 datasets in respect of classification accuracy, number of chosen attributes, specificity, sensitivity, and fitness scores, respectively. Out of the 24 datasets taken into consideration, the results on 12, 6, and 8 datasets, respectively, showed that this proposed optimizer presented performance scores of more than 90% in respect of sensitivity, accuracy, and specificity metrics.
Similar content being viewed by others
Data availability
The used datasets in this research are well-known datasets that are available online to the researchers.
References
Abed MM, Belal A-K, Mohammed Y, Mostafa Salama A, Seifedine K, Hameed AK, Begonya G-Z (2022) Novel crow swarm optimization algorithm and selection approach for optimal deep learning COVID-19 diagnostic model. Comput Intell Neurosci 2022:1307944
Abouelmagd LM, Shams MY, El-Attar NE, Hassanien AE (2022) Feature selection based coral reefs optimization for breast cancer classification. In: Medical informatics and bioimaging using artificial intelligence. Springer, pp 53–72
Ahmadianfar I, Heidari AA, Gandomi AH, Chu X, Chen H (2021) Run beyond the metaphor: an efficient optimization algorithm based on Runge Kutta method. Expert Syst Appl 181:115079
Akram KM, Sihem S, Okba K (2021) Convolution neural network based marine predator algorithm for COVID-19 detection. In: 2021 International conference on information systems and advanced technologies (ICISAT). IEEE, 1–4
Alamir N, Kamel S, Megahed TF, Hori M, Abdelkader SM (2022) Energy management of multi-microgrid considering demand response using snake optimizer. In: 2022 23rd international middle east power systems conference (MEPCON). IEEE, pp 1–6
Albashish D, Hammouri AI, Braik M, Atwan J, Sahran S (2021) Binary biogeography-based optimization based SVM-RFE for feature selection. Appl Soft Comput 101:107026
Aljarah I, Mafarja M, Heidari AA, Faris H, Zhang Y, Mirjalili S (2018) Asynchronous accelerating multi-leader salp chains for feature selection. Appl Soft Comput 71:964–979
Altman NS (1992) An introduction to kernel and nearest-neighbor nonparametric regression. Am Stat 46(3):175–185
Alweshah M, Alkhalaileh S, Al-Betar MA, Bakar AA (2022) Coronavirus herd immunity optimizer with greedy crossover for feature selection in medical diagnosis. Knowl Based Syst 235:107629
Amit K, Rajneesh S (2022) Jaya-optimized fuzzy reinforcement learning classifier for COVID-19. IETE J Res. https://doi.org/10.1080/03772063.2022.2027287
Awadallah MA, Al-Betar MA, Braik MS, Hammouri AI, Doush IA, Zitar RA (2022a) An enhanced binary rat swarm optimizer based on local-best concepts of PSO and collaborative crossover operators for feature selection. Comput Biol Med 147:105675
Awadallah MA, Hammouri AI, Al-Betar MA, Braik MS, Abd EM (2022b) Binary horse herd optimization algorithm with crossover operators for feature selection. Comput Biol Med 141:105152
Ayça D, Ezgi KH, Ender S, Tansel D (2022) Predicting the severity of COVID-19 patients using a multi-threaded evolutionary feature selection algorithm. Expert Syst 39:e12949
Bandyopadhyay R, Basu A, Cuevas E, Sarkar R (2021) Harris Hawks optimisation with simulated annealing as a deep feature selection method for screening of COVID-19 CT-scans. Appl Soft Comput 111:107698
Braik M (2023) Enhanced Ali Baba and the forty thieves algorithm for feature selection. Neural Comput Appl 35(8):6153–6184
Braik MS, Hammouri AI, Awadallah MA, Al-Betar MA, Khtatneh K (2023a) An improved hybrid chameleon swarm algorithm for feature selection in medical diagnosis. Biomed Signal Process Control 85:105073
Braik M, Awadallah MA, Al-Betar MA, Hammouri AI, Alzubi OA (2023b) Cognitively enhanced versions of capuchin search algorithm for feature selection in medical diagnosis: a COVID-19 case study. Cognit Comput. https://doi.org/10.1007/s12559-023-10149-0
Cao Y, Li Y, Zhang G, Jermsittiparsert K, Nasseri M (2020) An efficient terminal voltage control for PEMFC based on an improved version of whale optimization algorithm. Energy Rep 6:530–542
Chen K, Zhou F-Y, Yuan X-F (2019) Hybrid particle swarm optimization with spiral-shaped mechanism for feature selection. Expert Syst Appl 128:140–156
Chen H, Wang M, Zhao X (2020) A multi-strategy enhanced sine cosine algorithm for global optimization and constrained practical engineering problems. Appl Math Comput 369:124872
Cheng R, Qiao Z, Li J, Huang J (2023) Traffic signal timing optimization model based on video surveillance data and snake optimization algorithm. Sensors 23(11):5157
Cheng K, Zhang J, Tian S, Liu H, Gong J, Xie Y (2022) Wifi localization algorithm based on snake optimization algorithm to optimize bp neural network. In: 2022 International conference on image processing, computer vision and machine learning (ICICML). IEEE, 615–618
Crow LH (1974) Reliability analysis for complex repairable systems, soc. industrial and applied mathematics, reliability and biometry. Proc Stat Anal Life Length 25:248–253
Dai Y, Pang J, Li Z, Li W, Wang Q, Li S (2022) Modeling of thermal error electric spindle based on KELM ameliorated by snake optimization. Case Stud Therm Eng 40:102504
Das AK, Sengupta S, Bhattacharyya S (2018) A group incremental feature selection for classification using rough set theory based genetic algorithm. Appl Soft Comput 65:400–411
Davazdahemami B, Zolbanin Hamed M, Delen D (2022) An explanatory machine learning framework for studying pandemics: the case of COVID-19 emergency department readmissions. Decis Support Syst 161:113730
Dey A, Chattopadhyay S, Singh PK, Ahmadian A, Ferrara M, Senu N, Sarkar R (2021) MRFGRO: a hybrid meta-heuristic feature selection method for screening COVID-19 using deep features. Sci Rep 11(1):1–15
Dokeroglu T, Deniz A, Kiziloz HE (2022) A comprehensive survey on recent metaheuristics for feature selection. Neurocomputing 494:1–2
Elminaam DSA, Nabil A, Ibraheem SA, Houssein EH (2021) An efficient marine predators algorithm for feature selection. IEEE Access 9:60136–60153
Eluri RK, Devarakonda N (2022) Binary golden eagle optimizer with time-varying flight length for feature selection. Knowl Based Syst 247:108771
Eluri RK, Devarakonda N (2023) Feature selection with a binary flamingo search algorithm and a genetic algorithm. Multimed Tools Appl 82:1–52
Fatih AM, Kadir S, Akif D, Fahri UM (2022) COVID-19 diagnosis using state-of-the-art CNN architecture features and Bayesian optimization. Comput Biol Med 142:105244
Gang H, Rui Y, Muhammad A, Guo W (2023) BEESO: multi-strategy boosted snake-inspired optimizer for engineering applications. J Bionic Eng 20:1–37
Gao D, Wang G-G, Pedrycz W (2020) Solving fuzzy job-shop scheduling problem using de algorithm improved by a selection mechanism. IEEE Trans Fuzzy Syst 28(12):3265–3275
Ghadimi N, Akbarimajd A, Shayeghi H, Abedinia O (2018) Two stage forecast engine with feature selection technique and improved meta-heuristic algorithm for electricity load forecasting. Energy 161:130–142
Hashim Fatma A, Hussien Abdelazim G (2022) Snake optimizer: a novel meta-heuristic optimization algorithm. Knowl Based Syst 242:108320
Heidari AA, Delavar MR (2016) A modified genetic algorithm for finding fuzzy shortest paths in uncertain networks. Int Arch Photogramm Remote Sens Spat Inf Sci 41:299
Heidari AA, Mirjalili S, Faris H, Aljarah I, Mafarja M, Chen H (2019) Harris Hawks optimization: algorithm and applications. Future Gener Comput Syst 97:849–872
Jaddi NS, Abadeh MS (2022) Cell separation algorithm with enhanced search behaviour in miRNA feature selection for cancer diagnosis. Inf Syst 104:101906
Jadhav S, He H, Jenkins K (2018) Information gain directed genetic algorithm wrapper feature selection for credit rating. Appl Soft Comput 69:541–553
Jain I, Jain VK, Jain R (2018) Correlation feature selection based improved-binary particle swarm optimization for gene selection and cancer classification. Appl Soft Comput 62:203–215
Jiao H, Chen H, Heidari AA, Wang M, Zhang X, Chen Y, Pan Z (2021) Orthogonal learning covariance matrix for defects of grey wolf optimizer: insights, balance, diversity, and feature selection. Knowl Based Syst 213:106684
Jiaze T, Chen H, Liu J, Heidari AA, Zhang X, Wang M, Ruby R, Pham Q-V (2021) Evolutionary biogeography-based whale optimization methods with communication structure: towards measuring the balance. Knowl Based Syst 212:106642
Jiaze T, Chen H, Wang M, Gandomi AH (2021) The colony predation algorithm. J Bionic Eng 18(3):674–710
Kanya KL, Naga JB (2022) An adaptive teaching learning based optimization technique for feature selection to classify mammogram medical images in breast cancer detection. Int J Syst Assur Eng Manag. https://doi.org/10.1007/s13198-021-01598-7
Khurma RA, Aljarah I, Sharieh A (2020) Rank based moth flame optimisation for feature selection in the medical application. In: 2020 IEEE congress on evolutionary computation (CEC). IEEE, pp 1–8
Khurma RA, Aljarah I, Sharieh A, Elaziz MA, Damaševičius R, Krilavičius T (2022) A review of the modification strategies of the nature inspired algorithms for feature selection problem. Mathematics 10(3):464
Khurma RA, Albashish D, Braik M, Alzaqebah A, Qasem A, Adwan O (2023) An augmented snake optimizer for diseases and COVID-19 diagnosis. Biomed Signal Process Control 84:104718
Klimov PV, Kelly J, Martinis JM, Neven H (2020) The snake optimizer for learning quantum processor control parameters. arXiv preprint arXiv:2006.04594
Kumar DC, Senthil AV, Jagadamba G, Musirin IB (2022) Evolving chaotic shuffled frog leaping memetic metaheuristic model-based feature subset selection for Alzheimerâ’s disease detection. In: Sustainable communication networks and application. Springer, pp 679–692
Kwak N, Choi C-H (2002) Input feature selection for classification problems. IEEE Trans Neural Netw 13(1):143–159
Le TM, Vo TM, Pham TN, Truong SVD (2020) A novel wrapper-based feature selection for early diabetes prediction enhanced with a metaheuristic. IEEE Access 9:7869–7884
Li S, Ye L (2023) Multi-level thresholding image segmentation for rubber tree secant using improved Otsuâ’s method and snake optimizer. Math Biosci Eng 20:9645–9669
Li S, Chen H, Wang M, Heidari AA, Mirjalili S (2020) Slime mould algorithm: a new method for stochastic optimization. Future Gener Comput Syst 111:300–323
Li Y, Tang B, Jiao S (2023) So-slope entropy coupled with SVMD: a novel adaptive feature extraction method for ship-radiated noise. Ocean Eng 280:114677
Li Z, Cheng Z, Wang Y (2022) Parameter tuning of active disturbance rejection control based on improved snake optimization algorithm. In: 2022 International conference on artificial intelligence, information processing and cloud computing (AIIPCC). IEEE, 316–322
Liping C, Tahereh R (2021) A new optimal diagnosis system for coronavirus (COVID-19) diagnosis based on Archimedes optimization algorithm on chest x-ray images. Comput Intell Neurosci 2021:7788491
Mafarja M, Aljarah I, Heidari AA, Hammouri AI, Faris H, Alaâ’M A-Z, Mirjalili S (2018) Evolutionary population dynamics and grasshopper optimization approaches for feature selection problems. Knowl Based Syst 145:25–45
Malik B, Alaa S, Hamza T, Heba A (2020) A novel lifetime scheme for enhancing the convergence performance of salp swarm algorithm. Soft Comput 25:1–26
Malik B, Hussein A-Z, Mohammad R, Alaa S, Omar A (2022) Memory based hybrid crow search algorithm for solving numerical and constrained global optimization problems. Artif Intell Rev 56:1–73
Mazaheri V, Khodadadi H (2020) Heart arrhythmia diagnosis based on the combination of morphological, frequency and nonlinear features of ECG signals and metaheuristic feature selection algorithm. Expert Syst Appl 161:113697
Mirjalili S, Lewis A (2013) S-shaped versus v-shaped transfer functions for binary particle swarm optimization. Swarm Evolut Comput 9:1–14
Moranda PB (1975) Prediction of software reliability during debugging. In: Proceedings of the 1975 annual reliability and maintainability symposium
Musa JD (1975) A theory of software reliability and its application. IEEE Trans Softw Eng 1(03):312–327
Nadimi-Shahraki MH, Banaie-Dezfouli M, Zamani H, Taghian S, Mirjalili S (2021) B-MFO: a binary moth-flame optimization for feature selection from medical datasets. Computers 10(11):136
Nadimi-Shahraki MH, Zamani H, Mirjalili S (2022) Enhanced whale optimization algorithm for medical feature selection: a COVID-19 case study. Comput Biol Med 148:105858
Peng H, Long F, Ding C (2005) Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy. IEEE Trans Pattern Anal Mach Intell 27(8):1226–1238
Qtaish A, Albashish D, Braik M, Alshammari MT, Alreshidi A, Alreshidi EJ (2023) Memory-based sand cat swarm optimization for feature selection in medical diagnosis. Electronics 12(9):2042
Ragab M, Eljaaly K, Alhakamy NA, Alhadrami HA, Bahaddad AA, Abo-Dahab SM, Khalil EM (2022) Deep ensemble model for COVID-19 diagnosis and classification using chest CT images. Biology 11(1):43
Rawa M (2022) Towards avoiding cascading failures in transmission expansion planning of modern active power systems using hybrid snake-sine cosine optimization algorithm. Mathematics 10(8):1323
Sefer K et al (2022) Feature selection for diagnose coronavirus (COVID-19) disease by neural network and Caledonian crow learning algorithm. Appl Nanosci 13(4):3129
Sheta A (2006) Reliability growth modeling for software fault detection using particle swarm optimization. In: 2006 IEEE international conference on evolutionary computation. IEEE, pp 3071–3078
Shiny Irene D, Beulah Rene J (2022) An efficient COVID-19 detection from CT images using ensemble support vector machine with ludo game-based swarm optimisation. Comput Methods Biomech Biomed Eng Imaging Vis 10:675–686
Song S, Wang P, Heidari AA, Wang M, Zhao X, Chen H, He W, Suling X (2021) Dimension decided Harris Hawks optimization with gaussian mutation: balance analysis and diversity patterns. Knowl Based Syst 215:106425
Sun G, Yang B, Yang Z, Geni X (2020) An adaptive differential evolution with combined strategy for global numerical optimization. Soft Comput 24(9):6277–6296
Tao W, Jinde C, Lianglin X, Haiyang Z (2019) New stabilization results for semi-Markov chaotic systems with fuzzy sampled-data control. Complexity 2019:1–15
Tao W, Xiong L, Cheng J, Xie X (2020) New results on stabilization analysis for fuzzy semi-Markov jump chaotic systems with state quantized sampled-data controller. Inf Sci 521:231–250
Too J, Mirjalili S (2021) A hyper learning binary dragonfly algorithm for feature selection: a COVID-19 case study. Knowl Based Syst 212:106553
Wang G-G, Deb S, Cui Z (2019) Monarch butterfly optimization. Neural Comput Appl 31(7):1995–2014
Wang H, Tan L, Niu B (2019) Feature selection for classification of microarray gene expression cancers using bacterial colony optimization with multi-dimensional population. Swarm Evolut Comput 48:172–181
Yamada S, Ohba M, Osaki S (1984) S-shaped software reliability growth models and their applications. IEEE Trans Reliab 33(4):289–292
Yang Y, Chen H, Heidari AA, Gandomi AH (2021) Hunger games search: visions, conception, implementation, deep analysis, perspectives, and towards performance shifts. Expert Syst Appl 177:114864
Yao L, Yuan P, Tsai C-Y, Zhang T, Yao L, Ding S (2023) ESO: an enhanced snake optimizer for real-world engineering problems. Expert Syst Appl 230:120594
Yousri R, Elbayoumi M, Soltan A, Darweesh MS (2023) A power-aware task scheduler for energy harvesting-based wearable biomedical systems using snake optimizer. Analog Integr Circuits Signal Process 115(2):183–194
Yuxing L, Luqi X, Bingzhao T, Lili L, Yilan L, Xinyao G, Xiaohui X (2022) A denoising method for ship-radiated noise based on optimized variational mode decomposition with snake optimization and dual-threshold criteria of correlation coefficient. Math Probl Eng 2022:8024753
Zakeri A, Hokmabadi A (2019) Efficient feature selection method using real-valued grasshopper optimization algorithm. Expert Syst Appl 119:61–72
Zhang X, Wang D, Zhou Z, Ma Y (2019) Robust low-rank tensor recovery with rectification and alignment. IEEE Trans Pattern Anal Mach Intell 43(1):238–255
Zhang Y, Liu R, Wang X, Chen H, Li C (2021) Boosted binary Harris Hawks optimizer and feature selection. Eng Comput 37(4):3741–3770
Zhang H, Wang Z, Chen W, Heidari AA, Wang M, Zhao X, Liang G, Chen H, Zhang X (2021) Ensemble mutation-driven salp swarm algorithm with restart mechanism: framework and fundamental analysis. Expert Syst Appl 165:113897
Zhao D, Liu L, Fanhua Yu, Heidari AA, Wang M, Oliva D, Muhammad K, Chen H (2021) Ant colony optimization with horizontal and vertical crossover search: fundamental visions for multi-threshold image segmentation. Expert Syst Appl 167:114122
Zivkovic M, Nebojsa Bacanin K, Venkatachalam AN, Djordjevic A, Strumberger I, Al-Turjman F (2021) COVID-19 cases prediction by using hybrid machine learning and beetle antennae search approach. Sustain Cities Soc 66:102669
Zivkovic M, Bacanin N, Antonijevic M, Nikolic B, Kvascev G, Marjanovic M, Savanovic N (2022) Hybrid CNN and XGBoost model tuned by modified arithmetic optimization algorithm for COVID-19 early diagnostics from x-ray images. Electronics 11(22):3798
Funding
No funding was received.
Author information
Authors and Affiliations
Contributions
MB: presented and developed the mathematical models of the proposed algorithms. He also developed the programs and pseudo-code of the proposed algorithms. Malik also ran statistical tests and went through the results. Last but not least, Malik thoroughly revised the entire paper. AH: executed the programs and experimental scenarios of the work. He prepared the experiments, tables, and diagrams. Abdelaziz also discussed the evaluation results of the developed algorithms and other algorithms. MA: tested the technical concepts in the paper, the readability of the overall work and English grammar. He provided English proof for it as well. MA-B: verified that the references and findings were genuine. Additionally, he provided feedback on the work and assisted with its development and analysis. OA: discussed the convergence results of the proposed algorithms. He also made revisions to the paper’s conclusion and abstract.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Ethical approval
No ethical approval is required in this research.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Braik, M.S., Hammouri, A.I., Awadallah, M.A. et al. Improved versions of snake optimizer for feature selection in medical diagnosis: a real case COVID-19. Soft Comput 27, 17833–17865 (2023). https://doi.org/10.1007/s00500-023-09062-3
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00500-023-09062-3