Skip to main content
Log in

A novel feature selection using binary hybrid improved whale optimization algorithm

  • Published:
The Journal of Supercomputing Aims and scope Submit manuscript

Abstract

Some features in a dataset that contain irrelevant or unnecessary data may adversely affect both classification accuracy and the size of data. These negative effects are minimized by using feature selection (FS). Recently, researchers have tried to develop more effective methods by using swarm-based optimization methods in FS, apart from the usual FS methods used in data mining. In this study, a novel wrapper feature selection method based on binary hybrid optimization, called BWPLFS, consisting of a Whale Optimization Algorithm, Particle Swarm Optimization and Lévy Flight is proposed. Ten standard benchmark datasets from the UCI repository for performance evaluation of the proposed algorithm are employed and compared with other literature algorithms. Support vector machines are used both in the objective function of the proposed FS and for classification. The system created for feature selection and classification is run twenty times. As a result of these runs, the average of the fitness values, the average of the classification accuracies, the worst of the fitness values and the best of the fitness values, and the average number of the selected features are found. The BWPLFS is compared with methods in the literature in terms of these criteria. According to the results, it seems that the proposed method selects the most effective features and so it is very promising. In addition, by integrating the proposed algorithm with devices that provide decision support systems, it can be provided to produce more accurate and faster results.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13

Similar content being viewed by others

Data availability

All the datasets used in this study are available at: http://archive.ics.uci.edu/ml.

References

  1. Baloochian H, Ghaffary HR (2019) Multiclass classification based on multi-criteria decision-making. J Classif 36(1):140–151. https://doi.org/10.1007/s00357-018-9286-6

    Article  MATH  MathSciNet  Google Scholar 

  2. Uzer MS, Yilmaz N, Inan O (2013) Feature selection method based on artificial bee colony algorithm and support vector machines for medical datasets classification. Sci World J. https://doi.org/10.1155/2013/419187

    Article  Google Scholar 

  3. Uzer MS, Inan O, Yılmaz N (2013) A hybrid breast cancer detection system via neural network and feature selection based on SBS, SFS and PCA. Neural Comput Appl 23(3):719–728. https://doi.org/10.1007/s00521-012-0982-6

    Article  Google Scholar 

  4. Irmak B, Karakoyun M, Gülcü Ş (2022) An improved butterfly optimization algorithm for training the feed-forward artificial neural networks. Soft Comput. https://doi.org/10.1007/s00500-022-07592-w

    Article  Google Scholar 

  5. Karakoyun M, Ozkis A (2022) A binary tree seed algorithm with selection-based local search mechanism for huge-sized optimization problems. Appl Soft Comput 129:109590

    Article  Google Scholar 

  6. Yilmaz O, Altun AA, Koklu M (2022) Optimizing the learning process of multi-layer perceptrons using a hybrid algorithm based on MVO and SA. Int J Ind Eng Comput 13(4):617–640. https://doi.org/10.5267/j.ijiec.2022.5.003

    Article  Google Scholar 

  7. Kennedy J, Eberhart R (1995) Particle swarm optimization. In: Proceedings of ICNN'95-international conference on neural networks 1995, pp 1942–1948 IEEE

  8. Karaboga D, Basturk B (2008) On the performance of artificial bee colony (ABC) algorithm. Appl Soft Comput 8(1):687–697

    Article  Google Scholar 

  9. Heidari AA, Mirjalili S, Faris H, Aljarah I, Mafarja M, Chen HL (2019) Harris hawks optimization: algorithm and applications. Futur Gener Comput Syst Int J Esci 97:849–872

    Article  Google Scholar 

  10. Moosavi SHS, Bardsiri VK (2017) Satin bowerbird optimizer: a new optimization algorithm to optimize ANFIS for software development effort estimation. Eng Appl Artif Intell 60:1–15

    Article  Google Scholar 

  11. Zhu GY, Zhang WB (2017) Optimal foraging algorithm for global optimization. Appl Soft Comput 51:294–313

    Article  Google Scholar 

  12. Arora S, Anand P (2019) Binary butterfly optimization approaches for feature selection. Exp Syst Appl 116:147–160

    Article  Google Scholar 

  13. Zhang QY, Wang RG, Yang J, Ding K, Li YF, Hu JG (2017) Collective decision optimization algorithm: a new heuristic optimization method. Neurocomputing 221:123–137

    Article  Google Scholar 

  14. Mirjalili S, Lewis A (2016) The whale optimization algorithm. Adv Eng Softw 95:51–67

    Article  Google Scholar 

  15. Samantaray S, Sahoo A (2021) Prediction of suspended sediment concentration using hybrid SVM-WOA approaches. Geocarto Int. https://doi.org/10.1080/10106049.2021.1920638

    Article  Google Scholar 

  16. Yan S, Lifeng W, Fan J, Zhang F, Zou Y, You W (2021) A novel hybrid WOA-XGB model for estimating daily reference evapotranspiration using local and external meteorological data: applications in arid and humid regions of China. Agric Water Manag 244:106594. https://doi.org/10.1016/j.agwat.2020.106594

    Article  Google Scholar 

  17. Al-Tashi Q, Kadir SJA, Rais HM, Mirjalili S, Alhussian H (2019) Binary optimization using hybrid grey wolf optimization for feature selection. IEEE Access 7:39496–39508. https://doi.org/10.1109/access.2019.2906757

    Article  Google Scholar 

  18. Moslehi F, Haeri A, Martinez-Alvarez F (2020) A novel hybrid GA-PSO framework for mining quantitative association rules. Soft Comput 24(6):4645–4666

    Article  Google Scholar 

  19. Kan X, Yang D, Cao L, Shu H, Li Y, Yao W, Zhang X (2020) A novel pso-based optimized lightweight convolution neural network for movements recognizing from multichannel surface electromyogram. Complexity 2020:1–15. https://doi.org/10.1155/2020/6642463

    Article  Google Scholar 

  20. Ba AF, Huang H, Wang MJ, Ye XJ, Gu ZY, Chen HL, Cai XD (2020) Levy-based antlion-inspired optimizers with orthogonal learning scheme. Eng Comput. https://doi.org/10.1007/s00366-020-01042-7

    Article  Google Scholar 

  21. Barshandeh S, Haghzadeh M (2021) A new hybrid chaotic atom search optimization based on tree-seed algorithm and Levy flight for solving optimization problems. Eng Comput 37(4):3079–3122

    Article  Google Scholar 

  22. Matsushita R, Da Silva S, Da Fonseca R, Nagata M (2020) Bypassing the truncation problem of truncated Levy flights. Phys A Stat Mech Appl 559:125035

    Article  MATH  Google Scholar 

  23. Mokeddem D (2021) Parameter extraction of solar photovoltaic models using enhanced levy flight based grasshopper optimization algorithm. J Electr Eng Technol 16(1):171–179

    Article  Google Scholar 

  24. Zheng YF, Li Y, Wang G, Chen YP, Xu Q, Fan JH, Cui XT (2019) A novel hybrid algorithm for feature selection based on whale optimization algorithm. IEEE Access 7:14908–14923

    Article  Google Scholar 

  25. Mafarja M, Mirjalili S (2018) Whale optimization approaches for wrapper feature selection. Appl Soft Comput 62:441–453

    Article  Google Scholar 

  26. Wang D, Chen HM, Li TR, Wan JH, Huang YY (2020) A novel quantum grasshopper optimization algorithm for feature selection. Int J Approx Reason 127:33–53

    Article  MathSciNet  Google Scholar 

  27. Mafarja MM, Mirjalili S (2017) Hybrid Whale Optimization Algorithm with simulated annealing for feature selection. Neurocomputing 260:302–312

    Article  Google Scholar 

  28. Emary E, Zawba HM, Hassanien AE (2016) Binary grey wolf optimization approaches for feature selection. Neurocomputing 172:371–381

    Article  Google Scholar 

  29. Blake CL, Merz CJ (1998) University of California at Irvine repository of machine learning databases. http://www.ics.uci.edu/~mlearn/MLRepository.html. (1998). Accessed 2012

  30. Turkoglu B, Uymaz SA, Kaya E (2022) Binary artificial algae algorithm for feature selection. Appl Soft Comput 120:108630

    Article  Google Scholar 

  31. Inan O, Uzer MS (2021) A method of classification performance improvement via a strategy of clustering-based data elimination integrated with k-fold cross-validation. Arab J Sci Eng 46(2):1199–1212. https://doi.org/10.1007/s13369-020-04972-y

    Article  Google Scholar 

  32. Yilmaz N, Inan O, Uzer MS (2014) A new data preparation method based on clustering algorithms for diagnosis systems of heart and diabetes diseases. J Med Syst 38(5):48. https://doi.org/10.1007/s10916-014-0048-7

    Article  Google Scholar 

  33. Marini F, Walczak B (2015) Particle swarm optimization (PSO). A tutorial. Chemom Intell Laboratory Syst 149:153–165. https://doi.org/10.1016/j.chemolab.2015.08.020

    Article  Google Scholar 

  34. Pavlyukevich I (2007) Levy flights, non-local search and simulated annealing. J Comput Phys 226(2):1830–1844

    Article  MATH  MathSciNet  Google Scholar 

  35. Reynolds AM, Frye MA (2007) Free-flight odor tracking in drosophila is consistent with an optimal intermittent scale-free search. PLoS ONE 2(4):e354. https://doi.org/10.1371/journal.pone.0000354

    Article  Google Scholar 

  36. Shlesinger MF (2006) Mathematical physics-search research. Nature 443(7109):281–282

    Article  Google Scholar 

  37. Mat AN, İnan O, Karakoyun M (2021) An application of the whale optimization algorithm with Levy flight strategy for clustering of medical datasets. Int J Optim Control Theor Appl (IJOCTA) 11(2):216–226

    Article  MathSciNet  Google Scholar 

  38. Saji Y, Barkatou M (2021) A discrete bat algorithm based on Lévy flights for euclidean traveling salesman problem. Exp Syst Appl 172:114639

    Article  Google Scholar 

  39. Mantegna RN (1994) Fast, accurate algorithm for numerical-simulation of levy stable stochastic-processes. Phys Rev E 49(5):4677–4683

    Article  Google Scholar 

  40. El-Kenawy EM, Mirjalili S, Ibrahim A, Alrahmawy M, El-Said M, Zaki RM, Eid MM (2021) advanced meta-heuristics, convolutional neural networks, and feature selectors for efficient COVID-19 X-ray chest image classification. IEEE Access 9:36019–36037. https://doi.org/10.1109/access.2021.3061058

    Article  Google Scholar 

  41. Singh N, Singh S (2017) Hybrid algorithm of particle swarm optimization and grey wolf optimizer for improving convergence performance. J Appl Math. https://doi.org/10.1155/2017/2030489

    Article  MATH  MathSciNet  Google Scholar 

Download references

Acknowledgements

The authors are grateful to Selcuk University Scientific Research Projects Coordinatorship for support of the manuscript.

Funding

All authors declare that there is no funding for this work.

Author information

Authors and Affiliations

Authors

Contributions

MSU and OI conducted the literature review of the manuscript and the design of the proposed method together. It also contributed equally to obtaining the results of the proposed method and interpreting the results. MSU and OI read and approved the final manuscript.

Corresponding author

Correspondence to Mustafa Serter Uzer.

Ethics declarations

Conflict of interest

The authors declare no conflicts of interest.

Ethical approval

Authors confirm that the appropriate ethics review has been followed.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Uzer, M.S., Inan, O. A novel feature selection using binary hybrid improved whale optimization algorithm. J Supercomput 79, 10020–10045 (2023). https://doi.org/10.1007/s11227-023-05067-9

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11227-023-05067-9

Keywords

Navigation