Abstract
Some features in a dataset that contain irrelevant or unnecessary data may adversely affect both classification accuracy and the size of data. These negative effects are minimized by using feature selection (FS). Recently, researchers have tried to develop more effective methods by using swarm-based optimization methods in FS, apart from the usual FS methods used in data mining. In this study, a novel wrapper feature selection method based on binary hybrid optimization, called BWPLFS, consisting of a Whale Optimization Algorithm, Particle Swarm Optimization and Lévy Flight is proposed. Ten standard benchmark datasets from the UCI repository for performance evaluation of the proposed algorithm are employed and compared with other literature algorithms. Support vector machines are used both in the objective function of the proposed FS and for classification. The system created for feature selection and classification is run twenty times. As a result of these runs, the average of the fitness values, the average of the classification accuracies, the worst of the fitness values and the best of the fitness values, and the average number of the selected features are found. The BWPLFS is compared with methods in the literature in terms of these criteria. According to the results, it seems that the proposed method selects the most effective features and so it is very promising. In addition, by integrating the proposed algorithm with devices that provide decision support systems, it can be provided to produce more accurate and faster results.
Similar content being viewed by others
Data availability
All the datasets used in this study are available at: http://archive.ics.uci.edu/ml.
References
Baloochian H, Ghaffary HR (2019) Multiclass classification based on multi-criteria decision-making. J Classif 36(1):140–151. https://doi.org/10.1007/s00357-018-9286-6
Uzer MS, Yilmaz N, Inan O (2013) Feature selection method based on artificial bee colony algorithm and support vector machines for medical datasets classification. Sci World J. https://doi.org/10.1155/2013/419187
Uzer MS, Inan O, Yılmaz N (2013) A hybrid breast cancer detection system via neural network and feature selection based on SBS, SFS and PCA. Neural Comput Appl 23(3):719–728. https://doi.org/10.1007/s00521-012-0982-6
Irmak B, Karakoyun M, Gülcü Ş (2022) An improved butterfly optimization algorithm for training the feed-forward artificial neural networks. Soft Comput. https://doi.org/10.1007/s00500-022-07592-w
Karakoyun M, Ozkis A (2022) A binary tree seed algorithm with selection-based local search mechanism for huge-sized optimization problems. Appl Soft Comput 129:109590
Yilmaz O, Altun AA, Koklu M (2022) Optimizing the learning process of multi-layer perceptrons using a hybrid algorithm based on MVO and SA. Int J Ind Eng Comput 13(4):617–640. https://doi.org/10.5267/j.ijiec.2022.5.003
Kennedy J, Eberhart R (1995) Particle swarm optimization. In: Proceedings of ICNN'95-international conference on neural networks 1995, pp 1942–1948 IEEE
Karaboga D, Basturk B (2008) On the performance of artificial bee colony (ABC) algorithm. Appl Soft Comput 8(1):687–697
Heidari AA, Mirjalili S, Faris H, Aljarah I, Mafarja M, Chen HL (2019) Harris hawks optimization: algorithm and applications. Futur Gener Comput Syst Int J Esci 97:849–872
Moosavi SHS, Bardsiri VK (2017) Satin bowerbird optimizer: a new optimization algorithm to optimize ANFIS for software development effort estimation. Eng Appl Artif Intell 60:1–15
Zhu GY, Zhang WB (2017) Optimal foraging algorithm for global optimization. Appl Soft Comput 51:294–313
Arora S, Anand P (2019) Binary butterfly optimization approaches for feature selection. Exp Syst Appl 116:147–160
Zhang QY, Wang RG, Yang J, Ding K, Li YF, Hu JG (2017) Collective decision optimization algorithm: a new heuristic optimization method. Neurocomputing 221:123–137
Mirjalili S, Lewis A (2016) The whale optimization algorithm. Adv Eng Softw 95:51–67
Samantaray S, Sahoo A (2021) Prediction of suspended sediment concentration using hybrid SVM-WOA approaches. Geocarto Int. https://doi.org/10.1080/10106049.2021.1920638
Yan S, Lifeng W, Fan J, Zhang F, Zou Y, You W (2021) A novel hybrid WOA-XGB model for estimating daily reference evapotranspiration using local and external meteorological data: applications in arid and humid regions of China. Agric Water Manag 244:106594. https://doi.org/10.1016/j.agwat.2020.106594
Al-Tashi Q, Kadir SJA, Rais HM, Mirjalili S, Alhussian H (2019) Binary optimization using hybrid grey wolf optimization for feature selection. IEEE Access 7:39496–39508. https://doi.org/10.1109/access.2019.2906757
Moslehi F, Haeri A, Martinez-Alvarez F (2020) A novel hybrid GA-PSO framework for mining quantitative association rules. Soft Comput 24(6):4645–4666
Kan X, Yang D, Cao L, Shu H, Li Y, Yao W, Zhang X (2020) A novel pso-based optimized lightweight convolution neural network for movements recognizing from multichannel surface electromyogram. Complexity 2020:1–15. https://doi.org/10.1155/2020/6642463
Ba AF, Huang H, Wang MJ, Ye XJ, Gu ZY, Chen HL, Cai XD (2020) Levy-based antlion-inspired optimizers with orthogonal learning scheme. Eng Comput. https://doi.org/10.1007/s00366-020-01042-7
Barshandeh S, Haghzadeh M (2021) A new hybrid chaotic atom search optimization based on tree-seed algorithm and Levy flight for solving optimization problems. Eng Comput 37(4):3079–3122
Matsushita R, Da Silva S, Da Fonseca R, Nagata M (2020) Bypassing the truncation problem of truncated Levy flights. Phys A Stat Mech Appl 559:125035
Mokeddem D (2021) Parameter extraction of solar photovoltaic models using enhanced levy flight based grasshopper optimization algorithm. J Electr Eng Technol 16(1):171–179
Zheng YF, Li Y, Wang G, Chen YP, Xu Q, Fan JH, Cui XT (2019) A novel hybrid algorithm for feature selection based on whale optimization algorithm. IEEE Access 7:14908–14923
Mafarja M, Mirjalili S (2018) Whale optimization approaches for wrapper feature selection. Appl Soft Comput 62:441–453
Wang D, Chen HM, Li TR, Wan JH, Huang YY (2020) A novel quantum grasshopper optimization algorithm for feature selection. Int J Approx Reason 127:33–53
Mafarja MM, Mirjalili S (2017) Hybrid Whale Optimization Algorithm with simulated annealing for feature selection. Neurocomputing 260:302–312
Emary E, Zawba HM, Hassanien AE (2016) Binary grey wolf optimization approaches for feature selection. Neurocomputing 172:371–381
Blake CL, Merz CJ (1998) University of California at Irvine repository of machine learning databases. http://www.ics.uci.edu/~mlearn/MLRepository.html. (1998). Accessed 2012
Turkoglu B, Uymaz SA, Kaya E (2022) Binary artificial algae algorithm for feature selection. Appl Soft Comput 120:108630
Inan O, Uzer MS (2021) A method of classification performance improvement via a strategy of clustering-based data elimination integrated with k-fold cross-validation. Arab J Sci Eng 46(2):1199–1212. https://doi.org/10.1007/s13369-020-04972-y
Yilmaz N, Inan O, Uzer MS (2014) A new data preparation method based on clustering algorithms for diagnosis systems of heart and diabetes diseases. J Med Syst 38(5):48. https://doi.org/10.1007/s10916-014-0048-7
Marini F, Walczak B (2015) Particle swarm optimization (PSO). A tutorial. Chemom Intell Laboratory Syst 149:153–165. https://doi.org/10.1016/j.chemolab.2015.08.020
Pavlyukevich I (2007) Levy flights, non-local search and simulated annealing. J Comput Phys 226(2):1830–1844
Reynolds AM, Frye MA (2007) Free-flight odor tracking in drosophila is consistent with an optimal intermittent scale-free search. PLoS ONE 2(4):e354. https://doi.org/10.1371/journal.pone.0000354
Shlesinger MF (2006) Mathematical physics-search research. Nature 443(7109):281–282
Mat AN, İnan O, Karakoyun M (2021) An application of the whale optimization algorithm with Levy flight strategy for clustering of medical datasets. Int J Optim Control Theor Appl (IJOCTA) 11(2):216–226
Saji Y, Barkatou M (2021) A discrete bat algorithm based on Lévy flights for euclidean traveling salesman problem. Exp Syst Appl 172:114639
Mantegna RN (1994) Fast, accurate algorithm for numerical-simulation of levy stable stochastic-processes. Phys Rev E 49(5):4677–4683
El-Kenawy EM, Mirjalili S, Ibrahim A, Alrahmawy M, El-Said M, Zaki RM, Eid MM (2021) advanced meta-heuristics, convolutional neural networks, and feature selectors for efficient COVID-19 X-ray chest image classification. IEEE Access 9:36019–36037. https://doi.org/10.1109/access.2021.3061058
Singh N, Singh S (2017) Hybrid algorithm of particle swarm optimization and grey wolf optimizer for improving convergence performance. J Appl Math. https://doi.org/10.1155/2017/2030489
Acknowledgements
The authors are grateful to Selcuk University Scientific Research Projects Coordinatorship for support of the manuscript.
Funding
All authors declare that there is no funding for this work.
Author information
Authors and Affiliations
Contributions
MSU and OI conducted the literature review of the manuscript and the design of the proposed method together. It also contributed equally to obtaining the results of the proposed method and interpreting the results. MSU and OI read and approved the final manuscript.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare no conflicts of interest.
Ethical approval
Authors confirm that the appropriate ethics review has been followed.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Uzer, M.S., Inan, O. A novel feature selection using binary hybrid improved whale optimization algorithm. J Supercomput 79, 10020–10045 (2023). https://doi.org/10.1007/s11227-023-05067-9
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11227-023-05067-9