Abstract
Feature selection (FS) is one of the pre-processing methods that are widely used in the fields of Data Mining and Pattern Recognition. Elimination of redundant/irrelevant features of large data sets and finding a suitable feature subset are one of the main goals in FS. The utilization of evolutionary algorithms, including global search algorithms e.g. Genetic Algorithm and local search algorithms e.g. hill climbing, is known as the best way to solve a variety of optimization problems such as FS problem. They are never able to find a globally optimal solution because they are often trapped in one of the local optimum solutions and stop. Therefore, the researchers have tried to solve this major problem by escaping from the local solutions. In this article, we propose a two-stage method by applying a global search algorithm and a local search algorithm to find a sub-optimal solution for the FS problem. Here, we define a sub-optimal solution as a solution with the high reduction rate and the similar or even better classification performance. In the suggested two-stage method referred to as BGSA-SA, that is, the binary version of the Gravitational Search Algorithm (BGSA) and Simulated Annealing (SA) are selected as global and local search algorithms, respectively. For evaluating this proposed two-stage method, we utilized several UCI machine learning datasets and both classifiers SVM and K-NN. We compare the accuracy and reduction rate of the proposed two-stage method with three groups of methods, such as: (1) six singular meta-heuristic methods including BGA, BPSO, GSAPSO, CHGSA, BGSA, and SA, (2) the other two-stage methods namely BGA-SA and BPSO-SA, and (3) seven published methods as the state-of-art methods. The obtained results confirm that our BGSA-SA method has the rank 1 in the reduction rate whereas the accuracy of it using both SVM and K-NN classifiers is similar or even, in some cases, better than the other mentioned methods.
Similar content being viewed by others
References
Ashkezari AD, Ma H, Saha TK, Cui Y (2014) Investigation of feature selection techniques for improving efficiency of power transformer condition assessment. IEEE Trans Dielectr Electr Insul 21:836–844
Asuncion A, Newman D (2007) UCI machine learning repository. www.archive.ics.uci.edu/ml
Bajcsy P (2006) An overview of DNA microarray grid alignment and foreground separation approaches. EURASIP J Appl Signal Process 2006:1–13
Barani F, Mirhosseini M, Nezamabadi-pour H (2017) Application of binary quantum-inspired gravitational search algorithm in feature subset selection. Appl Intell 47:304–318
Biricik G, Diri B, Sonmez AC (2012) Abstract feature extraction for text classification. Turk J Electr Eng Comput Sci 20:1137–1159
Chen H, Li S, Tang Z (2011) Hybrid gravitational search algorithm with random-key encoding scheme combined with simulated annealing. IJCSNS Int J Comput Sci Netw Secur 11:208–217
Chengyi T (2014) Gravitational search algorithm based on simulated annealing. J Converg Inf Technol (JCIT) 9:231–237
Cover T, Hart P (1967) Nearest neighbor pattern classification. IEEE Trans Inf Theory 13:21–27
Demsar J (2006) Statistical comparisons of classifiers over multiple data sets. J Mach Learn Res 7:1–30
Fix E, Hodges J (1989) Discriminatory analysis-nonparametric discrimination: consistency properties. Int Stat Rev 57:238–247
Han X, Chang X (2012) A chaotic digital secure communication based on a modified gravitational search algorithm filter. Inf Sci 208:14–27
Han J, Kamber M (2006) Data Mining: concepts and techniques, 2nd edn. University of Illinois at Urbana-Champaign, printed on Elsevier Inc
Han M, Ren W (2015) Global mutual information-based feature selection approach using single-objective and multi-objective optimization. Neurocomputing 168:47–54
Khadanga R K, Satapathy J K (2015) A new hybrid GA–GSA algorithm for tuning damping controller parameters for a unified power flow controller. Electr Power Energy Syst 73:1060–1069
Kirkpatrick S, Gellat CD, Vechi MP (1983) Optimization by simulated annealing. Pattern Recogn 220:671–680
Kumar V, Kumar-Chahabra J, Kumar D (2015) Automatic unsupervised feature selection using gravitational search algorithm. IETE J Res 61:22–31
Li C, Zhou J (2011) Parameters identification of hydraulic turbine governing system using improved gravitational search algorithm. Energy Convers Manag 52:374–381
Li C, An X, Li R (2015) A chaos embedded GSA-SVM hybrid system for classification. Neural Comput Appl 26:713–721
Mirjalili S, Hashim S Z M (2010) A new hybrid PSOGSA algorithm for function optimization. In: Proceedings of 2010 international conference on computer and information application (ICCIA 2010). Tianjin, pp 374–377
Nezamabadi-pour H (2015) A quantum-inspired gravitational search algorithm for binary encoded optimization problems. Eng Appl Artif Intell 40:62–75
Papa JP, Pagnin A, Schellini SA, Spadotto A, Guido RC, Chiachia G, Falcao AX (2011) Feature selection through gravitational search algorithm. In: IEEE ICASSP, pp 2052–2055
Pedrajas N G, Rodriguez JP (2012) Multi-selection of instances: a straightforward way to improve evolutionary instance selection. Appl Soft Comput 12:3590–3602
Rashedi E, Nezamabadi-pour H, Saryazd S (2009) GSA: a gravitational search algorithm. Inf Sci 179:2232–2248
Rashedi E, Nezamabadi-pour H, Saryazdi S (2010) BGSA: binary gravitational search algorithm. Nat Comput 9:727–745
Rashedi E, Nezamabadi-pour H, Saryazd S (2013) A simultaneous feature adaptation and feature selection method for content-based image retrieval systems. Knowl-Based Syst 39:85–94
Sarafrazi S, Nezamabadi-pour H, Saryazdi S (2011) Disruption, A new operator in gravitational search algorithm. Sci Iran 18:539–548
Shuang C, Hongnian Y (2012) Mutual information based input feature selection for classification problems. Decis Support Syst 54:691–698
Sikora R, Piramuthu S (2007) Framework for efficient feature selection in genetic algorithm based data mining. Eur J Oper Res 180:723–737
Tsai CF, Eberle W, Chu CY (2013) Genetic algorithms in feature and instance selection. Knowl-Based Syst 39:240–247
Vapnik VN (1995) The nature of statistical learning theory. Springer, New York
Vierira SM, Mendonca LF, Farinha GJ, Sousa JMC (2013) Modified binary PSO for feature selection using SVM applied to mortality prediction of septic patients. Appl Soft Comput 13:3494–3504
Wahde M (2008) Biologically inspired optimization methods, 1st edn. WIT Press, Southampton
Wang CM, Huang YF (2009) Evolutionary-based feature selection approaches with new criteria for data mining: a case study of credit approval data. Expert Syst Appl 36:5900–5908
Wilcoxon F (1945) Individual comparisons by ranking methods. Biometrics 1:80–83
Xiang J, Han XH, Duan F, Qiang Y, Xiong XY, Lan Y, Chai H (2015) A novel hybrid system for feature selection based on an improved gravitational search algorithm and k-NN method. Appl Soft Comput 31:293–307
Xue B, Zhang M, Browne WN (2012) New fitness functions in binary particle swarm optimization for feature selection. In: IEEE world congress on computational intelligence, pp 10–15
Xue B, Zhang M, Browne WN (2014) Particle swarm optimization for feature selection in classification: novel initialization and updating mechanisms. Appl Soft Comput 18:261–276
Zahiri SH (2012) Fuzzy gravitational search algorithm an approach for data mining. Iran J Fuzzy Syst 9:21–37
Zhang H, Qin C, Jiang B, Luo Y (2014) On line adaptive policy learning algorithm for H1 state feedback control of unknown affine on linear discrete-time systems. IEEE Trans Cybern 44:2706–2718
Zhang H, Zhang J, Yang GH, Luo Y (2015) Leader-based optimal coordination control for the consensus problem of multi-agent differential games via fuzzy adaptive dynamic programming. IEEE Trans Fuzzy Syst 23:152–163
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Javidi, M.M., Zarisfi Kermani, F. Utilizing the advantages of both global and local search strategies for finding a small subset of features in a two-stage method. Appl Intell 48, 3502–3522 (2018). https://doi.org/10.1007/s10489-018-1159-5
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-018-1159-5