Abstract
High-dimensional microarray data suffer from the confounding effects of irrelevant, redundant and noisy genes on the scalability and efficiency of classification algorithms. In order for an effective dimensionality reduction and the selection of informative genes, this paper introduces a novel approach using fractional calculus concepts. This study proposes a modified version of binary the bat algorithm named fractional-order binary bat algorithm (FBBA) able to control the convergence process using more historical memory of bat behaviors. The gene selection technique contains a two-stage hybrid filter/wrapper method which employs a new correlation-based feature clustering (CFC) algorithm in the filter stage and the FBBA in the wrapper stage. The CFC-FBBA is evaluated on ten microarray gene expression datasets by employing the support vector machine classifier with a k-fold Monte Carlo cross validation data partitioning model. Furthermore, the results show that the CFC-FBBA, while minimizing the size of the gene subset, achieves the highest classification accuracy in most cases compared to several state-of-art hybrid techniques.



Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Al-Betar MA, Alomari OA, Abu-Romman SM (2020) A TRIZ-inspired bat algorithm for gene selection in cancer classification. Genomics 112:114–126
Alomari OA, Khader AT, Al-Betar MA, Abualigah LM (2017) Gene selection for cancer classification by combining minimum redundancy maximum relevancy and bat-inspired algorithm. Int J Data Min Bioinform 19:32–51
Alomari OA, Khader AT, Al-Betar MA, Awadallah MA (2018) A novel gene selection method using modified MRMR and hybrid bat-inspired algorithm with β-hill climbing. Appl Intell 48:4429–4447
Annavarapu CSR, Dara S (2021) Clustering-based hybrid feature selection approach for high dimensional microarray data. Chemom Intell Lab Syst 213:104305
Caponetto R (2010) Fractional order systems: modeling and control applications. World Scientific, Singapore
Che H, Wang P, Chi S, Sun Y, Yang T, Wang Z (2022) LED layout optimization in visible light communication system by a hybrid immune clonal bat algorithm. Opt Commun 520:128532. https://doi.org/10.1016/j.optcom.2022.128532
Chen S-B, Jahanshahi H, Abba OA, Solís-Pérez J, Bekiros S, Gómez-Aguilar J, Yousefpour A, Chu Y-M (2020) The effect of market confidence on a financial system from the perspective of fractional calculus: numerical investigation and circuit realization. Chaos Solitons Fractals 140:110223
Couceiro M, Ghamisi P (eds) (2016) Fractional-order Darwinian PSO. In: Fractional order darwinian particle swarm optimization. Springer, Berlin, pp 11–20. https://doi.org/10.1007/978-3-319-19635-0_2
Dabba A, Tari A, Meftali S (2021a) Hybridization of Moth flame optimization algorithm and quantum computing for gene selection in microarray data. J Ambient Intell Humaniz Comput 12:2731–2750
Dabba A, Tari A, Meftali S (2021b) A new multi-objective binary Harris Hawks optimization for gene selection in microarray data. J Ambient Intell Humaniz Comput. https://doi.org/10.1007/s12652-021-03441-0
Dabba A, Tari A, Meftali S, Mokhtari R (2021c) Gene selection and classification of microarray data method based on mutual information and moth flame algorithm. Expert Syst Appl 166:114012
Dashtban M, Balafar M, Suravajhala P (2018) Gene selection for tumor classification using a novel bio-inspired multi-objective approach. Genomics 110:10–17
Ding C, Peng H (2005) Minimum redundancy feature selection from microarray gene expression data. J Bioinform Comput Biol 3:185–205
Enache A-C, Sgarciu V (2015) An improved bat algorithm driven by support vector machines for intrusion detection. In: Presented at computational intelligence in security for information systems conference. Springer, Cham. https://doi.org/10.1007/978-3-319-19713-5_4
Enache A-C, Sgârciu V, Togan M (2017) Comparative study on feature selection methods rooted in swarm intelligence for intrusion detection. In: Presented at 2017 21st international conference on control systems and computer science (CSCS). https://doi.org/10.1109/CSCS.2017.40
Farivar F, Shoorehdeli MA, Manthouri M (2020) Improved teaching–learning based optimization algorithm using Lyapunov stability analysis. J Ambient Intell Humaniz Comput 13:3609–3618. https://doi.org/10.1007/s12652-020-02012-z
Ghamisi P, Couceiro MS, Benediktsson JA (2014) A novel feature selection approach based on FODPSO and SVM. IEEE Trans Geosci Remote Sens 53:2935–2947
Griffin DR, Webster FA, Michael CR (1960) The echolocation of flying insects by bats. Anim Behav 8:141–154
Jain I, Jain VK, Jain R (2018) Correlation feature selection based improved-binary particle swarm optimization for gene selection and cancer classification. Appl Soft Comput 62:203–215
Khaire UM, Dhanalakshmi R (2019) Stability of feature selection algorithm: a review. J King Saud Univ Comput Inf Sci 34:1060–1073. https://doi.org/10.1016/j.jksuci.2019.06.012
Lai C-M, Yeh W-C, Chang C-Y (2016) Gene selection using information gain and improved simplified swarm optimization. Neurocomputing 218:331–338
Li H, Song B, Tang X, Xie Y, Zhou X (2021) A multi-objective bat algorithm with a novel competitive mechanism and its application in controller tuning. Eng Appl Artif Intell 106:104453
Lu H, Chen J, Yan K, Jin Q, Xue Y, Gao Z (2017) A hybrid feature selection algorithm for gene expression data classification. Neurocomputing 256:56–62
Lv J, Peng Q, Chen X, Sun Z (2016) A multi-objective heuristic algorithm for gene expression microarray data classification. Expert Syst Appl 59:13–19
Menaga D, Revathi S (2021) Fractional-atom search algorithm-based deep recurrent neural network for cancer classification. J Ambient Intell Humaniz Comput. https://doi.org/10.1007/s12652-021-03008-z
Mirjalili S, Mirjalili SM, Yang X-S (2014) Binary bat algorithm. Neural Comput Appl 25:663–681
Monje CA, Chen Y, Vinagre BM, Xue D, Feliu-Batlle V (2010) Fractional-order systems and controls: fundamentals and applications. Springer Science & Business Media, Berlin
Moslehi F, Haeri A (2020) A novel hybrid wrapper–filter approach based on genetic algorithm, particle swarm optimization for feature subset selection. J Ambient Intell Humaniz Comput 11:1105–1127
Mousavi Y, Alfi A (2015) A memetic algorithm applied to trajectory control by tuning of fractional order proportional-integral-derivative controllers. Appl Soft Comput 36:599–617
Mousavi Y, Alfi A (2018) Fractional calculus-based firefly algorithm applied to parameter estimation of chaotic systems. Chaos Solitons Fractals 114:202–215
Nakamura RY, Pereira LA, Costa KA, Rodrigues D, Papa JP, Yang X-S (2012) BBA: a binary bat algorithm for feature selection. In: Presented at 2012 25th SIBGRAPI conference on graphics, patterns and images. https://doi.org/10.1109/SIBGRAPI.2012.47
Pires ES, Machado JT, de Moura OP, Cunha JB, Mendes L (2010) Particle swarm optimization with fractional-order velocity. Nonlinear Dyn 61:295–301
Qing Y, Ma C, Zhou Y, Zhang X, Xia H (2021) Cooperative coevolutionary multiobjective genetic programming for microarray data classification. In: Presented at proceedings of the genetic and evolutionary computation conference. https://doi.org/10.1145/3449639.3459400
Rauf HT, Malik S, Shoaib U, Irfan MN, Lali MI (2020) Adaptive inertia weight Bat algorithm with Sugeno-Function fuzzy search. Appl Soft Comput 90:106159
Rodrigues D, Pereira LA, Nakamura RY, Costa KA, Yang X-S, Souza AN, Papa JP (2014) A wrapper approach for feature selection based on bat algorithm and optimum-path forest. Expert Syst Appl 41:2250–2258
Santhakumar D, Logeswari S (2021) Hybrid ant lion mutated ant colony optimizer technique for Leukemia prediction using microarray gene data. J Ambient Intell Humaniz Comput 12:2965–2973
Sathananthavathi V, Indumathi G (2021) BAT optimization based retinal artery vein classification. Soft Comput 25:2821–2835
Schnitzler H-U, Kalko EK (2001) Echolocation by insect-eating bats: we define four distinct functional groups of bats and find differences in signal structure that correlate with the typical echolocation tasks faced by each group. Bioscience 51:557–569
Seah CS, Kasim S, Hassan R (2021) Significant directed walk framework to increase the accuracy of cancer classification using gene expression data. J Ambient Intell Humaniz Comput 12:7281–7298
Shahabi Sani N, Manthouri M, Farivar F (2020) A multi-objective ant colony optimization algorithm for community detection in complex networks. J Ambient Intell Humaniz Comput 11:5–21
Sharma P, Sharma K (2022) Fetal state health monitoring using novel Enhanced Binary Bat Algorithm. Comput Electr Eng 101:108035
Shukla AK, Singh P, Vardhan M (2019) A new hybrid wrapper TLBO and SA with SVM approach for gene expression data. Inf Sci 503:238–254
Song Q, Ni J, Wang G (2013) A fast clustering-based feature subset selection algorithm for high-dimensional data. IEEE Trans Knowl Data Eng 25:1–14
Tenreiro Machado J, Silva MF, Barbosa RS, Jesus IS, Reis CM, Marcos MG, Galhano AF (2010) Some applications of fractional calculus in engineering. Math Probl Eng 2010:639801. https://doi.org/10.1155/2010/639801
Tran B, Xue B, Zhang M (2017) A new representation in PSO for discretization-based feature selection. IEEE Trans Cybern 48:1733–1746
Wang J, Wei J-M, Yang Z, Wang S-Q (2017) Feature selection by maximizing independent classification information. IEEE Trans Knowl Data Eng 29:828–841
Wang Y-Y, Peng W-X, Qiu C-H, Jiang J, Xia S-R (2019) Fractional-order Darwinian PSO-based feature selection for media-adventitia border detection in intravascular ultrasound images. Ultrasonics 92:1–7
Xu R-F, Lee S-J (2015) Dimensionality reduction by feature clustering for regression problems. Inf Sci 299:42–57
Yan X, Nazmi S, Erol BA, Homaifar A, Gebru B, Tunstel E (2020) An efficient unsupervised feature selection procedure through feature clustering. Pattern Recognit Lett 131:277–284
Yang X-S (2010) A new metaheuristic bat-inspired algorithm. In: González JR, Pelta DA, Cruz C, Terrazas G, Krasnogor N (eds) Nature inspired cooperative strategies for optimization (NICSO 2010). Springer, Berlin, pp 65–74. https://doi.org/10.1007/978-3-642-12538-6_6
Yang B, Lu Y, Zhu K, Yang G, Liu J, Yin H (2017) Feature selection based on modified bat algorithm. IEICE Trans Inf Syst 100:1860–1869
Yang Q, Dong N, Zhang J (2021) An enhanced adaptive bat algorithm for microgrid energy scheduling. Energy 232:121014
Zhou Y, Kang J, Guo H (2020) Many-objective optimization of feature selection based on two-level particle cooperation. Inf Sci 532:91–109
Zhou Y, Kang J, Kwong S, Wang X, Zhang Q (2021a) An evolutionary multi-objective optimization framework of discretization-based feature selection for classification. Swarm Evol Comput 60:100770
Zhou Y, Lin J, Guo H (2021b) Feature subset selection via an improved discretization-based particle swarm optimization. Appl Soft Comput 98:106794
Zhou Y, Zhang W, Kang J, Zhang X, Wang X (2021c) A problem-specific non-dominated sorting genetic algorithm for supervised feature selection. Inf Sci 547:841–859
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Esfandiari, A., Farivar, F. & Khaloozadeh, H. Fractional-order binary bat algorithm for feature selection on high-dimensional microarray data. J Ambient Intell Human Comput 14, 7453–7467 (2023). https://doi.org/10.1007/s12652-022-04450-3
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12652-022-04450-3