Abstract
Features subset selection (FSS) generally plays an essential role in the implementation of data mining, particularly in the field of high-dimensional medical data analysis, as well as in supplying early detection with essential features and high accuracy. The latest modern feature selection models are now using the ability of optimization algorithms for extracting features of particular properties to get the highest accuracy performance possible. Many of the optimization algorithms, such as genetic algorithm, often use the required parameters that would need to be adjusted for better results. For the function selection procedure, tuning these parameter values is a difficult challenge. In this paper, a new wrapper-based feature selection approach called binary teaching learning based optimization (BTLBO) is introduced. The binary teaching learning based optimization (BTLBO) is among the most sophisticated meta-heuristic method which does not involve any specific algorithm parameters. It requires only standard process parameters such as population size and a number of iterations to extract a set of features selected from a data. This is a demanding process, to achieve the best possible set of features would be to use a method which is independent of the method controlling parameters. This paper introduces a new modified binary teaching–learning-based optimization (NMBTLBO) as a technique to select subset features and demonstrate support vector machine (SVM) accuracy of binary identification as a fitness function for the implementation of the feature subset selection process. The new proposed algorithm NMBTLBO contains two steps: first, the new updating procedure, second, the new method to select the primary teacher in teacher phase in binary teaching-learning based on optimization algorithm. The proposed technique NMBTLBO was used to classify the rheumatic disease datasets collected from Baghdad Teaching Hospital Outpatient Rheumatology Clinic during 2016–2018. Compared with the original BTLBO algorithm, the improved NMBTLBO algorithm has achieved a major difference in accuracy. Validation was carried out by testing the accuracy of four classification methods: K-nearest neighbors, decision trees, support vector machines and K-means. Study results showed that the classification accuracy of the four methods was increased for the proposed method of selection of features (NMBTLBO) compared to the BTLBO algorithm. SVM classifier provided 89% accuracy of BTLBO-SVM and 95% with NMBTLBO –SVM. Decision trees set the values of 94% with BTLBO-SVM and 95% with the feature selection of NMBTLBO-SVM. The analysis indicates that the latest method (NMBTLBO) enhances classification accuracy.
Similar content being viewed by others
References
Akhlaghi M, Emami F, Nozhat N (2014) Binary TLBO algorithm assisted for designing plasmonic nano bi-pyramids-based absorption coefficient. J Mod Opt 61(13):1092–1096. https://doi.org/10.1080/09500340.2014.920537
Allam M, Malaiyappan N (2020) Wrapper Based Feature Selection Using Integrative Teaching Learning Based Optimization Algorithm. International Arab Journal of Information Technology 17(6):885–894. https://doi.org/10.34028/iajit/17/6/7
Awad, Mariette, and Rahul Khanna. 2015. “Efficient Learning Machines: Theories, Concepts, and Applications for Engineers and System Designers.” Efficient Learning Machines: Theories, Concepts, and Applications for Engineers and System Designers, January: 1–248. https://doi.org/10.1007/978-1-4302-5990-9.
Chatra K, Kuppili V, Edla DR, Verma AK (2019) Cancer data classification using binary bat optimization and extreme learning machine with a novel fitness function. Med Biol Eng Comput 57(12):2673–2682. https://doi.org/10.1007/s11517-019-02043-5
Chuang LY, Tsai SW, Yang CH (2011) Improved binary particle swarm optimization using catfish effect for feature selection. Expert Syst Appl 38(10):12699–12707. https://doi.org/10.1016/j.eswa.2011.04.057
Cura OK, Atli SK, Türe HS, Akan A (2020) Epileptic seizure classifications using empirical mode decomposition and its derivative. Biomed Eng Online 19(1):1–22. https://doi.org/10.1186/s12938-020-0754-y
Deniz A, Kiziloz HE, Dokeroglu T, Cosar A (2017) Robust multiobjective evolutionary feature subset selection algorithm for binary classification using machine learning techniques. Neurocomputing 241:128–146. https://doi.org/10.1016/j.neucom.2017.02.033
Ghosh M, Guha R, Alam I, Lohariwal P, Jalan D, Sarkar R (2020) Binary genetic swarm optimization: a combination of Ga and Pso for feature selection. J Intell Syst 29(1):1598–1610. https://doi.org/10.1515/jisys-2019-0062
Gullo F (2015) From patterns in data to knowledge discovery: what data mining can do. Phys Procedia 62:18–22. https://doi.org/10.1016/j.phpro.2015.02.005
Hoffmann R, Napiórkowski J, Protasowicki T, Stanik J (2020) Risk Based Approach in Scope of Cybersecurity Threats and Requirements. Procedia Manufacturing 44:655–662. ScienceDirect. https://doi.org/10.1016/j.promfg.2020.02.243
Jain K, Bhadauria SS (2016) Enhanced content based image retrieval using feature selection using teacher learning based optimization. 14(11):1052–1057
Kaboli M, Akhlaghi M (2016) Binary teaching-learning-based optimization algorithm is used to investigate the superscattering plasmonic nanodisk. Optics and Spectroscopy (English Translation of Optika i Spektroskopiya) 120(6):958–963. https://doi.org/10.1134/S0030400X16060096
Kaoungku N, Kerdprasop K, Kerdprasop N (2017) A method to clustering the feature ranking on data classification using an ensemble feature selection. International Journal of Future Computer and Communication 6(3):81–85. https://doi.org/10.18178/ijfcc.2017.6.3.494
Khuat TT, Le MH (2019) Binary teaching–learning-based optimization algorithm with a new update mechanism for sample subset optimization in software defect prediction. Soft Comput 23(20):9919–9935. https://doi.org/10.1007/s00500-018-3546-6
Kiziloz HE, Deniz A, Dokeroglu T, Cosar A (2018) Novel multiobjective TLBO algorithms for the feature subset selection problem. Neurocomputing 306:94–107. https://doi.org/10.1016/j.neucom.2018.04.020
Manonmani, M., and Sarojini Balakrishnan. 2020. “Feature selection using improved teaching learning based algorithm on chronic kidney disease dataset.” Proc Comput Sci, 171:1660–1669. https://doi.org/10.1016/j.procs.2020.04.178.
Nayak PK, Mishra S, Dash PK, Bisoi R (2016) Comparison of modified teaching–learning-based optimization and extreme learning machine for classification of multiple power signal disturbances. Neural Comput & Applic 27(7):2107–2122. https://doi.org/10.1007/s00521-015-2010-0
Panigrahi SK, Pattnaik S (2016) Empirical study on clustering based on modified teaching learning based optimization. Proc Comput Sci 92:442–449. https://doi.org/10.1016/j.procs.2016.07.338
Rao RV, Savsani VJ, Balic J (2012) Teaching-learning-based optimization algorithm for unconstrained and constrained real-parameter optimization problems. Eng Optim 44(12):1447–1462. https://doi.org/10.1080/0305215X.2011.652103
Sameer F, Abu Bakar MR (2017) Modified Kohonen network algorithm for selection of the initial Centres of Gustafson-Kessel algorithm in credit scoring. Pertanika Journal of Science and Technology 25(1):77–90
Sameer FO, Abu Bakar MR, Zaidan AA, Zaidan BB (2019) A new algorithm of modified binary particle swarm optimization based on the Gustafson-Kessel for credit risk assessment. Neural Comput & Applic 31(2):337–346. https://doi.org/10.1007/s00521-017-3018-4
Sameer FO, Al-obaidi MJ, Al-bassam WW, Ad’hiah AH (2021) Multi-Objectives TLBO Hybrid Method to Select the Related Risk Features with Rheumatism Disease. Neural Computing and Applications:1–14. https://doi.org/10.1007/s00521-020-05665-1
Satapathy SK, Dehuri S, Jagadev AK, Mishra S (2019) Empirical study on the performance of the classifiers in EEG classification. EEG Brain Signal Classification for Epileptic Seizure Disorder Detection:45–65. https://doi.org/10.1016/b978-0-12-817426-5.00003-x
Shahbeig S, Helfroush MS, Rahideh A (2017) A fuzzy multi-objective hybrid TLBO–PSO approach to select the associated genes with breast Cancer. Signal Process 131:58–65. https://doi.org/10.1016/j.sigpro.2016.07.035
Shukla AK, Singh P, Vardhan M (2019) A new hybrid feature subset selection framework based on binary genetic algorithm and information theory. Int J Comput Intell Appl 18(3):1–22. https://doi.org/10.1142/S1469026819500202
Siddiqui MK, Morales-Menendez R, Huang X, Hussain N (2020) A review of epileptic seizure detection using machine learning classifiers. Brain Informatics 7(1):1–18. https://doi.org/10.1186/s40708-020-00105-1
Thaher T, Mafarja M, Turabieh H, Castillo PA, Faris H, Aljarah I (2021) Teaching learning-based optimization with evolutionary binarization schemes for tackling feature selection problems. IEEE Access 9:41082–41103. https://doi.org/10.1109/ACCESS.2021.3064799
Tuo S, Yong L, Deng F’A, Li Y, Lin Y, Qiuju L (2017) HSTLBO: a hybrid algorithm based on harmony search and teaching-learning- based optimization for complex highdimensional optimization problems. PLoS One 12(4):1–23. https://doi.org/10.1371/journal.pone.0175114
Xu Y, Wang L, Wang S y, Liu M (2015) An effective teaching-learning-based optimization algorithm for the flexible job-shop scheduling problem with fuzzy processing time. Neurocomputing 148:260–268. https://doi.org/10.1016/j.neucom.2013.10.042
Xue B, Zhang M, Browne WN, Yao X (2016) A survey on evolutionary computation approaches to feature selection. IEEE Trans Evol Comput 20(4):606–626. https://doi.org/10.1109/TEVC.2015.2504420
Zheng B, Yoon SW, Lam SS (2014) Breast Cancer Diagnosis Based on Feature Extraction Using a Hybrid of K-Means and Support Vector Machine Algorithms. Expert Systems with Applications 41(4 PART 1):1476–1482. https://doi.org/10.1016/j.eswa.2013.08.044
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Sameer, F.O. Comparison study on the performance of the multi classifiers with hybrid optimal features selection method for medical data diagnosis. Multimed Tools Appl 81, 18073–18090 (2022). https://doi.org/10.1007/s11042-022-12434-w
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-022-12434-w