Abstract
One of the crucial problems in in the fields of machine learning and data mining is data reduction by feature selection (FS). In this context, this paper proposes an FS method based on a hybrid of type 2 fuzzy rough k-nearest neighbors (T2FRKNN) and a weighted mean vector optimization method called FKNINFO. Thus, the significance of the features can be determined by the creation of the lower and upper fuzzy similarity partition matrices. The introduction of INFO is intended to enhance the T2FRKNN with the best parameters and feature subsets. The proposed method is a dynamic framework originally aimed at solving problems through continuous optimization. In this regard, we propose a binary version of FKNINFO (BFKNINFO), which uses the X-shaped function to improve the efficiency of FS. The BFKNINFO is tested using medical datasets and compared to the other optimization methods in terms of fitness, accuracy, precision, recall, ROC curves,Wilcoxon statistical test (P-value), running time, and number of features. BFKNINFO is used to detect the coronavirus disease (COVID-19) datasets. The results of the experiments demonstrate the effectiveness of BFKNINFO in navigating the problem space and identifying the most effective parameter and features by reducing the number of features.





Similar content being viewed by others
Data Availibility Statement
The datasets analyzed during the current study are available In: UCI repository and https://github.com/Atharva-Peshkar/Covid-19-Patient-Health-Analytics
References
Too J, Mirjalili S (2021) A hyper learning binary dragonfly algorithm for feature selection: A covid-19 case study. Knowl-Based Syst 212:106553
Albahli S, Meraj T, Chakraborty C, Rauf HT (2022) Ai-driven deep and handcrafted features selection approach for covid-19 and chest related diseases identification. Multimed Tools App 81(26):37569–37589
Bandyopadhyay R, Basu A, Cuevas E, Sarkar R (2021) Harris hawks optimisation with simulated annealing as a deep feature selection method for screening of covid-19 ct-scans. Appl Soft Comput 111:107698
Bania RK, Halder A (2021) R-hefs: Rough set based heterogeneous ensemble feature selection method for medical data classification. Artif Intell Med 114:102049
Azar AT, Anter AM, Fouad KM (2020) Intelligent system for feature selection based on rough set and chaotic binary grey wolf optimisation. Int J Comput Appl Technol 63(1–2):4–24
Mendel JM (2017) Uncertain rule-based fuzzy systems. Intro New Dir 684
Polkowski L (2002) Rough sets
Huang H, Meng F, Zhou S, Jiang F, Manogaran G (2019) Brain image segmentation based on fcm clustering algorithm and rough set. IEEE Access 7:12386–12396
Cao B, Fan S, Zhao J, Yang P, Muhammad K, Tanveer M (2020) Quantum-enhanced multiobjective large-scale optimization via parallelism. Swarm Evol Comput 57:100697
Huda RK, Banka H (2022) Efficient feature selection methods using pso with fuzzy rough set as fitness function. Soft Comput 1–21
Heidari AA, Mirjalili S, Faris H, Aljarah I, Mafarja M, Chen H (2019) Harris hawks optimization: Algorithm and applications. Futur Gener Comput Syst 97:849–872
Pathak Y, Arya K, Tiwari S (2019) Feature selection for image steganalysis using levy flight-based grey wolf optimization. Multimed Tools App 78:1473–1494
Chalabi NE, Attia A, Bouziane A, Akhtar Z (2021) Particle swarm optimization based block feature selection in face recognition system. Multimed Tools and App 80:33257–33273
Ahmadianfar I, Heidari AA, Noshadian S, Chen H, Gandomi AH (2022) Info: An efficient optimization algorithm based on weighted mean of vectors. Expert Syst Appl 195:116516
Yadav DC, Pal S (2020) Discovery of hidden pattern in thyroid disease by machine learning algorithms. Indian J Public Health Res Dev 11(1):61–66
Keller JM, Gray MR, Givens JA (1985) A fuzzy k-nearest neighbor algorithm. IEEE Trans Syst Man Cybern Part 4:580–585
Melin P, Ramirez E, Prado-Arechiga G (2018) A new variant of fuzzy k-nearest neighbor using interval type-2 fuzzy logic 1–7. IEEE
Sun L, Yin T, Ding W, Qian Y, Xu J (2021) Feature selection with missing labels using multilabel fuzzy neighborhood rough sets and maximum relevance minimum redundancy. IEEE Trans Fuzzy Syst 30(5):1197–1211
Sureshkumar V, Balasubramaniam S, Ravi V, Arunachalam A (2022) A hybrid optimization algorithm-based feature selection for thyroid disease classifier with rough type-2 fuzzy support vector machine. Expert Syst 39(1):12811
Wang A, An N, Chen G, Li L, Alterovitz G (2015) Accelerating wrapper-based feature selection with k-nearest-neighbor. Knowl-Based Syst 83:81–91
Kaur T, Saini BS, Gupta S (2019) An adaptive fuzzy k-nearest neighbor approach for mr brain tumor image classification using parameter free bat optimization algorithm. Multimed Tools App 78:21853–21890
Awotunde JB, Misra S, Pham QT (2022) An enhanced diabetes mellitus prediction using feature selection-based type-2 fuzzy model. Springer, pp 625–639
Hema M, Maheshprabhu R, Reddy KS, Guptha MN, Pandimurugan V (2023) Prediction analysis for parkinson disease using multiple feature selection & classification methods. Multimed Tools App 1–18
An S, Zhang M, Wang C, Ding W (2023) Robust fuzzy rough approximations with knn granules for semi-supervised feature selection. Fuzzy Sets Syst 461:108476
Thaher T, Heidari AA, Mafarja M, Dong JS, Mirjalili S (2020) Binary harris hawks optimizer for high-dimensional, low sample size feature selection pp 251–272
Asghari Varzaneh Z, Hosseini S, Javidi MM (2023) A novel binary horse herd optimization algorithm for feature selection problem. Multimed Tools App 1–35
Ghosh KK, Singh PK, Hong J, Geem ZW, Sarkar R (2020) Binary social mimic optimization algorithm with x-shaped transfer function for feature selection. IEEE Access 8:97890–97906
Li J, Wang Y, See J, Liu W (2019) Micro-expression recognition based on 3d flow convolutional neural network. Pattern Anal Applic 22(4):1331–1339
Zhao D, Huang C, Wei Y, Yu F, Wang M, Chen H (2017) An effective computational model for bankruptcy prediction using kernel extreme learning machine approach. Comput Econ 49(2):325–341
Dua D, Graff C et al (2017) Uci machine learning repository
Alweshah M, Alkhalaileh S, Al-Betar MA, Bakar AA (2022) Coronavirus herd immunity optimizer with greedy crossover for feature selection in medical diagnosis. Knowl-Based Syst 235:107629
Feng J, Gong Z (2022) A novel feature selection method with neighborhood rough set and improved particle swarm optimization. IEEE Access 10:33301–33312
Ahmed S, Sheikh KH, Mirjalili S, Sarkar R (2022) Binary simulated normal distribution optimizer for feature selection: Theory and application in covid-19 datasets. Expert Syst Appl 200:116834
Wang C, Huang Y, Shao M, Fan X (2019) Fuzzy rough set-based attribute reduction using distance measures. Knowl-Based Syst 164:205–212
Acknowledgements
The authors would like to acknowledge the financial support of this work by grants from General Direction of Scientific Research (DGRST), Tunisia, under the ARUB program.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflicts of interest
The Authors declares that they do not have conflict of interest.
Ethical approval
This article does not contain any studies with human participants or animals performed by any of the authors.
Informed consent
This article does not have any content needing any informed Consent.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Lahmar, I., Zaier, A., Yahia, M. et al. Binary weighted mean of vectors optimization based type-2 fuzzy-rough for feature selection. Multimed Tools Appl 83, 52089–52111 (2024). https://doi.org/10.1007/s11042-023-17580-3
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-023-17580-3