Robustness and Predictive Performance of Homogeneous Ensemble Feature Selection in Text Classification

Poornima Mehta, Satish Chandra

Source Title: International Journal of Information Retrieval Research (IJIRR)11(1)

ISSN: 2155-6377|EISSN: 2155-6385|EISBN13: 9781799861980|DOI: 10.4018/IJIRR.2021010104

MLA

Mehta, Poornima, and Satish Chandra. "Robustness and Predictive Performance of Homogeneous Ensemble Feature Selection in Text Classification." IJIRR vol.11, no.1 2021: pp.75-89. http://doi.org/10.4018/IJIRR.2021010104

APA

Mehta, P. & Chandra, S. (2021). Robustness and Predictive Performance of Homogeneous Ensemble Feature Selection in Text Classification. International Journal of Information Retrieval Research (IJIRR), 11(1), 75-89. http://doi.org/10.4018/IJIRR.2021010104

Chicago

Mehta, Poornima, and Satish Chandra. "Robustness and Predictive Performance of Homogeneous Ensemble Feature Selection in Text Classification," International Journal of Information Retrieval Research (IJIRR) 11, no.1: 75-89. http://doi.org/10.4018/IJIRR.2021010104

Export Reference

Favorite Full-Issue Download

View Full Text HTML

View Full Text PDF

Abstract

The use of ensemble paradigm with classifiers is a proven approach that involves combining the outcomes of several classifiers. It has recently been extrapolated to feature selection methods to find the most relevant features. Earlier, ensemble feature selection has been used in high dimensional, low sample size datasets like bioinformatics. To one's knowledge there is no such endeavor in the text classification domain. In this work, the ensemble feature selection using data perturbation in the text classification domain has been used with an aim to enhance predictability and stability. This approach involves application of the same feature selector to different perturbed versions of training data, obtaining different ranks for a feature. Previous works focus only on one of the metrics, that is, stability or accuracy. In this work, a combined framework is adopted that assesses both the predictability and stability of the feature selection method by using feature selection ensemble. This approach has been explored on univariate and multivariate feature selectors, using two rank aggregators.

You do not own this content. Please login to recommend this title to your institution's librarian or purchase it from the IGI Global bookstore.

Username or email: *

Password: *

Forgot individual login password?

Create individual account

Robustness and Predictive Performance of Homogeneous Ensemble Feature Selection in Text Classification

MLA

APA

Chicago

Export Reference

Abstract

Request Access