Abstract
The selection of informative and non-redundant features has become a prominent step in pattern classification. However, despite the intensive research, it is still an open issue to identify valuable feature subsets, especially in highly dimensional feature spaces. This paper proposes a wrapper feature selection method, in the context of support vector machines (SVMs), named Wr-SVM-FuzCoC. Our method combines effectively the advantages of the wrapper and filter approaches, achieving three goals simultaneously: classification performance, dimensionality reduction, and computational efficiency. In the filter part, a forward feature search methodology is developed, driven by a fuzzy complementary criterion, whereby at each iteration a feature is selected that exhibits the maximum additional contribution in regard to the previously selected subset. The quality of single features or feature subsets is assessed via a fuzzy local evaluation criterion with respect to patterns. This is achieved by the so-called fuzzy partition vector (FPV), comprising the fuzzy membership grades of every pattern in their target classes. Derivation of the feature FPVs is accomplished by incorporating a fuzzy output kernel-based support vector machine. The proposed method is favorably compared with existing SVM-based wrapper methods, in terms of performance capability and computational speed. Experimental investigation is carried out using a diverse pool of real datasets, including moderate and high-dimensional feature spaces.
Similar content being viewed by others
References
Ben-Bassat M (1968) Pattern recognition and reduction of dimensionality. In: Krishnaiah PR, Kanal LN (eds) Handbook of statistics-II. North Holland, New York, pp 773–791
Bonev B, Escolano F, Gazorla M (2008) Feature selection, mutual information, and classification of high-dimensional patterns. Pattern Anal Appl 11(3):309–319
Bradley P, Mangasarian O (1998) Feature selection via concave minimization and support vector machines. In: Machine learning proceedings of the 13th international conference on machine learning (ICML’98), San Francisco, CA, pp 82–90
Burges CJC (1998) A tutorial on support vector machines for pattern recognition. Data Mining Knowl Discov 2(2):121–167
Dash M, Liu H (1997) Feature selection for classification, intelligent data analysis. Ann Int J 1(3):131–156
Dash M, Choi K, Scheuermann P, Liu H (2002) Feature selection for clustering—a filter solution. In: Proceedings of the second international conference on data mining, pp 115–122
Devijver PA, Kittler J (1982) Pattern recognition: a statistical approach. Prentice-Hall, Englewood
Dubois D, Prade H (1980) Fuzzy sets and systems: theory and applications. Academic Press, London
Guyon I, Elisseeff A (2003) An introduction to variable and feature selection. J Mach Learn Res 3:1157
Guyon I, Weston J, Barnhill S, Bapnik V (2002) Gene selection for cancer classification using support vector machines. Mach Learn 46(1–3):389–422
Hsu C-W, Lin C-J (2002) A comparison of methods for multiclass support vector machines. IEEE Trans Neural Netw 13(2):415–425
Inza I, Larranaga P, Blanco R, Cerrolaza AJ (2004) Filter versus wrapper gene selection approaches in DNA microarray domains. Artif Intell Med 31:91–103
Jain A, Zongker D (1997) Feature selection: evaluation, application, and small sample performance. IEEE Trans Pattern Anal Mach Intell 19(2):153–158
Jain AK, Duin RPW, Mao J (2000) Statistical pattern recognition: a review. IEEE Trans Pattern Anal Mach Intell 22(1):4–37
Kohavi R, John GH (1997) Wrappers for feature subset selection. Artif Intell 97(1-2):273–324
Kuncheva LI (2004) Combining pattern classifiers: methods and algorithms. Wiley, New York
Kwak N, Choi C-H (2002) Input feature selection for classification problems. IEEE Trans Neural Netw 13(1):143–159
Lai C, Reinders MJT, Wessels L (2006) Random subspace method for multivariate feature selection. Pattern Recognit Lett 27(10):1067–1076
Liu H, Motoda H (1998) Feature selection for knowledge discovery and data mining. Kluwer Academic, Boston
Liu Yi, Zheng Yuan F (2006) FS_SFS: a novel feature selection method for support vector machines. Pattern Recognit 39(7):1333–1345
Maldonado S, Weber R (2009) A wrapper method for feature selection using support vector machines. Inf Sci 179:2208–2217
Mao KZ (2004) Feature subset selection for support vector machines through discriminative function pruning analysis. IEEE Trans Syst Man Cybern 34(1):60–67
Melgani F, Bruzzone L (2004) Classification of hyperspectral remote sensing images with support vector machines. IEEE Trans Geosci Remote Sens 42(8):1778–1790
Miranda J, Montoya R, Weber R (2005) Linear penalization support vector machines for feature selection. Pattern Recognition and Machine Intelligence, LNCS, vol 3776. Springer, Berlin, pp 188–192
Mitra P, Murthy CA, Pal SK (2002) Unsupervised feature selection using feature similarity. IEEE Trans Pattern Anal Mach Intell 24(3):301–312
Moustakidis SP, Theocharis JB, Giakas G (2008) Subject recognition based on ground reaction force measurements of gait signals. IEEE Trans Syst Man Cybern B 38(6):1476–1485
Müller KR, Mika S, Rätsch G, Tsuda K, Schölkopf B (2001) An introduction to kernel-based learning algorithms. IEEE Trans Neural Netw 12(2):181–201
Pal M (2006) Support vector machine-based feature selection for land cover classification: a case study with DAIS hyperspectral data. Int J Remote Sens 27(14):2877–2894
Pal M, Foody GM (2010) Feature selection for classification of hyperspectral data by SVM. IEEE Trans Geosci Remote Sens 48(5):2297–2307
Peng H, Long F, Ding C (2005) Feature selection based on mutual information: criteria of max-dependency, max-relevance, and min-redundancy. IEEE Trans Pattern Anal Mach Intell 27(8):1226–1238
Platt J (2000) Probabilistic outputs for support vector machines and comparison to regularized likehood methods. In: Smola AJ, Bartlett P, Schoelkopf B, Schurmans D (eds) Advances in large margin classifiers, pp 61–74
Punch WF, Goodman ED, Pei M, Chia-Sun L, Hovland P, Enbody R (1993) Further research on feature selection and classification using genetic algorithms. In: Proceedings of the international conference on genetic algorithms, pp 557–564
Rakotomanojy A (2003) Variable selection using SVM-based criteria. J Mach Learn Res 3:1357–1370
Raymer M, Punch W, Goodman E, Kuhn L, Jain AK (2000) Dimensionality reduction using genetic algorithms. IEEE Trans Evol Comput 4:164–171
Shieh M-D, Yang C-C (2008) Multiclass SVM-RFE for product form feature selection. Expert Syst Appl 35:531–541
Weston J, Mukherjee S, Chapelle O, Pontil M, Poggio T, Vapnik V (2001) Feature selection for SVMs, In Advances in neural information processing systems 13. MIT Press, Cambridge
Xiong MM, Fang X, Zhao JY (2001) Biomarker identification by feature wrappers. Genome Res 11:1878–1887
Xu L, Yan P, Chang T (1988) Best first strategy for feature selection. In: Proceedings of the ninth international conference on pattern recognition, pp 706–708
UCI machine learning repository. (http://www.ics.uci.edu/~mlearn/MLRepository.html)
Yang J, Honavar V (1998) Feature subset selection using a genetic algorithm. IEEE Intell Syst 13:44–49
Zadrozny B, Elkan C (2002) Transforming classifier scores into accurate multiclass probability estimates. In: Proceedings of the ACM SIGKDD international conference on knowledge discovery and data mining, pp 694–699
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Moustakidis, S.P., Theocharis, J.B. A fast SVM-based wrapper feature selection method driven by a fuzzy complementary criterion. Pattern Anal Applic 15, 379–397 (2012). https://doi.org/10.1007/s10044-012-0293-7
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10044-012-0293-7