
A DC programming approach for feature selection in support vector machines learning

Regular Article · Advances in Data Analysis and Classification

Abstract

Feature selection consists of choosing a subset of the available features that captures the relevant properties of the data. In supervised pattern classification, a good choice of features is fundamental for building compact and accurate classifiers. In this paper, we develop an efficient feature selection method using the zero norm ℓ0 in the context of support vector machines (SVMs). Because ℓ0 is discontinuous at the origin, the corresponding optimization problem is difficult to solve. To overcome this drawback, we use a robust DC (difference of convex functions) programming approach, a general framework for non-convex continuous optimization. We consider an appropriate continuous approximation of ℓ0 such that the resulting problem can be formulated as a DC program. Our DC algorithm (DCA) converges after finitely many iterations and requires solving one linear program at each iteration. Computational experiments on standard datasets, including challenging problems from the NIPS 2003 feature selection challenge and gene selection for cancer classification, show that the proposed method is promising: it can suppress more than 99% of the features while still providing good classification. Moreover, the comparative results illustrate the superiority of the proposed approach over standard methods such as classical SVMs and feature selection concave.
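To make the abstract's scheme concrete, the following is a minimal sketch of a DCA-style successive-linear-programming iteration for ℓ0-regularized linear SVM feature selection. It is illustrative only: the exponential concave surrogate 1 − exp(−α|w_j|) for the zero norm, the function name `dca_l0_svm`, and the parameters `lam` and `alpha` are assumptions for this sketch, not taken from the paper. At each iteration the concave part is linearized at the current point and the resulting linear program is solved, matching the "one linear program per iteration" structure described above.

```python
import numpy as np
from scipy.optimize import linprog

def dca_l0_svm(X, y, lam=0.05, alpha=5.0, max_iter=20, tol=1e-6):
    """DCA-style sketch for l0-regularized linear SVM feature selection.

    ||w||_0 is replaced by the concave surrogate sum_j (1 - exp(-alpha*v_j))
    with v_j >= |w_j|; at each iteration the surrogate is linearized at the
    current point v_k and the resulting LP is solved.  (Illustrative
    assumptions: surrogate choice and parameter names are not the paper's.)
    """
    m, n = X.shape
    # LP variables, concatenated: [w (n), b (1), xi (m), v (n)].
    v_k = np.ones(n)            # linearization point for the concave part
    w, b = np.zeros(n), 0.0
    for _ in range(max_iter):
        # Objective: average hinge slack + linearized surrogate penalty,
        # whose gradient at v_k is lam * alpha * exp(-alpha * v_k).
        c = np.concatenate([
            np.zeros(n),                        # w has no direct cost
            [0.0],                              # b is free
            np.full(m, 1.0 / m),                # slack variables xi
            lam * alpha * np.exp(-alpha * v_k)  # reweighted penalty on v
        ])
        # Margin constraints: y_i (w.x_i + b) + xi_i >= 1, written as <=.
        A_margin = np.hstack([-y[:, None] * X, -y[:, None],
                              -np.eye(m), np.zeros((m, n))])
        b_margin = -np.ones(m)
        # |w_j| <= v_j, split into w_j - v_j <= 0 and -w_j - v_j <= 0.
        A_abs1 = np.hstack([np.eye(n), np.zeros((n, 1 + m)), -np.eye(n)])
        A_abs2 = np.hstack([-np.eye(n), np.zeros((n, 1 + m)), -np.eye(n)])
        A_ub = np.vstack([A_margin, A_abs1, A_abs2])
        b_ub = np.concatenate([b_margin, np.zeros(2 * n)])
        bounds = ([(None, None)] * (n + 1) +   # w, b free
                  [(0, None)] * (m + n))       # xi >= 0, v >= 0
        res = linprog(c, A_ub=A_ub, b_ub=b_ub, bounds=bounds, method="highs")
        sol = res.x
        w, b = sol[:n], sol[n]
        v_new = sol[n + 1 + m:]
        if np.max(np.abs(v_new - v_k)) < tol:  # stop once the iterate is stable
            v_k = v_new
            break
        v_k = v_new
    return w, b
```

On a toy problem where only a few features carry the class label, the reweighting `lam * alpha * exp(-alpha * v_k)` keeps pressure on small-magnitude weights while releasing large ones, which is what drives most weights to zero across iterations.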



Author information


Corresponding author

Correspondence to Hoai An Le Thi.


Cite this article

Le Thi, H.A., Le, H.M., Nguyen, V.V. et al. A DC programming approach for feature selection in support vector machines learning. Adv Data Anal Classif 2, 259–278 (2008). https://doi.org/10.1007/s11634-008-0030-7

