Sparse Support Vector Machine with L_p Penalty for Feature Selection

  • Regular Paper
  • Published in: Journal of Computer Science and Technology

Abstract

We study strategies for feature selection with the sparse support vector machine (SVM). Recently, the so-called L_p-SVM (0 < p < 1) has attracted much attention because it encourages better sparsity than the widely used L_1-SVM. However, L_p-SVM is a non-convex and non-Lipschitz optimization problem, and solving it numerically is challenging. In this paper, we reformulate the L_p-SVM as an optimization model with a linear objective function and smooth constraints (LOSC-SVM), so that it can be solved by numerical methods for smooth constrained optimization. Our numerical experiments on artificial datasets show that LOSC-SVM (0 < p < 1) can improve both feature selection and classification performance when a suitable parameter p is chosen. We also apply it to several real-life datasets, and the experimental results show that it is superior to L_1-SVM.
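For reference, the L_p-penalized soft-margin linear SVM discussed above is commonly written as follows (a standard formulation; the paper's exact model may differ in details such as the treatment of the bias term):

\min_{w, b, \xi} \; \|w\|_p^p + C \sum_{i=1}^{n} \xi_i \quad \text{s.t.} \quad y_i (w^\top x_i + b) \ge 1 - \xi_i, \; \xi_i \ge 0, \; i = 1, \dots, n,

where \|w\|_p^p = \sum_j |w_j|^p and 0 < p < 1. Because |w_j|^p is non-convex and non-Lipschitz at w_j = 0, smooth solvers cannot be applied to this form directly. One standard way to arrive at a linear objective with smooth constraints, in the spirit of the LOSC reformulation (the constraints used in the paper may differ), is to split w = u - v with u, v >= 0 and introduce t_j >= |w_j|^p, which is equivalent to the smooth constraint u_j + v_j <= t_j^{1/p} because 1/p > 1. The sketch below illustrates this idea with SciPy's SLSQP solver; the toy data, variable packing, and selection threshold are illustrative assumptions, not the authors' code.

import numpy as np
from scipy.optimize import minimize

# Toy data: 40 samples, 5 features; only the first two features are informative.
rng = np.random.default_rng(0)
n, d = 40, 5
X = rng.normal(size=(n, d))
y = np.sign(X[:, 0] + 0.5 * X[:, 1] + 0.1 * rng.normal(size=n))

p, C = 0.5, 1.0  # L_p exponent (0 < p < 1) and margin trade-off

# Variables packed as z = [u, v, t, b, xi], with w = u - v and u, v, t, xi >= 0.
def unpack(z):
    return z[:d], z[d:2*d], z[2*d:3*d], z[3*d], z[3*d + 1:]

def objective(z):
    u, v, t, b, xi = unpack(z)
    return t.sum() + C * xi.sum()  # linear objective: sum(t) + C * sum(xi)

def margin(z):
    u, v, t, b, xi = unpack(z)
    return y * (X @ (u - v) + b) - 1.0 + xi  # y_i((u-v)^T x_i + b) >= 1 - xi_i

def sparsity(z):
    u, v, t, b, xi = unpack(z)
    return t ** (1.0 / p) - (u + v)  # u_j + v_j <= t_j^(1/p), smooth since 1/p > 1

z0 = np.full(3 * d + 1 + n, 0.1)
bounds = [(0, None)] * (3 * d) + [(None, None)] + [(0, None)] * n
res = minimize(objective, z0, method="SLSQP", bounds=bounds,
               constraints=[{"type": "ineq", "fun": margin},
                            {"type": "ineq", "fun": sparsity}])
u, v, t, b, xi = unpack(res.x)
w = u - v
print("selected features:", np.flatnonzero(np.abs(w) > 1e-4))

At an optimum, minimizing sum(t) forces t_j = (u_j + v_j)^p and u_j * v_j = 0, so the objective equals ||w||_p^p + C * sum(xi), recovering the L_p-SVM. Note that the reformulation does not remove non-convexity: a local solver such as SLSQP returns a stationary point that depends on the starting point, so in practice one would try several initializations.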

Author information

Corresponding author

Correspondence to Feng Zeng.

Additional information

This work is supported in part by the National Natural Science Foundation of China under Grant Nos. 61502159, 61379057, 11101081, and 11271069, and the Research Foundation of Central South University of China under Grant No. 2014JSJJ019.


About this article

Cite this article

Yao, L., Zeng, F., Li, D.H. et al. Sparse Support Vector Machine with L_p Penalty for Feature Selection. J. Comput. Sci. Technol. 32, 68–77 (2017). https://doi.org/10.1007/s11390-017-1706-2

