Abstract
This paper proposes an adaptive huberized support vector machine for simultaneous classification and gene selection. By introducing the data-driven weights, the proposed support vector machine can adaptively identify the important genes in groups, thus encouraging an adaptive grouping effect. Furthermore, the shrinkage biases for the coefficients of important genes are largely reduced. A reasonable correlation between the two regularization parameters is also given, based on which the solution paths are shown to be piecewise linear with respect to the single regularization parameter. Experiment results on leukaemia data set are provided to illustrate the effectiveness of the proposed method.




Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Cawley GC, Talbot NLC (2006) Gene selection in cancer classification using sparse logistic regression with Bayesian regularization. Bioinformatics 22(19):2438-–2355
Golub T, Slonim D, Tamayo P, Huard C, Gaasenbeek M, Mesirov J, Coller H, Loh M, Downing J, Caligiuri M (1999) Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. Science 286:531–536
Guyon I, Elisseeff A (2003) An introduction to variable and feature selection. J Mach Learn Res 3:1157–1182
Guyon I, Weston J, Barnhill S, Vapnik V (2002) Gene selection for cancer classification using support vector machines. Mach Learn 46:389–422
Li GZ, Meng HH, Yang MQ, Yang JY (2009) Combining support vector regression with feature selection for multivariate calibration. Neural Comput Appl 18:813–820
Li GZ, Yang J, Ye CZ, Geng D (2006) Degree prediction of malignancy in brain glioma using support vector machines. Comput Biol Med 36:315–325
Li GZ, Yang J, Liu GP, Xue L (2004) Feature selection for multi-class problems using support vector machines, In: Proceedings of 8th pacific rim international conference on artificial intelligence (PRICAI-04), Sheraton, Auckland, August 9 to August 13, pp 292–300
Li J, Jia Y, Du J, Yu F (2009) A new support vector machine for microarray classification and adaptive gene selection. 2009 American control conference. Hyatt Regency Riverfront, St. Louis, MO, USA, pp 5410–5415
Mao Y, Zhou X, Yin Z, Pi D, Sun Y, Wong STC (2006) Gene selection using gaussian kernel support vector machine based recursive feature elimination with adaptive kernel width strategy. Lect Notes Comput Sci Rough Sets Knowl Technol 4062:799–806
Mukherjee S, Tamayo R, Slonim D, Verri A, Golub T, Mesirov J, Poggio T (1999) Support vector machine classification of microarray data (Technical Report). Artificial Intelligence Laboratory, Massachusetts Institute of Technology
Park M, Hastie T (2008) Penalized logistic regression for detecting gene interactions. Biostatistics 9:30–50
Ramón DU, de Andrés A (2006) Gene selection and classification of microarray data using random forest. BMC Bioinformatics 7, available: http://www.biomedcentral.com/1471-2105/7/3
Wang L, Zhu J, Zou H (2008) Hybrid huberized support vector machines for microarray classification and gene selection. Bioinformatics 24:412–419
Wang L, Zhu J, Zou H (2006) The doubly regularized support vector machine. Stat Sinic 16:589–615
Yuan M, Lin Y (2006) Model selection and estimation in regression with grouped variables. J Roy Stat Soc Ser B 68:49–67
Zhu J, Rosset S, Hastie T, Tibshirani R (2004) 1-norm support vector machines. Adv Neural Infor Process Syst 16:49–56
Zhu J, Hastie T (2004) Classification of gene microarrays by penalized logistic regression. Biostatistics 46:505–510
Ma S, Song X, Huang J (2007) Supervised group lasso with applications to microarray data analysis. BMC Bioinformatics 8, 2007, available: http://www.biomedcentral.com/1471-2105/8/60
Tibshirani R (1996) Regression shrinkage and selection via the lasso. J Roy Stat Soc Ser B 58:267–288
Zou H (2007) An improved 1-norm support vector machine for simultaneous classification and variable selection. Eleventh international conference on artificial intelligence and statistics
Zhou X, Wang X, Dougherty E (2005) Gene selection using logistic regressions based on AIC, BIC and MDL criteria. New Math Nat Comput 1:129–145
Zou H, Hastie T (2005) Regularization and variable selection via the elastic net. J Roy Stat Soc Ser B 67:301–320
Rosset S, Zhu J (2007) Piecewise linear regularized solution paths. Ann Stat 35:1012–1030
Schokpof B, Smola A, Williamson RC, Bartlett PL (2000) New support vector algorithms. Neural Comput 12:1207–1245
Shevade SK, Keerthi SS, Bhattacharyya C, Murthy KRK (2000) Improvements to the SMO algorithm for SVM regression. IEEE Trans Neural Netw 11:1188–1194
Vapnik V (1995) The nature of statistical learning theory. Springer, New York
Dong JX, Krzyzak A, Suen CY (2005) Fast SVM training algorithm with decomposition on very large data sets. IEEE Trans Pattern Anal Mach Intell 27:603–618
Jiao L, Bo L, Wang L (2007) Fast sparse approximation for least square support vector machines. IEEE Trans Neural Netw 18:685–697
Wu Q, Zhou DX (2005) SVM soft margin classifiers: linear programming versus quadratic programming. Neural Comput 17:1160–1187
Hastie T, Rosset S, Tibshirani R, Zhu J (2004) The entire regularization path for the support vector machine. J Mach Learn Res 5:1391–1415
Acknowledgments
This work was supported by the NSFC (60774003, 60727002,60850004), the National 973 program (2005CB321902) and the COSTIND (A2120061303).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Li, J., Jia, Y. & Li, W. Adaptive huberized support vector machine and its application to microarray classification. Neural Comput & Applic 20, 123–132 (2011). https://doi.org/10.1007/s00521-010-0371-y
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-010-0371-y