Abstract
Group Feature Selection (GFS) has proven useful for improving the interpretability and predictive performance of learned model parameters in many machine learning and data mining applications. Existing GFS models are mainly based on the squared loss and the logistic loss for regression and classification, leaving the \(\epsilon \)-insensitive loss and the hinge loss popularized by Support Vector Learning (SVL) machines unexplored. In this paper, we present a Bayesian GFS framework for SVL machines based on the pseudo-likelihood and the data augmentation idea. Through Bayesian inference, our method circumvents cross-validation over regularization parameters. Specifically, we apply the mean-field variational method in an augmented space to derive the posterior distributions of model parameters and hyper-parameters for Bayesian estimation. Both regression and classification experiments on synthetic and real-world data sets demonstrate that the proposed approach outperforms a number of competitors.
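To make the group-sparsity idea behind GFS concrete, the sketch below implements block soft-thresholding, the proximal operator of the group-lasso penalty. This is not the Bayesian variational method of the paper; it is a minimal, commonly used illustration of how a group-level penalty zeros out entire feature groups at once, which is the selection behavior the paper's Bayesian priors are designed to induce. The function name and the toy weight vector are illustrative choices, not from the paper.

```python
import numpy as np

def group_soft_threshold(w, groups, tau):
    """Block soft-thresholding, the proximal operator of the group-lasso
    penalty: each group of coefficients is shrunk toward zero, and any
    group whose Euclidean norm falls below tau is zeroed out entirely,
    removing the whole feature group from the model."""
    w = np.asarray(w, dtype=float)
    out = np.zeros_like(w)
    for idx in groups:
        g = w[idx]
        norm = np.linalg.norm(g)
        if norm > tau:
            # Shrink the surviving group radially by tau.
            out[idx] = (1.0 - tau / norm) * g
    return out

# Two groups of two features each: the weak first group is eliminated
# as a whole, while the strong second group is only shrunk.
w = np.array([0.1, -0.2, 3.0, 4.0])
groups = [[0, 1], [2, 3]]
print(group_soft_threshold(w, groups, tau=1.0))  # → [0.  0.  2.4 3.2]
```

The all-or-nothing behavior at the group level is what distinguishes group feature selection from ordinary lasso-style shrinkage, which can keep isolated coordinates from an otherwise irrelevant group.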
Acknowledgments
This work was supported by the National Natural Science Foundation of China (No. 9154610306, 61573335, 61473273, 61473274, 11390371, 11233004), the National Key Basic Research Program of China (Grant No. 2014CB845700), the National High-tech R&D Program of China (863 Program) (No. 2014AA015105), and the Guangdong Provincial Science and Technology Plan Projects (No. 2015B010109005).
Copyright information
© 2016 Springer International Publishing Switzerland
Cite this paper
Du, C., Du, C., Zhe, S., Luo, A., He, Q., Long, G. (2016). Bayesian Group Feature Selection for Support Vector Learning Machines. In: Bailey, J., Khan, L., Washio, T., Dobbie, G., Huang, J., Wang, R. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2016. Lecture Notes in Computer Science, vol 9651. Springer, Cham. https://doi.org/10.1007/978-3-319-31753-3_20
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-31752-6
Online ISBN: 978-3-319-31753-3
eBook Packages: Computer Science (R0)