Learning Bounds for Support Vector Machines with Learned Kernels

Srebro, Nathan; Ben-David, Shai

doi:10.1007/11776420_15


Part of the book series: Lecture Notes in Computer Science (LNAI, volume 4005)

Included in the following conference series:

International Conference on Computational Learning Theory


Abstract

Consider the problem of learning a kernel for use in SVM classification. We bound the estimation error of a large margin classifier when the kernel, relative to which this margin is defined, is chosen from a family of kernels based on the training sample. For a kernel family with pseudodimension \(d_{\phi}\), we present a bound of \(\sqrt{\tilde{\mathcal{O}}(d_{\phi}+1/\gamma^2)/n}\) on the estimation error for SVMs with margin \(\gamma\). This is the first bound in which the relation between the margin term and the family-of-kernels term is additive rather than multiplicative. The pseudodimension of families of linear combinations of base kernels is the number of base kernels. Unlike in previous (multiplicative) bounds, there is no non-negativity requirement on the coefficients of the linear combinations. We also give simple bounds on the pseudodimension for families of Gaussian kernels.
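
To make the additive-versus-multiplicative distinction concrete, the following is a minimal Python sketch of how the two kinds of bound scale. It is an illustration only, not the paper's statement: constants and the logarithmic factors hidden by the \(\tilde{\mathcal{O}}\) notation are dropped, n is taken to be the training sample size, and the multiplicative form \(\sqrt{d_{\phi}\cdot(1/\gamma^2)/n}\) is an assumed stand-in for the earlier bounds, whose exact form the abstract does not give. The function names are hypothetical.

```python
import math


def additive_rate(d_phi: float, gamma: float, n: int) -> float:
    """Leading-order scaling sqrt((d_phi + 1/gamma^2) / n).

    Constants and log factors are omitted, so this is a rate, not a bound.
    """
    return math.sqrt((d_phi + 1.0 / gamma**2) / n)


def multiplicative_rate(d_phi: float, gamma: float, n: int) -> float:
    """Assumed multiplicative scaling sqrt(d_phi * (1/gamma^2) / n).

    Used only for comparison; not taken from the paper.
    """
    return math.sqrt(d_phi * (1.0 / gamma**2) / n)


if __name__ == "__main__":
    n = 100_000  # hypothetical sample size
    for d_phi, gamma in [(5, 0.1), (20, 0.05), (50, 0.02)]:
        add = additive_rate(d_phi, gamma, n)
        mul = multiplicative_rate(d_phi, gamma, n)
        print(f"d_phi={d_phi:3d} gamma={gamma:.2f} additive={add:.4f} multiplicative={mul:.4f}")
```

Under these assumptions, the additive rate is smaller whenever both \(d_{\phi}\) and \(1/\gamma^2\) exceed 2, since a sum of two such terms is less than their product.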



Author information

Authors and Affiliations

Department of Computer Science, University of Toronto, Toronto, ON, Canada
Nathan Srebro
School of Computer Science, University of Waterloo, Waterloo, ON, Canada
Shai Ben-David


Editor information

Editors and Affiliations

ICREA and Department of Economics, Universitat Pompeu Fabra, Ramon Trias Fargas 25-27, 08005, Barcelona, Spain
Gábor Lugosi
Ruhr-Universität Bochum, Germany
Hans Ulrich Simon

About this paper

Cite this paper

Srebro, N., Ben-David, S. (2006). Learning Bounds for Support Vector Machines with Learned Kernels. In: Lugosi, G., Simon, H.U. (eds) Learning Theory. COLT 2006. Lecture Notes in Computer Science, vol 4005. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11776420_15


DOI: https://doi.org/10.1007/11776420_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-35294-5
Online ISBN: 978-3-540-35296-9
eBook Packages: Computer Science, Computer Science (R0)
