Learning Convex Combinations of Continuously Parameterized Basic Kernels

Argyriou, Andreas; Micchelli, Charles A.; Pontil, Massimiliano

doi:10.1007/11503415_23


Part of the book series: Lecture Notes in Computer Science (LNAI, volume 3559)

Included in the following conference series:

International Conference on Computational Learning Theory


Abstract

We study the problem of learning a kernel which minimizes a regularization error functional such as that used in regularization networks or support vector machines. We consider this problem when the kernel is in the convex hull of basic kernels, for example, Gaussian kernels which are continuously parameterized by a compact set. We show that there always exists an optimal kernel which is the convex combination of at most m+1 basic kernels, where m is the sample size, and provide a necessary and sufficient condition for a kernel to be optimal. The proof of our results is constructive and leads to a greedy algorithm for learning the kernel. We discuss the properties of this algorithm and present some preliminary numerical simulations.
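The idea of greedily building a convex combination of basic kernels can be illustrated with a small sketch. It is not the authors' algorithm: it assumes the square loss, for which the minimum of the regularization functional with Gram matrix K and regularization parameter mu has the closed form mu * y^T (K + mu I)^{-1} y, it replaces the continuous parameter set with a finite grid of Gaussian widths, and it uses a plain grid search over the mixing weight. The Gaussian parameterization exp(-||x - t||^2 / sigma^2) and the names `gaussian_gram`, `reg_error`, and `greedy_kernel` are hypothetical choices made for illustration.

```python
import numpy as np

def gaussian_gram(X, sigma):
    """Gram matrix of the Gaussian kernel exp(-||x - t||^2 / sigma^2) (assumed form)."""
    sq = np.sum(X**2, axis=1)
    d2 = sq[:, None] + sq[None, :] - 2.0 * X @ X.T
    return np.exp(-np.maximum(d2, 0.0) / sigma**2)

def reg_error(K, y, mu):
    """min_f sum_i (y_i - f(x_i))^2 + mu * ||f||_K^2 for the square loss."""
    m = len(y)
    return mu * float(y @ np.linalg.solve(K + mu * np.eye(m), y))

def greedy_kernel(X, y, sigmas, mu=0.1, steps=5, n_weights=21):
    """Greedy sketch: repeatedly mix the current kernel with one basic kernel."""
    grams = [gaussian_gram(X, s) for s in sigmas]
    # Start from the best single basic kernel on the grid.
    errs = [reg_error(G, y, mu) for G in grams]
    best = int(np.argmin(errs))
    weights = np.zeros(len(sigmas))
    weights[best] = 1.0
    K, err = grams[best].copy(), errs[best]
    lams = np.linspace(0.0, 1.0, n_weights)[1:]  # candidate mixing weights in (0, 1]
    for _ in range(steps):
        # Try every (basic kernel, mixing weight) pair and keep the best one.
        cand = min(
            ((reg_error((1 - l) * K + l * G, y, mu), j, l)
             for j, G in enumerate(grams) for l in lams),
            key=lambda t: t[0],
        )
        if cand[0] >= err - 1e-12:  # no improvement: stop
            break
        err, j, l = cand
        K = (1 - l) * K + l * grams[j]
        weights *= (1 - l)   # weights stay nonnegative and sum to one
        weights[j] += l
    return weights, K, err

# Toy data (synthetic, for illustration only).
rng = np.random.default_rng(0)
X = rng.uniform(-1, 1, size=(30, 1))
y = np.sin(3 * X[:, 0]) + 0.1 * rng.standard_normal(30)
sigmas = [0.1, 0.3, 1.0, 3.0]
weights, K, err = greedy_kernel(X, y, sigmas)
```

Because each step only mixes the current kernel with one basic kernel, the result remains a convex combination throughout, and the objective never increases from one step to the next.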

This work was supported by EPSRC Grant GR/T18707/01, NSF Grant ITR-0312113 and the PASCAL European Network of Excellence.



Author information

Authors and Affiliations

Department of Computer Science, University College London, Gower Street, London, WC1E 6BT, England, UK
Andreas Argyriou & Massimiliano Pontil
Department of Mathematics and Statistics, State University of New York, The University at Albany, 1400 Washington Avenue, Albany, NY, 12222, USA
Charles A. Micchelli


Editor information

Editors and Affiliations

University of Leoben, A-8700, Leoben, Austria
Peter Auer
Department of Electrical Engineering, Technion, P.O. Box, 3200, Haifa, Israel
Ron Meir


About this paper

Cite this paper

Argyriou, A., Micchelli, C.A., Pontil, M. (2005). Learning Convex Combinations of Continuously Parameterized Basic Kernels. In: Auer, P., Meir, R. (eds) Learning Theory. COLT 2005. Lecture Notes in Computer Science, vol 3559. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11503415_23


DOI: https://doi.org/10.1007/11503415_23
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-26556-6
Online ISBN: 978-3-540-31892-7
eBook Packages: Computer Science, Computer Science (R0)
