Abstract
The PAC-Bayesian framework is a frequentist approach to machine learning which encodes learner bias as a “prior probability” over hypotheses. This chapter reviews basic PAC-Bayesian theory, including Catoni’s basic inequality and Catoni’s localization theorem.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Boser, B.E., Guyon, I.M., Vapnik, V.N.: A training algorithm for optimal margin classifiers. In: Proceedings of the 5th Annual Workshop on Computational Learning Theory, Pittsburgh, pp. 144–152. ACM (1992)
Catoni, O.: PAC-Bayesian supervised classification: the thermodynamics of statistical learning. arXiv:0712.0248 (2007, preprint)
Germain, P., Lacasse, A., Laviolette, F., Marchand, M.: PAC-Bayesian learning of linear classifiers. In: Proceedings of the 26th Annual International Conference on Machine Learning, Montreal, pp. 353–360. ACM (2009)
Hoeffding, W.: Probability inequalities for sums of bounded random variables. J. Am. Stat. Assoc. 58(301), 13–30 (1963)
Langford, J.: Tutorial on practical prediction theory for classification. J. Mach. Learn. Res. 6(1), 273 (2006)
Langford, J., Blum, A.: Microchoice bounds and self bounding learning algorithms. In: Proceedings of the 12th Annual Conference on Computational Learning Theory, Santa Cruz, pp. 209–214. ACM (1999)
Lever, G., Laviolette, F., Shawe-Taylor, J.: Distribution-dependent PAC-Bayes priors. In: Algorithmic Learning Theory, pp. 119–133. Springer, Berlin/Heidelberg (2010)
Maurer, A.: A note on the PAC Bayesian theorem. arXiv:cs/0411099 (2004, preprint)
McAllester, D.A.: PAC-Bayesian model averaging. In: COLT, Santa Cruz, pp. 164–170 (1999)
McAllester, D.: Simplified PAC-Bayesian margin bounds. In: Learning Theory and Kernel Machines, Washington, DC, pp. 203–215 (2003)
McAllester, D., Keshet, J.: Generalization bounds and consistency for latent structural probit and ramp loss. In: Proceedings of the 25th Annual Conference on Neural Information Processing Systems (NIPS), Granada (2011)
Seeger, M.: PAC-Bayesian generalisation error bounds for Gaussian process classification (English). J. Mach. Learn. Res. 3(2), 233–269 (2003)
Shawe-Taylor, J., Langford, J.: PAC-Bayes & margins. In: Advances in Neural Information Processing Systems 15: Proceedings of the 2002 Conference, Vancouver, vol. 15, p. 439. MIT (2003)
Valiant, L.G.: A theory of the learnable. Commun. ACM 27(11), 1134–1142 (1984)
Vapnik, V.N., Chervonenkis, A.Y.: On the uniform convergence of relative frequencies of events to their probabilities. Theory Probab. Appl. 16(2), 264–280 (1971)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
McAllester, D., Akinbiyi, T. (2013). PAC-Bayesian Theory. In: Schölkopf, B., Luo, Z., Vovk, V. (eds) Empirical Inference. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-41136-6_10
Download citation
DOI: https://doi.org/10.1007/978-3-642-41136-6_10
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-41135-9
Online ISBN: 978-3-642-41136-6
eBook Packages: Computer ScienceComputer Science (R0)