Abstract
Following recent results [6] showing the importance of the fat-shattering dimension in explaining the beneficial effect of a large margin on generalization performance, the current paper investigates how the margin on a test example can be used to give greater certainty of correct classification in the distribution-independent model. The results show that even if the classifier does not classify all of the training examples correctly, the fact that a new example has a larger margin than that on the misclassified examples can be used to give very good estimates of generalization performance in terms of the fat-shattering dimension measured at a scale proportional to the excess margin. The estimate relies on a sufficiently large number of the correctly classified training examples having a margin roughly equal to that used to estimate generalization, indicating that the corresponding output values need to be ‘well sampled’. If this is not the case, it may be better to use the estimate obtained from a smaller margin.
This work was supported by the ESPRIT Neurocolt Working Group No. 8556.
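To fix ideas, the flavor of distribution-independent margin bound from [6] on which this analysis builds can be sketched as follows; the constants below are those of [6], and the present paper's refinement (exploiting the excess of a new example's margin over that of the misclassified training examples) has a different exact statement. If a classifier $f$ from a class $F$ achieves margin at least $\gamma$ on all $m$ training examples, then with probability at least $1-\delta$ its generalization error satisfies
\[
\mathrm{er}(f) \;\le\; \frac{2}{m}\Bigl( k \log_2\!\frac{8em}{k}\,\log_2(32m) \;+\; \log_2\!\frac{8m}{\delta} \Bigr),
\qquad k = \mathrm{fat}_F(\gamma/8),
\]
where $\mathrm{fat}_F$ denotes the fat-shattering dimension of $F$. Since $\mathrm{fat}_F(\gamma/8)$ is non-increasing in $\gamma$, the bound improves as the margin grows, which is the effect the present paper extends to the case of imperfectly classified training data.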
References
[1] Noga Alon, Shai Ben-David, Nicolò Cesa-Bianchi and David Haussler, “Scale-sensitive Dimensions, Uniform Convergence, and Learnability,” in Proceedings of the Conference on Foundations of Computer Science (FOCS), 1993. Also to appear in Journal of the ACM.
[2] Martin Anthony and John Shawe-Taylor, “A Result of Vapnik with Applications,” Discrete Applied Mathematics, 47, 207–217, 1993.
[3] Peter Bartlett, “The Sample Complexity of Pattern Classification with Neural Networks: the Size of the Weights is More Important than the Size of the Network,” Technical Report, Department of Systems Engineering, Australian National University, May 1996.
[4] Bernhard E. Boser, Isabelle M. Guyon and Vladimir N. Vapnik, “A Training Algorithm for Optimal Margin Classifiers,” in Proceedings of the Fifth Annual Workshop on Computational Learning Theory, pp. 144–152, ACM, Pittsburgh, 1992.
[5] D. J. C. MacKay, Bayesian Methods for Adaptive Models, Ph.D. Thesis, Caltech, 1991.
[6] John Shawe-Taylor, Peter Bartlett, Robert Williamson and Martin Anthony, “Structural Risk Minimization over Data-Dependent Hierarchies,” NeuroCOLT Technical Report NC-TR-96-51, 1996.
[7] Vladimir N. Vapnik, Estimation of Dependences Based on Empirical Data, Springer-Verlag, New York, 1982.
[8] Vladimir N. Vapnik, The Nature of Statistical Learning Theory, Springer-Verlag, New York, 1995.
Copyright information
© 1997 Springer-Verlag Berlin Heidelberg
Cite this paper
Shawe-Taylor, J. (1997). Confidence estimates of classification accuracy on new examples. In: Ben-David, S. (ed.) Computational Learning Theory. EuroCOLT 1997. Lecture Notes in Computer Science, vol. 1208. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-62685-9_22
Print ISBN: 978-3-540-62685-5
Online ISBN: 978-3-540-68431-2