Abstract
This paper deals with an unusual phenomenon where most machine learning algorithms yield good performance on the training set but systematically worse than random performance on the test set. This has been observed so far for some natural data sets and demonstrated for some synthetic data sets when the classification rule is learned from a small set of training samples drawn from some high dimensional space.
The initial analysis presented in this paper shows that anti-learning is a property of data sets and is quite distinct from over-fitting of a training data. Moreover, the analysis leads to a specification of some machine learning procedures which can overcome anti-learning and generate machines able to classify training and test data consistently.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Bamber, D.: The area above the ordinal dominance graph and the area below the receiver operating characteristic graph. J. Math. Psych. 12, 387–415 (1975)
Cortes, C., Vapnik, V.: Support vector networks. Machine Learning 20, 273–297 (1995)
Craven, M.: The Genomics of a Signaling Pathway: A KDD Cup Challenge Task. SIGKDD Explorations 4(2) (2002)
Cristianini, N., Shawe-Taylor, J.: An introduction to support vector machines and other kernel-based learning methods. Cambridge University Press, Cambridge (2000)
Freund, Y., Schapire, R.E.: Large margin classification using the perceptron algorithm. Machine Learning 37, 277–296 (1999)
Hall, P., Marron, J.S., Neeman, A.: Geometric representation of high dimension low sample size data, preprint, to appear in the Journal of the Royal Statistical Society, Series B (2005)
Kowalczyk, A., Chapelle, O., Baldwin, G.: Analysis of the anti-learning phenomenon (2005), http://users.rsise.anu.edu.au/~akowalczyk/antilearning/
Kowalczyk, A., Ong, C.S.: Anti-learning in binary classification (2005), http://users.rsise.anu.edu.au/~akowalczyk/antilearning/
Kowalczyk, A., Raskutti, B.: One Class SVM for Yeast Regulation Prediction. SIGKDD Explorations 4(2) (2002)
Langford, J.: (2005), http://hunch.net/index.php?p=35
Raskutti, B., Kowalczyk, A.: Extreme re-balancing for SVMs: a case study. SIGKDD Explorations 6(1), 60–69 (2004)
Schölkopf, B., Smola, A.J.: Learning with Kernels: Support Vector Machines, Regularization, Optimization and Beyond. MIT Press, Cambridge (2001)
Vapnik, V.: Statistical learning theory. Wiley, New York (1998)
Warmuth, M.K., Vishwanathan, S.V.N.: Leaving the Span. In: COLT 2005 (2005) (to appear)
Wolpert, D.H.: The supervised learning no-free-lunch theorems. In: World Conference on Soft Computing 2001 (2001)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kowalczyk, A., Chapelle, O. (2005). An Analysis of the Anti-learning Phenomenon for the Class Symmetric Polyhedron. In: Jain, S., Simon, H.U., Tomita, E. (eds) Algorithmic Learning Theory. ALT 2005. Lecture Notes in Computer Science(), vol 3734. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11564089_8
Download citation
DOI: https://doi.org/10.1007/11564089_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29242-5
Online ISBN: 978-3-540-31696-1
eBook Packages: Computer ScienceComputer Science (R0)