Constructing Multiclass Learners from Binary Learners: A Simple Black-Box Analysis of the Generalization Errors

Fakcharoenphol, Jittat; Kijsirikul, Boonserm

doi:10.1007/11564089_12

Jittat Fakcharoenphol²¹ &
Boonserm Kijsirikul²²

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3734))

Included in the following conference series:

International Conference on Algorithmic Learning Theory

2088 Accesses
1 Citations

Abstract

Multiclass learning is widely solved by reducing to a set of binary problems. By considering base binary classifiers as black boxes, we analyze generalization errors of various constructions, including Max-Win, Decision Directed Acyclic Graphs, Adaptive Directed Acyclic Graphs, and the unifying approach based on coding matrix with Hamming decoding of Allwein, Schapire, and Singer, using only elementary probabilistic tools. Many of these bounds are new, some are much simpler than previously known. This technique also yields a simple proof of the equivalences of the learnability and polynomial-learnability of the multiclass problem and the induced pairwise problems.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Freund, Y., Schapire, R.E.: A decision-theoretic generalization of on-line learning and an application to boosting. J. Comput. Syst. Sci. 55, 119–139 (1997)
Article MATH MathSciNet Google Scholar
Vapnik, V.: Statistical Learning Theory. Wiley, Chichester (1998)
MATH Google Scholar
Cortes, C., Vapnik, V.: Support-vector networks. Mach. Learn. 20, 273–297 (1995)
MATH Google Scholar
Friedman, J.H.: Another approach to polychotomous classification. Technical report, Department of Statistics, Stanford University (1996)
Google Scholar
Hastie, T., Tibshirani, R.: Classification by pairwise coupling. In: NIPS 1997: Proceedings of the 1997 conference on Advances in neural information processing systems 10, pp. 507–513. MIT Press, Cambridge (1998)
Google Scholar
Platt, J., Cristianini, N., Shawe-Taylor, J.: Large margin DAGs for multiclass classification. In: Advance in Neural Information Processing System, vol. 12. MIT Press, Cambridge (2000)
Google Scholar
Kreßel, U.H.G.: Pairwise classification and support vector machines. In: Advances in kernel methods: support vector learning, pp. 255–268. MIT Press, Cambridge (1999)
Google Scholar
Kijsirikul, B., Ussivakul, N., Meknavin, S.: Adaptive directed acyclic graphs for multiclass classification. In: PRICAI 2002, pp. 158–168 (2002)
Google Scholar
Dietterich, T.G., Bakiri, G.: Error-correcting output codes: a general method for improving multiclass inductive learning programs. In: Dean, T.L., McKeown, K. (eds.) Proceedings of the Ninth AAAI National Conference on Artificial Intelligence, pp. 572–577. AAAI Press, Menlo Park (1991)
Google Scholar
Guruswami, V., Sahai, A.: Multiclass learning, boosting, and error-correcting codes. In: Computational Learning Theory, pp. 145–155 (1999)
Google Scholar
Allwein, E.L., Schapire, R.E., Singer, Y.: Reducing multiclass to binary: a unifying approach for margin classifiers. J. Mach. Learn. Res. 1, 113–141 (2001)
Article MATH MathSciNet Google Scholar
Har-Peled, S., Roth, D., Zimak, D.: Constraint classification: A new approach to multiclass classification and ranking. In: NIPS. (2003)
Google Scholar
Bar-Hillel, A., Weinshall, D.: Learning with equivalence constraints, and the relation to multiclass learning. In: COLT. (2003)
Google Scholar
Fakcharoenphol, J.: A note on random DDAG. Manuscript (2003)
Google Scholar
Schapire, R.E., Freund, Y., Bartlett, P.L., Lee, W.S.: Boosting the margin: a new explanation for the effectiveness of voting methods. Annals of Statistics 26, 1651–1686 (1998)
Article MATH MathSciNet Google Scholar
Paugam-Moisy, H., Elisseeff, A., Guermeur, Y.: Generalization performance of multiclass discriminant models. In: Proceedings of the IEEE-INNS-ENNS International Joint Conference on Neural Networks, IJCNN 2000, Neural Computing: New Challenges and Perspectives for the New Millennium, Como, Italy, July 24-27, vol. 4. IEEE, Los Alamitos (2000)
Google Scholar
Bennett, K.P., Cristianini, N., Shawe-Taylor, J., Wu, D.: Enlarging the margins in perceptron decision trees. Mach. Learn. 41, 295–313 (2000)
Article MATH Google Scholar
Shawe-Taylor, J., Bartlett, P.L., Williamson, R.C., Anthony, M.: A framework for structural risk minimisation. In: COLT 1996: Proceedings of the ninth annual conference on Computational learning theory, pp. 68–76. ACM Press, New York (1996)
Chapter Google Scholar
Ben-David, S., Cesa-Bianchi, N., Haussler, D., Long, P.M.: Characterizations of learnability for classes of {0, .̇, n}-valued functions. J. Comput. Syst. Sci. 50, 74–86 (1995)
Article MATH MathSciNet Google Scholar
Natarajan, B.K.: On learning sets and functions. Mach. Learn. 4, 67–97 (1989)
Google Scholar
Vapnik, V.N., Chervonenkis, A.Y.: On the uniform convergence of relative frequencies of events to their probabilities. Theoret. Probi. and its Appl. 16, 264–280 (1971)
Article MATH Google Scholar
Blumer, A., Ehrenfeucht, A., Haussler, D., Warmuth, M.K.: Learnability and the vapnik-chervonenkis dimension. J. ACM 36, 929–965 (1989)
Article MATH MathSciNet Google Scholar
Phetkaew, T., Kijsirikul, B., Rivepiboon, W.: Reordering adaptive directed acyclic graphs for multiclass support vector machines. In: Proceedings of the Third International Conference on Intelligent Technologies, InTech 2002 (2002)
Google Scholar
Klautau, A., Jevtić, N., Orlitsky, A.: On nearest-neighbor error-correcting output codes with application to all-pairs multiclass support vector machines. J. Mach. Learn. Res. 4, 1–15 (2003)
Article Google Scholar
Rifkin, R., Klautau, A.: In defense of one-vs-all classification. J. Mach. Learn. Res. 5, 101–141 (2004)
MathSciNet Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Engineering, Kasetsart University, Bangkok, Thailand
Jittat Fakcharoenphol
Department of Computer Engineering, Chulalongkorn University, Bangkok, Thailand
Boonserm Kijsirikul

Authors

Jittat Fakcharoenphol
View author publications
You can also search for this author in PubMed Google Scholar
Boonserm Kijsirikul
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Computing, National University of Singapore, 117590, Singapore
Sanjay Jain
Ruhr-Universität Bochum, Germany
Hans Ulrich Simon
Department of Information and Communication Engineering, Faculty of Electro-Communications, The University of Electro-Communications, Chofugaoka 1–5–1, Chofu, 182-8585, Tokyo, Japan
Etsuji Tomita

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Fakcharoenphol, J., Kijsirikul, B. (2005). Constructing Multiclass Learners from Binary Learners: A Simple Black-Box Analysis of the Generalization Errors. In: Jain, S., Simon, H.U., Tomita, E. (eds) Algorithmic Learning Theory. ALT 2005. Lecture Notes in Computer Science(), vol 3734. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11564089_12

Download citation

DOI: https://doi.org/10.1007/11564089_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29242-5
Online ISBN: 978-3-540-31696-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics