Abstract
Geometry-based ensembles is a newly proposed algorithm based on the concept of characterizing boundary points. These points are found from the geometry of the data set and belong to the optimal boundary between classes under a certain notion of robustness. The characterizing boundary points can be used to build a classifier. Based on these points, a set of locally robust linear classifiers is defined and assembled in an additive model to create a final decision rule. As a result a strong classifier able to compete with nowadays state-of-the-art classifiers is obtained. The main drawback of the original proposal comes from the fact that the complexity of the created model can be arbitrarily high and depends on the data set. Moreover, outliers and noise may increase this number. In this article, small complexity models with strong generalization capability are explored. Two incremental non-parametric additive building algorithms are considered: boosting and least squared residual fitting approaches. Moreover, the last method is extended to deal with incremental L2 penalized solutions (which implicitly combines the advantages of sparse models and smooth ones due to the complexity limit). The validation of the approach on the UCI database achieves very promising results assessing the validity of CBP based classifiers ensembles.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Pujol, O., Masip, D.: Geometry-Based Ensembles: Toward a Structural Characterization of the Classification Boundary. IEEE Transactions on Pattern Analysis and Machine Intelligence 31(6), 1140–1146 (2009)
Friedman, J.H.: Greedy Function Approximation: A Gradient Boosting Machine. The Annals of Statistics 29(5), 1189–1232 (2001)
Murphy, P.M., Aha, D.W.: UCI Repository of machine learning databases. University of California, Dept. of Information and Computer Science, Irvine (1994)
Friedman, J.H., Hastie, T., Tibshirani, R.: Additive logistic regression: a statistical view of boosting. The Annals of Statistics 28(2), 337–407 (2000)
Freund, Y., Schapire, R.E.: Experiments with a new boosting algorithm. In: Proc. XIIIth ICML, pp. 148–156 (1996)
Bottou, L., Bousquet, O.: The Tradeoffs of Large Scale Learning. In: Platt, J.C., Koller, D., Singer, Y., Roweis, S. (eds.) NIPS. NIPS Foundation, vol. 20, pp. 161–168 (2008), http://books.nips.cc
Kuncheva, L.: Classifier ensembles for changing environments. In: Roli, F., Kittler, J., Windeatt, T. (eds.) MCS 2004. LNCS, vol. 3077, pp. 1–15. Springer, Heidelberg (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Pujol, O. (2010). Boosted Geometry-Based Ensembles. In: El Gayar, N., Kittler, J., Roli, F. (eds) Multiple Classifier Systems. MCS 2010. Lecture Notes in Computer Science, vol 5997. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-12127-2_20
Download citation
DOI: https://doi.org/10.1007/978-3-642-12127-2_20
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-12126-5
Online ISBN: 978-3-642-12127-2
eBook Packages: Computer ScienceComputer Science (R0)