
That Elusive Diversity in Classifier Ensembles

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2652))

Abstract

Is “useful diversity” a myth? Many experiments, and the little available theory on diversity in classifier ensembles, are inconclusive, too heavily assumption-bound, or openly non-supportive of the intuition that diverse classifiers fare better than non-diverse ones. Although a rough general tendency was confirmed in our previous studies, no prominent link appeared between the diversity of the ensemble and its accuracy. Diversity alone is a poor predictor of ensemble accuracy. But there is no agreed definition of diversity to start with! Can we borrow a concept of diversity from biology? How can diversity, as far as we can define and measure it, be used to improve the ensemble? Here we argue that even without a clear-cut definition and theory behind it, studying diversity may prompt viable heuristic solutions. We look into some ways in which diversity can be used in analyzing, selecting or training the ensemble.
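As the abstract notes, there is no single agreed definition of diversity. One common family of candidates works on the ensemble's "oracle" outputs (a 0/1 record of whether each classifier labelled each sample correctly). The sketch below, a minimal illustration rather than the paper's own method, computes one such measure: the average pairwise disagreement, i.e. the fraction of samples on which exactly one classifier of a pair is correct. The function names and the toy oracle matrix are invented for this example.

```python
import numpy as np
from itertools import combinations

def disagreement(c1, c2):
    """Fraction of samples on which exactly one of the two classifiers
    is correct. c1, c2 are 0/1 arrays: 1 marks a correct label."""
    return float(np.mean(c1 != c2))

def ensemble_diversity(oracle):
    """Average pairwise disagreement over all classifier pairs.

    oracle: (L, N) 0/1 array -- L classifiers, N samples."""
    L = oracle.shape[0]
    pairs = list(combinations(range(L), 2))
    return sum(disagreement(oracle[i], oracle[j]) for i, j in pairs) / len(pairs)

# Toy oracle outputs: three classifiers, six samples (hypothetical data)
oracle = np.array([
    [1, 1, 0, 1, 0, 1],
    [1, 0, 1, 1, 0, 1],
    [0, 1, 1, 1, 1, 0],
])
print(ensemble_diversity(oracle))  # → 0.5555... (5/9)
```

A value of 0 means all classifiers succeed and fail on exactly the same samples; higher values mean more complementary error patterns. The point the paper stresses is that such a number, on its own, says little about ensemble accuracy.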




Copyright information

© 2003 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Kuncheva, L.I. (2003). That Elusive Diversity in Classifier Ensembles. In: Perales, F.J., Campilho, A.J.C., de la Blanca, N.P., Sanfeliu, A. (eds) Pattern Recognition and Image Analysis. IbPRIA 2003. Lecture Notes in Computer Science, vol 2652. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-44871-6_130


  • DOI: https://doi.org/10.1007/978-3-540-44871-6_130

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-40217-6

  • Online ISBN: 978-3-540-44871-6

  • eBook Packages: Springer Book Archive
