A New Bootstrapping Method to Improve Classification Performance in Learning Classifier Systems

Holmes, John H.; Durbin, Dennis R.; Winston, Flaura K.

doi:10.1007/3-540-45356-3_73

John H. Holmes⁷,
Dennis R. Durbin^7,8 &
Flaura K. Winston⁸

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1917))

Included in the following conference series:

International Conference on Parallel Problem Solving from Nature

7674 Accesses
6 Citations

Abstract

A new technique for improving the classification performance of learning classifier systems (LCS) was developed and applied to a real-world data mining problem. EpiCS, a stimulus-response LCS, was adapted to perform prevalence-based bootstrapping, wherein data from training and testing sets were sampled according to the prevalence of the individual classes, rather than randomly using the class distribution inherent in the data. Prevalence-based bootstrapping was shown to improve classification performance significantly on training and testing. Furthermore, this procedure was shown to enhance EpiCS’s classification performance on testing compared to a well-known decision tree inducer (C4.5) when similar bootstrapping procedures were applied to the latter.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Assessing the Reliability of a Multi-Class Classifier

A power-controlled reliability assessment for multi-class probabilistic classifiers

Article 17 November 2022

Dynamic Classifier Selection Based on Imprecise Probabilities: A Case Study for the Naive Bayes Classifier

References

Abe, N. and Mamitsuka, H.: Query learning strategies using boosting and bagging. In: Shavlik, J. (ed.): Machine Learning. Proceedings of the Fifteenth International Conference (ICML’98). San Francisco, Morgan Kaufmann Publishers (1998) 1–9.
Google Scholar
Association for the Advancement of Automotive Medicine: The Abbreviated Injury Scale, 1990 Revision. Des Plaines, IL (1990).
Google Scholar
Bauer, E. and Kohavi, R.: An empirical comparison of voting classification algorithms bagging, boosting, and variants. Machine Learning 36 (1999) 105–139.
Article Google Scholar
Bonelli, P., Parodi, A., Sen, S., and Wilson, S.: NEWBOOLE: A fast GBML system, in: Porter, B. and Mooney, R. (eds.), Machine Learning: Proceedings of the Seventh International Conference. Morgan Kaufmann, San Mateo, CA (1990) 153–159.
Google Scholar
Efron, B. and Tibshirani, R.J.: An Introduction to the Bootstrap. Chapman and Hall, New York (1993).
MATH Google Scholar
Goldberg, D.E.: Genetic Algorithms in Search, Optimization, and Machine Learning. Addison-Wesley, New York (1989).
MATH Google Scholar
Harries, M.: Boosting a strong learner: evidence against the minimum margin. In: Bratko, I. and Dzeroski, S. (eds.): Machine Learning. Proceedings of the Sixteenth International Conference (ICML’ 99). Morgan Kaufmann Publishers, San Francisco (1999) 171–180.
Google Scholar
Holland, J.H., Holyoak, K.J., Nisbett, R.E., and Thagard, P.R.: Induction: Processes of Inference, Learning, and Discovery. The MIT Press, Cambridge, MA (1986).
Google Scholar
Holmes, J.H.: A genetics-based machine learning approach to knowledge discovery in clinical data, Journal of the American Medical Informatics Association Suppl (1996) 883.
Google Scholar
Holmes, J.H.: Discovery of Disease Risk with a Learning Classifier System, in: Baeck, T. (ed.): Proceedings of the Seventh International Conference on Genetic Algorithms (SanFrancisco, Morgan Kaufmann (1997) 426–433.
Google Scholar
Holmes, J.H., Winston, F.K., Durbin, D.R., et al: The Partners for Child Passenger Safety Project: An information infrastructure for injury surveillance, Journal of the American Medical Informatics Association Suppl (1998) 1016.
Google Scholar
Holmes J.H.: Differential negative reinforcement improves classifier system learning rate in two-class problems with unequal base rates. In: Koza J.R., Banzhaf W., Chellapilla K., et al (eds.): Genetic Programming 1998: Proceedings of the Third Annual Conference, Morgan Kaufmann, San Francisco (1998) 635–644.
Google Scholar
Holmes J.H.: Quantitative methods for evaluating learning classifier system performance In forced two-choice decision tasks. In: Wu, A. (ed.) Proceedings of the Second International Workshop on Learning Classifier Systems (IWLCS99). Morgan Kaufmann, SanFrancisco (1999) 250–257.
Google Scholar
Iba, H.: Bagging, Boosting, and bloating in genetic programming. In: Banzhaf, J., Daida, J., Eiben, et al (eds.): GECCO-99. Proceedings of the Genetic and Evolutionary Computation Conference. Morgan Kaufmann, San Francisco (1999) 1053–1060.
Google Scholar
Quinlan, J.R.: C4.5: Programs for Machine Learning. Morgan Kaufmann, San Mateo, CA (1993).
Google Scholar
Robertson, G.G. and Riolo, R.L.: A tale of two classifier systems, Machine Learning 3 (1988) 139–159.
Google Scholar
Schapire, R.E.: Theoretical views of boosting. In: Computational Learning Theory, 4^th European Conference, EuroCOLT99. Springer-Verlag, Berlin (1999) 1–10.
Chapter Google Scholar
Ting, K.M. and Zheng, Z.: Improving the performance of boosting for naive Bayesian classification. In: Zhong, N. and Zhou, L. (eds.): Proceedings of PAKDD-00, Third Pacific-Asia Conference on Knowledge Discovery and Data Mining. Springer-Verlag, Berlin (1999) 296–305.
Google Scholar
Weiss, S.M. and Indurkhya, N.: Predictive Data Mining. Morgan Kaufmann Publishers, Inc., San Francisco (1998).
MATH Google Scholar

Download references

Author information

Authors and Affiliations

Center for Clinical Epidemiology and Biostatistics, University of Pennsylvania School of Medicine, Philadelphia, PA, 19104, USA
John H. Holmes & Dennis R. Durbin
The Children’s Hospital of Philadelphia, Philadelphia, PA, 19104, USA
Dennis R. Durbin & Flaura K. Winston

Authors

John H. Holmes
View author publications
You can also search for this author in PubMed Google Scholar
Dennis R. Durbin
View author publications
You can also search for this author in PubMed Google Scholar
Flaura K. Winston
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

CMAP, Ecole Polytechnique, 91128, Palaiseau Cedex, France
Marc Schoenauer
Dept. of Mechanical Engineering Kanpur Genetic Algorithms Laboratory, Indian Institute of Technology Kanpur, Kanpur, Pin, 208 016, India
Kalyanmoy Deb
Fachbereich Informatik, Lehrstuhl für Systemanalyse, Universität Dortmund, Joseph-von-Fraunhofer-Str. 20, 44221, Dortmund, Germany
Günther Rudolph & Hans-Paul Schwefel &
School of Computer Science, The University of Birmingham, Edgbaston, Birmingham, B15 2TT, UK
Xin Yao
Projet Fractales, INRIA Rocquencourt, BP 105, 78153, Le Chesnay Cedex, France
Evelyne Lutton
Dept. de Arquitectura y Technologa de los Computadores, GeNeura Team, Universidad de Granada, Campus Fuenetenueva, s/n, Granada
Juan Julian Merelo

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Holmes, J.H., Durbin, D.R., Winston, F.K. (2000). A New Bootstrapping Method to Improve Classification Performance in Learning Classifier Systems. In: Schoenauer, M., et al. Parallel Problem Solving from Nature PPSN VI. PPSN 2000. Lecture Notes in Computer Science, vol 1917. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45356-3_73

Download citation

DOI: https://doi.org/10.1007/3-540-45356-3_73
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-41056-0
Online ISBN: 978-3-540-45356-7
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics

A New Bootstrapping Method to Improve Classification Performance in Learning Classifier Systems

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Assessing the Reliability of a Multi-Class Classifier

A power-controlled reliability assessment for multi-class probabilistic classifiers

Dynamic Classifier Selection Based on Imprecise Probabilities: A Case Study for the Naive Bayes Classifier

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

A New Bootstrapping Method to Improve Classification Performance in Learning Classifier Systems

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Assessing the Reliability of a Multi-Class Classifier

A power-controlled reliability assessment for multi-class probabilistic classifiers

Dynamic Classifier Selection Based on Imprecise Probabilities: A Case Study for the Naive Bayes Classifier

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation