Skip to main content

Attribute Selection with a Multi-objective Genetic Algorithm

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2507))

Abstract

In this paper we address the problem of multi-objective attribute selection in data mining. We propose a multi-objective genetic algorithm (GA) based on the wrapper approach to discover the best subset of attributes for a given classification algorithm, namely C4.5, a well-known decision-tree algorithm. The two objectives to be minimized are the error rate and the size of the tree produced by C4.5. The proposed GA is a multi-objective method in the sense that it discovers a set of non-dominated solutions (attribute subsets), according to the concept of Pareto dominance.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Bhattacharyya, S., Evolutionary Algorithms in Data mining: Multi-Objective Performance Modeling for Direct Marketing. In: Proc KDD-2000, ACM Press (2000)465–471

    Google Scholar 

  2. Deb, K, Multi-Objective Evolutionary Algorithms: Introducing Bias Among Pareto-Optimal Solutions. Kanpur Genetic Algorithms Laboratory Report n° 99002, India (1999)

    Google Scholar 

  3. Deb, K., Multi-Objective Optimization using Evolutionary Algorithms, John Wiley & Sons, England (2001)

    MATH  Google Scholar 

  4. Fidelis, M.V., Lopes, H.S., Freitas, A.A., Discovering Comprehensible Classification Rules with a Genetic Algorithm. In: Proc. Congress on Evolutionary Computation (2000)

    Google Scholar 

  5. Goldberg, D. E., Genetic Algorithms in Search, Optimization, and Machine Learning. Addison-Wesley Publishing Company (1989)

    Google Scholar 

  6. Freitas, A. A.; Understanding the Crucial Role of Attribute Interaction in Data Mining. In: Artificial Intelligence Review 16, Kluwer Academic Publishers (2001) 177–199

    Article  MATH  Google Scholar 

  7. Freitas, A.A., Data Mining and Knowledge Discovery with Evolutionary Algorithms (forthcoming book). Springer-Verlag (2002)

    Google Scholar 

  8. Holsheimer, M., Siebes, A., Data Mining — The Search for Knowledge in Databases. Report CS-R9406, Amsterdam: CWI (1991)

    Google Scholar 

  9. Ishibuchi, H., Nakashima, T., Multi-objective Pattern and Feature Selection by a Genetic Algorithm. In: Proc. Genetic and Evolutionary Computation Conf. (GECCO-2000), Morgan Kaufmann (2000) 1069–1076

    Google Scholar 

  10. Liu, H.; Motoda, H., Feature Selection for Knowledge Discovery and Data Mining. Kluwer Academic Publishers (1998)

    Google Scholar 

  11. Martin-Bautista, M. J., Vila, M. A., A Survey of Genetic Feature Selection in Mining Issues. In: Proc. IEEE Conference on Evolutionary Computation, Washington (1999) 1314–1321.

    Google Scholar 

  12. Michalewicz, Z., Genetic Algorithms + Data Structures = Evolution Programs. 3rd edn. Springer-Verlag (1996)

    Google Scholar 

  13. Murphy, P.M., Aha, D.W., UCI Repository of Machine Learning databases. [http://www.ics.uci.edu/~mlearn/MLRepository.html]. Irvine, CA: University of California, Departament of information and Computer Science (1994)

    Google Scholar 

  14. Rozspyal, A., Kubat, M., Using Genetic Algorithm to Reduce the Size of a Nearest-Neighbor Classifier and Select Relevant Attributes. Proc. Int. Conf. Machine Learning (ICML-2001), Morgan Kauf. (2001)

    Google Scholar 

  15. Quinlan, J.R., C4.5: Programs for Machine Learning. Morgan Kaufmann (1993)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Pappa, G.L., Freitas, A.A., Kaestner, C.A.A. (2002). Attribute Selection with a Multi-objective Genetic Algorithm. In: Bittencourt, G., Ramalho, G.L. (eds) Advances in Artificial Intelligence. SBIA 2002. Lecture Notes in Computer Science(), vol 2507. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36127-8_27

Download citation

  • DOI: https://doi.org/10.1007/3-540-36127-8_27

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-00124-9

  • Online ISBN: 978-3-540-36127-5

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics