Skip to main content

Feature Construction and Feature Selection in Presence of Attribute Interactions

  • Conference paper
Hybrid Artificial Intelligence Systems (HAIS 2009)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5572))

Included in the following conference series:

Abstract

When used for data reduction, feature selection may successfully identify and discard irrelevant attributes, and yet fail to improve learning accuracy because regularities in the concept are still opaque to the learner. In that case, it is necessary to highlight regularities by constructing new characteristics that abstract the relations among attributes. This paper highlights the importance of feature construction when attribute interaction is the main source of learning difficulty and the underlying target concept is hard to discover by a learner using only primitive attributes. An empirical study centered on predictive accuracy shows that feature construction significantly outperforms feature selection because, even when done perfectly, detection of interacting attributes does not sufficiently facilitates discovering the target concept.

Supported by the Spanish Ministry of Science and Technology, TIN2008-02081.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Liu, H., Motoda, H. (eds.): Feature Extraction, Construction and Selection: A Data Mining Perspective, vol. 453. Kluwer Academic Publishers, MA (1998)

    MATH  Google Scholar 

  2. Liu, H., Motoda, H. (eds.): Computational Methods of Feature Selection. Data Mining and Knowledge Discovery Series. Chapman & Hall/CRC, Boca Raton (2007)

    MATH  Google Scholar 

  3. Shafti, L.S., Pérez, E.: Fitness function comparison for GA-based feature construction. In: Borrajo, D., Castillo, L., Corchado, J.M. (eds.) CAEPIA 2007. LNCS, vol. 4788, pp. 249–258. Springer, Heidelberg (2007)

    Chapter  Google Scholar 

  4. Shafti, L.S., Pérez, E.: Data reduction by genetic algorithms and non-algebraic feature construction: A case study. In: Proceedings of HIS (2008)

    Google Scholar 

  5. Blake, C., Merz, C.: UCI repository of machine learning databases (1998)

    Google Scholar 

  6. Holte, R.C.: Very simple classification rules perform well on most commonly used datasets. Machine Learning 11, 63–91 (1993)

    Article  MATH  Google Scholar 

  7. Jakulin, A., Bratko, I., et al.: Attribute interactions in medical data analysis. In: Dojat, M., Keravnou, E.T., Barahona, P. (eds.) AIME 2003. LNCS, vol. 2780, pp. 229–238. Springer, Heidelberg (2003)

    Chapter  Google Scholar 

  8. Danyluk, A.P., Provost, F.J.: Small disjuncts in action: Learning to diagnose errors in the local loop of the telephone network. In: Proceedings of ICML (1993)

    Google Scholar 

  9. Rendell, L.A., Seshu, R.: Learning hard concepts through constructive induction: Framework and rationale. Computational Intelligence 6, 247–270 (1990)

    Article  Google Scholar 

  10. Freitas, A.A.: Understanding the crucial role of attribute interaction in data mining. Artificial Intelligence Review 16(3), 177–199 (2001)

    Article  MATH  Google Scholar 

  11. Zhao, Z., Liu, H.: Searching for interacting features. In: Veloso, M.M. (ed.) Proceedings of IJCAI, Hyderabad, India, pp. 1156–1161 (January 2007)

    Google Scholar 

  12. Bloedorn, E., Michalski, R.S.: Data-driven constructive induction: Methodology and applications. In: [1], pp. 51–68

    Google Scholar 

  13. Michalski, R.S.: Pattern recognition as knowledge-guided computer induction. Technical Report 927, Dept. of Computer Science, University of Illinois (1978)

    Google Scholar 

  14. Quinlan, J.R.: C4.5: Programs for Machine Learning. Morgan Kaufmann, San Francisco (1993)

    Google Scholar 

  15. Vafaie, H., DeJong, K.: Feature space transformation using genetic algorithms. IEEE Intelligent Systems 13(2), 57–65 (1998)

    Article  Google Scholar 

  16. Pérez, E., Rendell, L.A.: Using multidimensional projection to find relations. In: Proceedings of the Twelfth ICML, pp. 447–455. Morgan Kaufmann, San Francisco (1995)

    Google Scholar 

  17. Pazzani, M.: Constructive induction of cartesian product attributes. In: [1], pp. 341–354

    Google Scholar 

  18. Zupan, B., Bratko, I., et al.: Function decomposition in machine learning. Machine Learning and Its Applications, Advanced Lectures, 71–101 (2001)

    Google Scholar 

  19. Grunwald, P.D.: The Minimum Description Length Principle. MIT Press, Cambridge (2007)

    Google Scholar 

  20. Shafti, L.S.: Multi-feature construction based on genetic algorithms and non-algebraic feature representation to facilitate learning concepts with complex interactions. Ph.D thesis, EPS, Universidad Autonoma de Madrid (2008)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Shafti, L.S., Pérez, E. (2009). Feature Construction and Feature Selection in Presence of Attribute Interactions. In: Corchado, E., Wu, X., Oja, E., Herrero, Á., Baruque, B. (eds) Hybrid Artificial Intelligence Systems. HAIS 2009. Lecture Notes in Computer Science(), vol 5572. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-02319-4_71

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-02319-4_71

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-02318-7

  • Online ISBN: 978-3-642-02319-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics