Skip to main content

An Appropriate Abstraction for an Attribute-Oriented Induction

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 1721))

Abstract

An attribute-oriented induction is a useful data mining method that generalizes databases under an appropriate abstraction hierarchy to extract meaningful knowledge. The hierarchy is well designed so as to exclude meaningless rules from a particular point of view. However, there may exist several ways of generalizing databases according to user’s intention. It is therefore important to provide a multi-layered abstraction hierarchy under which several generalizations are possible and are well controlled. In fact, too-general or too-specific databases are inappropriate for mining algorithms to extract significant rules. From this viewpoint, this paper proposes a generalization method based on an information theoretical measure to select an appropriate abstraction hierarchy. Furthermore, we present a system, called ITA (Information Theoretical Abstraction), based on our method and an attribute-oriented induction. We perform some practical experiments in which ITA discovers meaningful rules from a census database US Census Bureau and discuss the validity of ITA based on the experimental results.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Adriaans, P. and Zantinge, D.: Data Mining, Addison Wesley Longman Ltd., 1996.

    Google Scholar 

  2. Arimoto, S: Probability, Information, Entropy, Morikita Shuppan, 1980 (in Japanese)

    Google Scholar 

  3. Fayyad, U.N., Piatetsky-Shapiro, G., Smyth, P. and Uthurusamy, R. (eds.): Advances in Knowledge Discovery and Data Mining, AAAI/MIT Press, 1996.

    Google Scholar 

  4. Fayyad, U.N., Piatetsky-Shapiro, G., Smyth, P.: From Data Mining to Knowledge Discovery: an Overview. In Piatetsky-Shapiro, G., Smyth, P. and Uthurusamy, R. (eds.) Advances in Knowledge Discovery and Data Mining, AAAI/MIT Press, 1996 [3], pp.1–33

    Google Scholar 

  5. Han, J., Cai, Y. and Cercone, N.: Knowledge Discovery in Databases: An Attribute-Oriented Approach, Proceeding of VLDB’92, Canada, pp. 547–559, 1992.

    Google Scholar 

  6. Han, J. and Fu, Y.: Attribute-Oriented Induction in Data Mining. In Piatetsky-Shapiro, G., Smyth, P. and Uthurusamy, R. (eds.) Advances in Knowledge Discovery and Data Mining, AAAI/MIT Press, 1996 [3], pp.399–421

    Google Scholar 

  7. Matsumoto, K., Morita, C. and Tsukimoto, H. Generalized Rule Discovery in Databases by Finding Similarities In: SIG-J-9401-15, pp.111–118, Japanese Society for Artificial Intelligence, 1994.

    Google Scholar 

  8. Miller, G.A.: Nouns in WordNet: a lexical inheritance system, In: International Journal of Lexicography 3(4), pp. 245–264, 1990.

    Article  Google Scholar 

  9. Murphy, P.M. and Aha, D.W.: UCI Repository of machine learning databases, http://www.ics.uci.edu/mlearn/MLRepository.html.

    Google Scholar 

  10. Quinlan, J.R.: C4.5-Programs for Machine Learning, Morgan Kaufmann, 1993.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1999 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Kudoh, Y., Haraguchi, M. (1999). An Appropriate Abstraction for an Attribute-Oriented Induction. In: Arikawa, S., Furukawa, K. (eds) Discovery Science. DS 1999. Lecture Notes in Computer Science(), vol 1721. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-46846-3_5

Download citation

  • DOI: https://doi.org/10.1007/3-540-46846-3_5

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-66713-1

  • Online ISBN: 978-3-540-46846-2

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics