CCAIIA: Clustering categorical attributes into interesting association rules

Research and Development in Knowledge Discovery and Data Mining (PAKDD 1998)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 1394))

Abstract

We investigate the problem of mining interesting association rules over a pair of categorical attributes at any level of data granularity. We do this by integrating the rule discovery process with a form of clustering. This allows associations to be formed between groups of items, where the grouping of items is based on maximising the “interestingness” of the associations discovered. Previous work on mining generalised associations assumes either a distance metric on the attribute values or a taxonomy over the items mined. These methods use the metric or taxonomy to limit the space of possible associations that can be found. We develop a measure of the interestingness of association rules based on support and the dependency between the item sets, and use this measure to guide the search. We apply the method to a data set and observe the extraction of “interesting” associations. This method could allow interesting and unexpected associations to be discovered, because the search space is not limited by user-defined hierarchies.
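As a rough illustration of how a measure built from support and dependency can favour grouping items, consider the sketch below. The abstract does not state the paper's formula, so the score used here (support multiplied by the deviation of the joint frequency from what independence would predict) is an assumption, as are the function name `interestingness` and the toy records.

```python
def interestingness(pairs, group_a, group_b):
    """Score the rule group_a -> group_b over (a, b) records.

    Assumed measure, not the paper's definition:
    support * (support - P(A) * P(B)).
    """
    n = len(pairs)
    # Frequency of each side of the rule on its own.
    p_a = sum(1 for a, _ in pairs if a in group_a) / n
    p_b = sum(1 for _, b in pairs if b in group_b) / n
    # Support: fraction of records matching both sides.
    support = sum(1 for a, b in pairs if a in group_a and b in group_b) / n
    # Dependency: how far the joint frequency departs from independence.
    dependency = support - p_a * p_b
    return support * dependency


# Hypothetical records over two categorical attributes.
pairs = (
    [("tea", "biscuit")] * 3
    + [("coffee", "biscuit")] * 3
    + [("juice", "fruit")] * 4
)

single = interestingness(pairs, {"tea"}, {"biscuit"})
merged = interestingness(pairs, {"tea", "coffee"}, {"biscuit"})
print(single, merged)
```

On these toy records the grouped rule {tea, coffee} → {biscuit} scores higher than {tea} → {biscuit}. A search that merges items whenever the score improves would therefore form this group without being given a taxonomy or a distance metric.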

Copyright information

© 1998 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Gray, B., Orlowska, M.E. (1998). CCAIIA: Clustering categorical attributes into interesting association rules. In: Wu, X., Kotagiri, R., Korb, K.B. (eds) Research and Development in Knowledge Discovery and Data Mining. PAKDD 1998. Lecture Notes in Computer Science, vol 1394. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-64383-4_12

  • DOI: https://doi.org/10.1007/3-540-64383-4_12

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-64383-8

  • Online ISBN: 978-3-540-69768-8

  • eBook Packages: Springer Book Archive
