CCAIIA: Clustering categorical attributes into interesting association rules

Gray, Brett; Orlowska, M. E.

doi:10.1007/3-540-64383-4_12

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 1394)

Included in the following conference series:

Pacific-Asia Conference on Knowledge Discovery and Data Mining

Abstract

We investigate the problem of mining interesting association rules over a pair of categorical attributes at any level of data granularity. We do this by integrating the rule discovery process with a form of clustering. This allows associations between groups of items to be formed, where the grouping of items is based on maximising the “interestingness” of the associations discovered. Previous work on mining generalised associations assumes either a distance metric on the attribute values or a taxonomy over the items mined. These methods use the metric/taxonomy to limit the space of possible associations that can be found. We develop a measure of the interestingness of association rules based on support and the dependency between the item sets, and use this measure to guide the search. We apply the method to a data set and observe the extraction of “interesting” associations. This method could allow interesting and unexpected associations to be discovered, as the search space is not limited by user-defined hierarchies.
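
The abstract does not state the interestingness measure itself, so the following is only an illustrative sketch of how support and dependency can be computed for a rule between two groups of categorical values. It substitutes leverage, P(A,B) - P(A)P(B), as a stand-in dependency measure; that choice, the function name `group_stats`, and the toy data are all assumptions for illustration and are not taken from the paper.

```python
from fractions import Fraction


def group_stats(pairs, group_a, group_b):
    """Return (support, leverage) for the rule group_a -> group_b.

    pairs is a list of (a, b) values of two categorical attributes.
    Leverage is an assumed stand-in for the paper's dependency term.
    """
    n = len(pairs)
    count_a = sum(1 for a, _ in pairs if a in group_a)
    count_b = sum(1 for _, b in pairs if b in group_b)
    count_ab = sum(1 for a, b in pairs if a in group_a and b in group_b)
    support = Fraction(count_ab, n)
    # Joint frequency expected if the two groups were independent.
    expected = Fraction(count_a, n) * Fraction(count_b, n)
    return support, support - expected


# Hypothetical toy data: (drink, snack) purchases.
pairs = [
    ("beer", "chips"), ("beer", "nuts"), ("cider", "chips"), ("cider", "nuts"),
    ("tea", "cake"), ("tea", "biscuit"), ("coffee", "cake"), ("coffee", "chips"),
]

single = group_stats(pairs, {"beer"}, {"chips"})
grouped = group_stats(pairs, {"beer", "cider"}, {"chips", "nuts"})
print("single :", single)
print("grouped:", grouped)
```

On this toy data the grouped rule has both higher support and higher leverage than the single-item rule, which is the kind of gain a search that clusters values by interestingness would aim for.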

Author information

Authors and Affiliations

School of Information Technology, University of Queensland, QLD 4072, Australia
Brett Gray & M. E. Orlowska

Editor information

Editors and Affiliations

School of Computer Science and Software Engineering, Monash University, 900 Dandenong Road, Caulfield East, Victoria, 3145, Australia
Xindong Wu
Department of Computer Science, The University of Melbourne, Parkville, Victoria, 3052, Australia
Ramamohanarao Kotagiri
School of Computer Science and Engineering, Monash University, Clayton, Victoria, 3168, Australia
Kevin B. Korb

Cite this paper

Gray, B., Orlowska, M.E. (1998). CCAIIA: Clustering categorical attributes into interesting association rules. In: Wu, X., Kotagiri, R., Korb, K.B. (eds) Research and Development in Knowledge Discovery and Data Mining. PAKDD 1998. Lecture Notes in Computer Science, vol 1394. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-64383-4_12

DOI: https://doi.org/10.1007/3-540-64383-4_12
Published: 25 August 2005
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-64383-8
Online ISBN: 978-3-540-69768-8
eBook Packages: Springer Book Archive