Abstract
We investigate the problem of mining interesting association rules over a pair of categorical attributes at any level of data granularity. We do this by integrating the rule discovery process with a form of clustering. This allows associations between groups of ;items to be formed where the groping of items is based on maximising the “interestingness” of the associations discovered. Previous work on mining generalised associations assumes either a distance metric on the attribute values or a taxonomy over the items mined. These methods use the metric/taxonomy to limit the space of possible associations that can be found. We develop a measure of the interestingness of association rules based on support and the dependency between the item sets and use this measure to guide the search. We apply the method to a data set and observe the extraction of “interesting” associations. This method could allow interesting and unexpected associations to be discovered as the search space is not being limited by user defined hierarchies.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Rakesh Agrawal, Tomasz Imielinski, and Arun Swami. Mining association rules between sets of items in large databases. In Proceedings of the ACM SIGMOD Conference on Management of Data, pages 207–216, Washington, May 1993.
Rakesh Agrawal, Heikki Mannila, Ramakrishnan Srikant, Hannu Toivonen, and A. Inkeri Verkamo. Advances in knowledge discovery and data mining, chapter Fast discovery of association rules. AAAI Press/The MIT Press, 1996.
Anna Andrusiewicz and M. E. Orlowska. On data granularity factors that affect data mining. In Proceedings of the 8th International Database Workshop, 1997.
Ming-Sayn Chen, Jiawei Han, and Philip S. Yu. Data mining: An overview from database perspective. IEEE Transactions on knowledge and data engineering, 1997.
Martin Easter, Hans-Peter Kriegel, Jorg Sander, and Xiaowei Xu. A density based algorithm for discovering clusters in large spatial databases with noise. In Proceedings of the 2nd International Conference on Knowledge Discovery and Data Mining, 1996.
Douglas Fisher. Improving inference through conceptual clustering. In Proc. 1987 AAAI Conf., pages 461–465, July 1987.
Takeshi Fukuda, Yasuhiko Morimoto, Shinichi Morishita, and Takeshi Tokuyama. Data mining using two-dimensional optimized association rules: Scheme, algorithms, and visualization. In Proceedings of the ACM-SIGMOD Conference on Management of data, Montreal. Canada, 1996.
Brett Gray and M. E. Orlowska. The use of clustering to mine interesting association rules. Technical Report TR425, School of Computer Science and Electrical Engineering, The University of Queensland, 1998.
Jiawei Han and Yongjian Fu. Discovery of multiple-level association rules from large databases. In Proceedings of the 21st VLDB Conference, Zurich, Swizerland, 1995.
Ryszard S. Michalski and Robert E. Stepp. Automated construction of classifications: Conceptual clustering versus numerical taxonomy. IEEE Transactions on Pattern Analysis and Machine Intelligence, PAMI-5(4), July 1983.
Raymond T. Ng and Jiawei Han. Efficient and effective clustering methods for spatial data mining. In Proceeding of the 20th VLDB Conference, Santiago, Chile, 1994.
Ramakrishnan Srikant and Rakesh Agrawal. Mining generalized association rules. In Proceedings of the 21st VLDB Conference, Zurich, Swizerland, 1995.
Ramakrishnan Srikant and Rakesh Agrawal. Mining qantitative association rules in large relational tables. In Proceedings of the ACM-SIGMOD Conference on Management of data, Montreal, Canada, 1996.
Hannu Toivonen. Sampling large databases for association rules. In Proceedings of the 22nd VLDB Conference, Mumbai(Bombay), India, 1996.
Tian Zhang, Raghu Ramakrishnan, and Miron Livny. BIRCH: An efficient data clustering method for very large databases. In Proceedings of the ACM-SIGMOD Conference on Management of data. Montreal. Canada, 1996.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1998 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Gray, B., Orlowska, M.E. (1998). CCAIIA: Clustering categorical attributes into interesting association rules. In: Wu, X., Kotagiri, R., Korb, K.B. (eds) Research and Development in Knowledge Discovery and Data Mining. PAKDD 1998. Lecture Notes in Computer Science, vol 1394. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-64383-4_12
Download citation
DOI: https://doi.org/10.1007/3-540-64383-4_12
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-64383-8
Online ISBN: 978-3-540-69768-8
eBook Packages: Springer Book Archive