Abstract
The performance of association rule mining in terms of computation time and number of redundant rules generated deteriorates as the size of database increases and/or support threshold used is smaller. In this paper, we present a new approach called SARM — succinct association rule mining, to enhance the association mining. Our approach is based on our understanding of the mining process that items become less useful as mining proceeds, and that such items can be eliminated to accelerate the mining and to reduce the number of redundant rules generated. We propose a new paradigm that an item becomes less useful when the most interesting rules involving the item have been discovered and deleting it from the mining process will not result in any significant loss of information. SARM generates a compact set of rules called succinct association rule (SAR) set that is largely free of redundant rules. SARM is efficient in association mining, especially when support threshold used is small. Experiments are conducted on both synthetic and real-life databases. SARM approach is especially suitable for applications where rules with small support may be of significant interest. We show that for such applications SAR set can be mined efficiently.
This research was supported in part by NSF Grant No. EIA-0091530, USDA RMA Grant NO. 02IE08310228, and NSF EPSCOR, Grant No. EPS-0346476.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Bacchus, F.: Representing and Reasoning With Probabilistic Knowledge. MIT, Cambridge (1990)
Deogun, J., Jiang, L., Raghavan, V.: Discovering maximal potentially useful association rules based on probability logics. In: Proc. of Rough Sets and Current Trends in Computing, 4th International Conf. (2004)
Deogun, J., Jiang, L., Xie, Y., Raghavan, V.: Probability logic modeling of knowledge discovery in databases. In: Proc. of Foundations of Intelligent Systems, 14th International Symposium, ISMIS (2003)
Agrawal, R., et al.: Mining association rules between sets of items in large databases. In: Proc. of ACM SIGMOD Intern. Conf. on Management of Data (1993)
Ganter, B., Wille, R.: Formal Concept Analsis: Mathematical Foundations. Springer, Berlin (1999)
Kryszkiewicz, M.: Closed set based discovery of representative association rules. In: Advances in Intelligent Data Analysis, 4th International Conf. (2001)
Orlando, S., et al.: A scalable multi-strategy algorithm for counting frequent sets. In: Proc. of the 5th Intern. Workshop on High Performance Data Mining (2002)
Zheng, Z., Kohavi, R., Mason, L.: Real world performance of association rule algorithms. In: Proc. of the 7th ACM SIGKDD International Conf. on Knowledge Discovery and Data Mining (2001)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Deogun, J., Jiang, L. (2005). SARM — Succinct Association Rule Mining: An Approach to Enhance Association Mining. In: Hacid, MS., Murray, N.V., Raś, Z.W., Tsumoto, S. (eds) Foundations of Intelligent Systems. ISMIS 2005. Lecture Notes in Computer Science(), vol 3488. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11425274_13
Download citation
DOI: https://doi.org/10.1007/11425274_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-25878-0
Online ISBN: 978-3-540-31949-8
eBook Packages: Computer ScienceComputer Science (R0)