Abstract
An important problem in data mining is the discovery of association rules that identify relationships among sets of items. Finding frequent itemsets is computationally the most expensive step in association rule mining, so most research attention has focused on it. In this paper, we present a more efficient algorithm for mining frequent itemsets. In designing our algorithm, we have combined the ideas of pattern growth, tid intersection and prefix trees, with significant modifications. We compare the performance of our algorithm against the fastest Apriori algorithm and the recently developed H-Mine algorithm. We have tested all the algorithms on several widely used test datasets. The results indicate that our algorithm significantly reduces the processing time for mining frequent itemsets in dense datasets that contain relatively long patterns.
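To make the tid-intersection idea named in the abstract concrete, the following is a minimal sketch of generic vertical mining: each item keeps the set of transaction ids (tids) in which it occurs, and the support of a longer itemset is the size of the intersection of those sets. This is not the TreeITL-Mine algorithm itself, whose data structures are not described in the abstract. The function names `build_tidlists` and `mine_frequent_itemsets`, and the use of an absolute `min_support` count, are assumptions made for illustration only.

```python
from typing import Dict, FrozenSet, Iterable, List, Set, Tuple


def build_tidlists(transactions: Iterable[Iterable[str]]) -> Dict[str, Set[int]]:
    """Map each item to the set of transaction ids (tids) that contain it."""
    tidlists: Dict[str, Set[int]] = {}
    for tid, items in enumerate(transactions):
        for item in set(items):
            tidlists.setdefault(item, set()).add(tid)
    return tidlists


def mine_frequent_itemsets(
    transactions: List[List[str]], min_support: int
) -> Dict[FrozenSet[str], int]:
    """Return every itemset whose support count is at least min_support.

    Illustrative depth-first search over tid-list intersections;
    hypothetical helper, not the authors' implementation.
    """
    tidlists = build_tidlists(transactions)
    frequent: Dict[FrozenSet[str], int] = {}

    # Keep only frequent single items, in a fixed order so that each
    # itemset is generated exactly once.
    start: List[Tuple[str, Set[int]]] = sorted(
        ((item, tids) for item, tids in tidlists.items() if len(tids) >= min_support),
        key=lambda pair: pair[0],
    )

    def extend(prefix: Tuple[str, ...], candidates: List[Tuple[str, Set[int]]]) -> None:
        for i, (item, tids) in enumerate(candidates):
            itemset = prefix + (item,)
            frequent[frozenset(itemset)] = len(tids)
            # Intersect with the tid lists of later items only; the size of
            # each intersection is the support of the extended itemset.
            narrower: List[Tuple[str, Set[int]]] = []
            for other, other_tids in candidates[i + 1:]:
                shared = tids & other_tids
                if len(shared) >= min_support:
                    narrower.append((other, shared))
            if narrower:
                extend(itemset, narrower)

    extend((), start)
    return frequent
```

Calling `mine_frequent_itemsets(transactions, min_support)` on a list of item lists returns a dictionary from each frequent itemset to its support count. Because tid lists can only shrink as itemsets grow, an extension that falls below `min_support` is never explored further.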
© 2002 Springer-Verlag Berlin Heidelberg
Cite this paper
Gopalan, R.P., Sucahyo, Y.G. (2002). TreeITL-Mine: Mining Frequent Itemsets Using Pattern Growth, Tid Intersection, and Prefix Tree. In: McKay, B., Slaney, J. (eds) AI 2002: Advances in Artificial Intelligence. AI 2002. Lecture Notes in Computer Science, vol 2557. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36187-1_47
DOI: https://doi.org/10.1007/3-540-36187-1_47
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-00197-3
Online ISBN: 978-3-540-36187-9
eBook Packages: Springer Book Archive