TreeITL-Mine: Mining Frequent Itemsets Using Pattern Growth, Tid Intersection, and Prefix Tree

Gopalan, Raj P.; Sucahyo, Yudho Giri

doi:10.1007/3-540-36187-1_47

Raj P. Gopalan³ &
Yudho Giri Sucahyo³

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2557))

Included in the following conference series:

Australian Joint Conference on Artificial Intelligence

1131 Accesses
5 Citations

Abstract

An important problem in data mining is the discovery of association rules that identify relationships among sets of items. Finding frequent itemsets is computationally the most expensive step in association rules mining, and so most of the research attention has been focused on it. In this paper, we present a more efficient algorithm for mining frequent itemsets. In designing our algorithm, we have combined the ideas of pattern-growth, tid-intersection and prefix trees, with significant modifications. We present performance comparisons of our algorithm against the fastest Apriori algorithm, and the recently developed H-Mine algorithm. We have tested all the algorithms using several widely used test datasets. The performance results indicate that our algorithm significantly reduces the processing time for mining frequent itemsets in dense data sets that contain relatively long patterns.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Agrawal, R., Imielinski, T., Swami, A.: Mining Association Rules between Sets of Items in Large Databases. Proc. of ACM SIGMOD, Washington DC (1993)
Google Scholar
Agrawal, R., Srikant, R.: Fast Algorithms for Mining Association Rules. Proc. of the 20th Int. Conf. on VLDB, Santiago, Chile (1994)
Google Scholar
Han, J., Pei, J., Yin, Y.: Mining Frequent Patterns without Candidate Generation. Proc. of ACM-SIGMOD, Dallas, TX (2000)
Google Scholar
Pei, J., Han, J., Lu, H., Nishio, S., Tang, S., Yang, D.: H-Mine: Hyper-Structure Mining of Frequent Patterns in Large Databases. Proc. of the 2001 IEEE ICDM, San Jose, California (2001)
Google Scholar
Agarwal, R., Aggarwal, C., Prasad, V.V.V.: A Tree Projection Algorithm for Generation of Frequent Itemsets. Journal of Parallel and Distributed Computing (Special Issue on High Performance Data Mining) (2000)
Google Scholar
Shenoy, P., et al. Turbo-charging Vertical Mining of Large Databases. Proc. of ACMSIGMOD, Dallas, TX USA (2000).
Google Scholar
Zaki, M.J., Scalable Algorithms for Association Mining. IEEE Transactions on Knowledge and Data Engineering. 12(3) (May/June 2000) 372–390.
Article MathSciNet Google Scholar
Zaki, M.J. Gouda, K., Fast Vertical Mining Using Diffsets. RPI Technical Report 01-1. Rensselaer Polytechnic Institute, Troy, NY 12180 USA: New York (2001)
Google Scholar
Pei, J., Han, J., Lakshmanan, L.V.S.: Mining Frequent Itemsets with Convertible Constraints. Proc. of 17th ICDE, Heidelberg, Germany (2001)
Google Scholar

Download references

Author information

Authors and Affiliations

School of Computing, Curtin University of Technology, Kent St, 6102, Bentley, Western Australia
Raj P. Gopalan & Yudho Giri Sucahyo

Authors

Raj P. Gopalan
View author publications
You can also search for this author in PubMed Google Scholar
Yudho Giri Sucahyo
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Australian Defence Force Academy, University of New South Wales, ACT 2600, Canberra, Australia
Bob McKay
Computer Science Laboratory, Australian National University, RSISE Building, ACT 0200, Canberra, Australia
John Slaney

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Gopalan, R.P., Sucahyo, Y.G. (2002). TreeITL-Mine: Mining Frequent Itemsets Using Pattern Growth, Tid Intersection, and Prefix Tree. In: McKay, B., Slaney, J. (eds) AI 2002: Advances in Artificial Intelligence. AI 2002. Lecture Notes in Computer Science(), vol 2557. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36187-1_47

Download citation

DOI: https://doi.org/10.1007/3-540-36187-1_47
Published: 08 November 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-00197-3
Online ISBN: 978-3-540-36187-9
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics