Improving the Efficiency of Frequent Pattern Mining by Compact Data Structure Design

Gopalan, Raj P.; Sucahyo, Yudho Giri

doi:10.1007/978-3-540-45080-1_79

Raj P. Gopalan⁷ &
Yudho Giri Sucahyo⁷

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2690))

Included in the following conference series:

International Conference on Intelligent Data Engineering and Automated Learning

968 Accesses
2 Citations

Abstract

Mining frequent patterns has been a topic of active research because it is computationally the most expensive step in association rule discovery. In this paper, we discuss the use of compact data structure design for improving the efficiency of frequent pattern mining. It is based on our work in developing efficient algorithms that outperform the best available frequent pattern algorithms on a number of typical data sets. We discuss improvements to the data structure design that has resulted in faster frequent pattern discovery. The performance of our algorithms is studied by comparing their running times on typical test data sets against the fastest Apriori, Eclat, FP-Growth and OpportuneProject algorithms. We discuss the performance results as well as the strengths and limitations of our algorithms.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Agrawal, R., Imielinski, T., Swami, A.: Mining Association Rules between Sets of Items in Large Databases. In: Proc. of ACM SIGMOD, Washington DC (1993)
Google Scholar
Han, J., Pei, J., Yin, Y.: Mining Frequent Patterns without Candidate Generation. In: Proc. ofACM-SIGMOD, Dallas, TX (2000)
Google Scholar
Pei, J., Han, J., Lu, H., Nishio, S., Tang, S., Yang, D.: H-Mine: Hyper-Structure Mining of Frequent Patterns in Large Databases. In: Proc. of IEEE ICDM, San Jose, California (2001)
Google Scholar
Zaki, M.J.: Scalable Algorithms for Association Mining. IEEE Transactions on Knowledge and Data Engineering 12, 372–390 (2000)
Article Google Scholar
Liu, J., Pan, Y., Wang, K., Han, J.: Mining Frequent Item Sets by Opportunistic Projection. In: Proc. ofACM SIGKDD, Edmonton, Alberta, Canada (2002)
Google Scholar
Gopalan, R.P., Sucahyo, Y.G.: ITL-Mine: Mining Frequent Itemsets More Efficiently. In: Proc. of 2002 Int. Conf. on Fuzzy Systems and Knowledge Discovery, Singapore (2002)
Google Scholar
Gopalan, R.P., Sucahyo, Y.G.: TreeITL-Mine: Mining Frequent Itemsets Using Pattern Growth, Tid Intersection and Prefix Tree. In: Proc. of 15th Australian Joint Conference on Artificial Intelligence, Canberra, Australia (2002)
Google Scholar
Gopalan, R.P., Nuruddin, T., Sucahyo, Y.G.: Building a Data Mining Query Optimizer. In: Proc. of Australasian Data Mining Workshop, Canberra, Australia (2002)
Google Scholar
Gopalan, R.P., Sucahyo, Y.G.: Fast Frequent Itemset Mining using Compressed Data Representation. In: Proc. of ILASTED Int. Conf. on Databases and Applications, Innsbruck, Austria (2003)
Google Scholar
http://fuzzy.cs.uni-magdeburg.de/~borgelt/
http://ww.ics.uci.edu/~mlearn/MLRepository.html
http://ugustus.csscr.washington.edu/census/

Download references

Author information

Authors and Affiliations

Department of Computing, Curtin University of Technology, Kent St, Bentley, Western Australia, 6102
Raj P. Gopalan & Yudho Giri Sucahyo

Authors

Raj P. Gopalan
View author publications
You can also search for this author in PubMed Google Scholar
Yudho Giri Sucahyo
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science, Hong Kong Baptist University, Kowloon Tong, Hong Kong
Jiming Liu
Department of Computer Science, Hong Kong Baptist University, Hong Kong
Yiu-ming Cheung
School of Electrical and Electronic Engineering, University of Manchester, UK
Hujun Yin

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Gopalan, R.P., Sucahyo, Y.G. (2003). Improving the Efficiency of Frequent Pattern Mining by Compact Data Structure Design. In: Liu, J., Cheung, Ym., Yin, H. (eds) Intelligent Data Engineering and Automated Learning. IDEAL 2003. Lecture Notes in Computer Science, vol 2690. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-45080-1_79

Download citation

DOI: https://doi.org/10.1007/978-3-540-45080-1_79
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40550-4
Online ISBN: 978-3-540-45080-1
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics