Skip to main content

Improving the Efficiency of Frequent Pattern Mining by Compact Data Structure Design

  • Conference paper
Book cover Intelligent Data Engineering and Automated Learning (IDEAL 2003)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2690))

Abstract

Mining frequent patterns has been a topic of active research because it is computationally the most expensive step in association rule discovery. In this paper, we discuss the use of compact data structure design for improving the efficiency of frequent pattern mining. It is based on our work in developing efficient algorithms that outperform the best available frequent pattern algorithms on a number of typical data sets. We discuss improvements to the data structure design that has resulted in faster frequent pattern discovery. The performance of our algorithms is studied by comparing their running times on typical test data sets against the fastest Apriori, Eclat, FP-Growth and OpportuneProject algorithms. We discuss the performance results as well as the strengths and limitations of our algorithms.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Agrawal, R., Imielinski, T., Swami, A.: Mining Association Rules between Sets of Items in Large Databases. In: Proc. of ACM SIGMOD, Washington DC (1993)

    Google Scholar 

  2. Han, J., Pei, J., Yin, Y.: Mining Frequent Patterns without Candidate Generation. In: Proc. ofACM-SIGMOD, Dallas, TX (2000)

    Google Scholar 

  3. Pei, J., Han, J., Lu, H., Nishio, S., Tang, S., Yang, D.: H-Mine: Hyper-Structure Mining of Frequent Patterns in Large Databases. In: Proc. of IEEE ICDM, San Jose, California (2001)

    Google Scholar 

  4. Zaki, M.J.: Scalable Algorithms for Association Mining. IEEE Transactions on Knowledge and Data Engineering 12, 372–390 (2000)

    Article  Google Scholar 

  5. Liu, J., Pan, Y., Wang, K., Han, J.: Mining Frequent Item Sets by Opportunistic Projection. In: Proc. ofACM SIGKDD, Edmonton, Alberta, Canada (2002)

    Google Scholar 

  6. Gopalan, R.P., Sucahyo, Y.G.: ITL-Mine: Mining Frequent Itemsets More Efficiently. In: Proc. of 2002 Int. Conf. on Fuzzy Systems and Knowledge Discovery, Singapore (2002)

    Google Scholar 

  7. Gopalan, R.P., Sucahyo, Y.G.: TreeITL-Mine: Mining Frequent Itemsets Using Pattern Growth, Tid Intersection and Prefix Tree. In: Proc. of 15th Australian Joint Conference on Artificial Intelligence, Canberra, Australia (2002)

    Google Scholar 

  8. Gopalan, R.P., Nuruddin, T., Sucahyo, Y.G.: Building a Data Mining Query Optimizer. In: Proc. of Australasian Data Mining Workshop, Canberra, Australia (2002)

    Google Scholar 

  9. Gopalan, R.P., Sucahyo, Y.G.: Fast Frequent Itemset Mining using Compressed Data Representation. In: Proc. of ILASTED Int. Conf. on Databases and Applications, Innsbruck, Austria (2003)

    Google Scholar 

  10. http://fuzzy.cs.uni-magdeburg.de/~borgelt/

  11. http://ww.ics.uci.edu/~mlearn/MLRepository.html

  12. http://ugustus.csscr.washington.edu/census/

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2003 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Gopalan, R.P., Sucahyo, Y.G. (2003). Improving the Efficiency of Frequent Pattern Mining by Compact Data Structure Design. In: Liu, J., Cheung, Ym., Yin, H. (eds) Intelligent Data Engineering and Automated Learning. IDEAL 2003. Lecture Notes in Computer Science, vol 2690. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-45080-1_79

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-45080-1_79

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-40550-4

  • Online ISBN: 978-3-540-45080-1

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics