Parallel FP-Growth on PC Cluster

Pramudiono, Iko; Kitsuregawa, Masaru

doi:10.1007/3-540-36175-8_47

Iko Pramudiono⁵ &
Masaru Kitsuregawa⁵

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2637))

Included in the following conference series:

Pacific-Asia Conference on Knowledge Discovery and Data Mining

1435 Accesses
29 Citations

Abstract

FP-growth has become a popular algorithm to mine frequent patterns. Its metadata FP-tree has allowed significant performance improvement over previously reported algorithms. However that special data structure also restrict the ability for further extensions. There is also potential problem when FP-tree can not fit into the memory. In this paper, we report parallel execution of FP-growth. We examine the bottlenecks of the parallelization and also method to balance the execution efficiently on shared-nothing environment.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

R. Agarwal, C. Aggarwal and V. V. V. Prasad “A Tree Projection Algorithm for Generation of Frequent Itemsets”. In J. Parallel and Distributed Computing, 2000
Google Scholar
R. Agrawal and R. Srikant. “Fast Algorithms for Mining Association Rules”. In Proceedings of the 20th International Conference on VLDB, pp. 487–499, September 1994.
Google Scholar
R. Agrawal and J. C. Shafer. “Parallel Mining of Associaton Rules”. In IEEE Transaction on Knowledge and Data Engineering, Vol. 8, No. 6, pp. 962–969, December, 1996.
Article Google Scholar
J. Han, J. Pei and Y. Yin “Mining Frequent Pattern without Candidate Generation” In Proc. of the ACM SIGMOD Conference on Management of Data, 2000
Google Scholar
J.S. Park, M.-S. Chen, P.S. Yu “Efficient Parallel Algorithms for Mining Association Rules” In Proc. of 4th International Conference on Information and Knowledge Management (CIKM’95), pp. 31–36, November, 1995
Google Scholar
T. Shintani and M. Kitsuregawa “Hash Based Parallel Algorithms for Mining Association Rules”. In IEEE Fourth International Conference on Parallel and Distributed Information Systems, pp. 19–30, December 1996.
Google Scholar

Download references

Author information

Authors and Affiliations

Institute of Industrial Science, The University of Tokyo, 4-6-1 Komaba, Meguro-ku, Tokyo, 153-8505, Japan
Iko Pramudiono & Masaru Kitsuregawa

Authors

Iko Pramudiono
View author publications
You can also search for this author in PubMed Google Scholar
Masaru Kitsuregawa
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Computer Science Department, Korea Advanced Institute of Science and Technology, 373-1 Koo-Sung Dong, Yoo-Sung Ku, Daejeon, 305-701, Korea
Kyu-Young Whang
Department of Statistics, Seoul National University, Sillimdong Kwanakgu, Seoul, 151-742, Korea
Jongwoo Jeon
School of Electrical Engineering and Computer Science, Seoul National University, Kwanak P.O. Box 34, Seoul, 151-742, Korea
Kyuseok Shim
Department of Computer Science and Engineering, University of Minnesota, 200 Union St SE, Minneapolis, MN, 55455, USA
Jaideep Srivastava

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Pramudiono, I., Kitsuregawa, M. (2003). Parallel FP-Growth on PC Cluster. In: Whang, KY., Jeon, J., Shim, K., Srivastava, J. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2003. Lecture Notes in Computer Science(), vol 2637. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36175-8_47

Download citation

DOI: https://doi.org/10.1007/3-540-36175-8_47
Published: 30 April 2003
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-04760-5
Online ISBN: 978-3-540-36175-6
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics