Abstract
FP-growth has become a popular algorithm for mining frequent patterns. Its metadata structure, the FP-tree, has allowed significant performance improvements over previously reported algorithms. However, that special data structure also restricts the ability to extend the algorithm further. There is also a potential problem when the FP-tree cannot fit into memory. In this paper, we report on the parallel execution of FP-growth. We examine the bottlenecks of the parallelization and a method to balance the execution efficiently in a shared-nothing environment.
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Pramudiono, I., Kitsuregawa, M. (2003). Parallel FP-Growth on PC Cluster. In: Whang, KY., Jeon, J., Shim, K., Srivastava, J. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2003. Lecture Notes in Computer Science, vol 2637. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36175-8_47
DOI: https://doi.org/10.1007/3-540-36175-8_47
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-04760-5
Online ISBN: 978-3-540-36175-6
eBook Packages: Springer Book Archive