PNPFI: An Efficient Parallel Frequent Itemsets Mining Algorithm | IEEE Conference Publication | IEEE Xplore

PNPFI: An Efficient Parallel Frequent Itemsets Mining Algorithm


Abstract:

Frequent itemsets mining (FIM) plays an important role in many data mining areas. With the explosion of data scale, a number of parallel FIM algorithms have been proposed. Although existing solutions scale well, they consume large amounts of CPU time and memory because they mine frequent itemsets recursively over a tree structure. In this paper, we propose a novel parallel algorithm named PNPFI, which employs three key optimizations. First, itemsets are stored in the N-list structure, which is more compact than existing tree-based structures. Second, a new structure called P-Subsume generates some frequent itemsets without N-list intersection. Third, a new load balancing strategy divides a large-scale FIM problem into a set of tasks based on the profiled load of each item. Experimental results show that, compared with state-of-the-art algorithms, PNPFI improves performance by 39% on average (up to 79%) and reduces memory usage by 58% on average (up to 90%).
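
The abstract does not spell out how tasks are formed from the profiled per-item load, so the following is only a minimal sketch of the general idea, not PNPFI's actual strategy. It assumes each item already has a numeric load estimate and uses a standard greedy heuristic (heaviest item first, assigned to the currently least-loaded worker). The function name `assign_items_to_workers`, the `item_loads` mapping, and the load values are all hypothetical.

```python
import heapq


def assign_items_to_workers(item_loads, num_workers):
    """Greedy longest-processing-time-first assignment.

    item_loads: dict mapping item -> estimated (profiled) load.
                How the load is profiled is an assumption left to the caller.
    Returns (assignment, totals): items per worker and summed load per worker.
    """
    # Min-heap of (current total load, worker id); the lightest worker is on top.
    heap = [(0, w) for w in range(num_workers)]
    heapq.heapify(heap)
    assignment = {w: [] for w in range(num_workers)}

    # Visit items from heaviest to lightest; ties are broken by item name
    # so the result is deterministic.
    for item, load in sorted(item_loads.items(), key=lambda kv: (-kv[1], kv[0])):
        total, worker = heapq.heappop(heap)
        assignment[worker].append(item)
        heapq.heappush(heap, (total + load, worker))

    totals = {
        w: sum(item_loads[i] for i in items) for w, items in assignment.items()
    }
    return assignment, totals
```

For example, calling `assign_items_to_workers({"a": 7, "b": 5, "c": 4, "d": 3, "e": 1}, 2)` places the heaviest item first and then fills whichever worker is lighter, so the two workers end up with equal total load in this small case. A greedy heuristic like this does not guarantee an optimal split in general.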
Date of Conference: 09-11 May 2018
Date Added to IEEE Xplore: 16 September 2018
Conference Location: Nanjing, China
