Abstract
Most algorithms for frequent pattern mining use a support-based pruning strategy to prune a combinatorial search space. However, they are not effective for finding correlated patterns with similar levels of support. In additional, traditional patterns mining algorithms rarely consider weighted pattern mining. In this paper, we present a new algorithm, WHFPMiner (Weighted Highly-correlated Frequent Patterns Miner) in which a new objective measure, called weighted h-confidence, is developed to mine weighted highly-correlated frequent patterns with similar levels of weighted support. Adopting an improved weighted FP-tree structure, this algorithm exploits both cross-weighted support and anti-monotone properties of the weighted h-confidence measure for the efficient discovery of weighted hyperclique patterns. A comprehensive performance study shows that WHFPMiner is efficient and fast for finding weighted highly-correlated frequent patterns. Moreover, it generates fewer but more valuable patterns with the high correlation.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Cheung, Y.L., Fu, A.W.: Mining Frequent Itemsets without Support Threshold: with and without Item Constraints. IEEE Transactions on Knowledge and Data Engineering 16, 1052–1069 (2004)
Wang, K., He, Y., Cheung, D., Chin, Y.: Mining Confident Rules without Support Requirement. In: 2001 ACM CIKM International Conference on Information and Knowledge Management, pp. 89–96. ACM Press, New York (2001)
Omiecinski, E.R.: Alternative Interest Measures for Mining Associations in Databases. IEEE Transactions on Knowledge and Data Engineering 15, 57–69 (2003)
Tan, P.N., Kumar, V., Srivastava, J.: Selecting the Right Interestingness Measure for Association Patterns. In: 8th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 32–41. ACM, New York (2002)
Wang, W., Yang, J., Yu, P.S.: Efficient Mining of Weighted Association Rules (WAR). In: 6th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 270–274. ACM, New York (2000)
Cai, C.H., Fu, A.W.C., Cheng, C.H., Kwong, W.W.: Mining Association Rules with Weighted Items. In: 998 International Database Engineering and Applications Symposium, pp. 68–77. IEEE Computer Society, Washington (1998)
Agrawal, R., Srikant, R.: Fast Algorithms for Mining Association Rules in Large Databases. In: 20th International Conference on Very Large Data Bases, pp. 487–499. Morgan Kaufmann, San Francisco (1994)
Han, J., Pei, J., Yin, Y.: Mining Frequent Patterns without Candidate Generation. In: 2000 ACM SIGMOD International Conference on Management of Data, pp. 1–12. ACM, New York (2000)
Wang, J., Pei, J., Han, J.: Closet+: Searching for the Best Strategies for Mining Frequent Closed Itemsets. In: 8th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 236–245. ACM, New York (2003)
Grahne, G., Zhu, J.: High Performance Mining of Maximal Frequent Itemsets. In: 6th SIAM International Workshop on High Performance Data Mining, pp. 135–143. SIAM, Philadelphia (2003)
Xiong, H., Tan, P.N., Kumar, V.: Mining Strong Affinity Association Patterns in Data Sets with Skewed Support Distribution. In: 3rd IEEE International Conference on Data Mining, pp. 387–394. IEEE Computer Society, Washington (2003)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Geng, R., Dong, X., Zhao, J., Xu, W. (2008). WHFPMiner: Efficient Mining of Weighted Highly-Correlated Frequent Patterns Based on Weighted FP-Tree Approach. In: Sun, F., Zhang, J., Tan, Y., Cao, J., Yu, W. (eds) Advances in Neural Networks - ISNN 2008. ISNN 2008. Lecture Notes in Computer Science, vol 5264. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-87734-9_83
Download citation
DOI: https://doi.org/10.1007/978-3-540-87734-9_83
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-87733-2
Online ISBN: 978-3-540-87734-9
eBook Packages: Computer ScienceComputer Science (R0)