Abstract
Mining weighted interesting patterns (WIP) [5] is an important research issue in data mining and knowledge discovery with broad applications. WIP can detect correlated patterns with a strong weight and/or support affinity. However, it still requires two database scans which are not applicable for efficient processing of the real-time data like data streams. In this paper, we propose a novel tree structure, called SPWIP-tree (Single-pass Weighted Interesting Pattern tree), that captures database information using a single-pass of database and provides efficient mining performance using a pattern growth mining approach. Extensive experimental results show that our approach outperforms the existing WIP algorithm. Moreover, it is very efficient and scalable for weighted interesting pattern mining with a single database scan.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Agrawal, R., Imielinski, T., Swami, A.: Mining association rules between sets of items in large databases. In: ACM SIGMOD, pp. 207–216 (1993)
Agrawal, R., Srikant, R.: Fast Algorithms for Mining Association Rules. In: 20th Int. Conf. on Very Large Data Bases (VLDB), pp. 487–499 (1994)
Yun, U., Leggett, J.J.: WFIM: weighted frequent itemset mining with a weight range and a minimum weight. In: Fourth SIAM Int. Conf. on Data Mining, USA, pp. 636–640 (2005)
Yun, U., Leggett, J.J.: WLPMiner: Weighted frequent pattern mining with length decreasing support constraints. In: 9th Pacific-Asia Conf. on Knowledge Discovery and Data Mining (PAKDD), Vietnam, pp. 555–567 (2005)
Yun, U.: Efficient Mining of weighted interesting patterns with a strong weight and/or support affinity. Information Sciences 177, 3477–3499 (2007)
Cai, C.H., Fu, A.W., Cheng, C.H., Kwong, W.W.: Mining association rules with weighted items. In: Int. Database Engineering and Applications Symposium, IDEAS, pp. 68–77 (1998)
Tao, F.: Weighted association rule mining using weighted support and significant framework. In: 9th ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining, USA, pp. 661–666 (2003)
Wang, W., Yang, J., Yu, P.S.: WAR: weighted association rules for item intensities. Knowledge Information and Systems 6, 203–229 (2004)
Han, J., Pei, J., Yin, Y., Mao, R.: Mining frequent patterns without candidate generation: a frequent-pattern tree approach. Data Mining and Knowledge Discovery 8, 53–87 (2004)
Grahne, G., Zhu, J.: Fast Algorithms for frequent itemset mining using FP-Trees. IEEE Transactions on Knowledge and Data Engineering 17(10), 1347–1362 (2005)
Han, J., Cheng, H., Xin, D., Yan, X.: Frequent pattern mining: current status and future directions. Data Mining and Knowledge Discovery 15, 55–86 (2007)
Raissi, C., Poncelet, P., Teisseire, M.: Towards a new approach for mining frequent itemsets on data stream. Journal of Intelligent Information Systems 28, 23–36 (2007)
Jiang, N., Gruenwald, L.: Research Issues in Data Stream Association Rule Mining. SIGMOD Record 35(1), 14–19 (2006)
Metwally, A., Agrawal, D., Abbadi, A.E.: An Integrated Efficient Solution for Computing Frequent and Top-k Elements in Data Streams. ACM Transactions on Database Systems (TODS) 31(3), 1095–1133 (2006)
Leung, C.K.-S., Khan, Q.I.: DSTree: A Tree structure for the mining of frequent Sets from Data Streams. In: 6th IEEE Int. Conf. on Data Mining (ICDM), pp. 928–932 (2006)
Koh, J.-L., Shieh, S.-F.: An Efficient Approach for Maintaining Association Rules Based on Adjusting FP-tree Structures. In: Lee, Y., Li, J., Whang, K.-Y., Lee, D. (eds.) DASFAA 2004. LNCS, vol. 2973, pp. 417–424. Springer, Heidelberg (2004)
Li, X., Deng, Z.-H., Tang, S.: A fast algorithm for maintenance of association rules in incremental databases. In: Li, X., Zaïane, O.R., Li, Z. (eds.) ADMA 2006. LNCS (LNAI), vol. 4093, pp. 56–63. Springer, Heidelberg (2006)
Frequent itemset mining dataset repository, http://fimi.cs.helsinki.fi/data/
UCI machine learning repository, http://kdd.ics.uci.edu/
Xiong, H., Tan, P.-N., Kumar, V.: Hyperclique Pattern Discovery. Data Mining and Knowledge Discovery 13, 219–242 (2006)
Leung, C.K.-S., Khan, Q.I., Li, Z., Hoque, T.: CanTree: a canonical-order tree for incremental frequent-pattern mining. Knowledge and Information Systems 11(3), 287–311 (2007)
Yun, U.: Mining lossless closed frequent patterns with weight constraints. Knowledge-Based Systems 210, 86–97 (2007)
Tanbeer, S.K., Ahmed, C.F., Jeong, B.-S., Lee, Y.-K.: CP-tree: A tree structure for single-pass frequent pattern mining. In: Washio, T., Suzuki, E., Ting, K.M., Inokuchi, A. (eds.) PAKDD 2008. LNCS (LNAI), vol. 5012, pp. 1022–1027. Springer, Heidelberg (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ahmed, C.F., Tanbeer, S.K., Jeong, BS., Lee, YK. (2008). Efficient Single-Pass Mining of Weighted Interesting Patterns. In: Wobcke, W., Zhang, M. (eds) AI 2008: Advances in Artificial Intelligence. AI 2008. Lecture Notes in Computer Science(), vol 5360. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-89378-3_41
Download citation
DOI: https://doi.org/10.1007/978-3-540-89378-3_41
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-89377-6
Online ISBN: 978-3-540-89378-3
eBook Packages: Computer ScienceComputer Science (R0)