Abstract
A data stream is a massive unbounded sequence of data elements continuously generated at a rapid rate. Data mining over data streams should support the flexible trade-off between processing time and mining accuracy. This should occur without a fixed granule of data mining to catch the sensitive change of its mining results as soon as possible. The continuous characteristic of streaming data necessitates the use of algorithms that require only one scan over the stream for knowledge discovery. This paper focuses on research issues concerning mining frequent itemsets in data streams and presents an efficient algorithm WSFI(Weighted Support Frequent Itemsets)-mine to mine all frequent itemsets by one scan from the data stream. WSFI-mine’s novel contribution is to effectively execute frequent patterns by generating constraint candidate item sets and extended FPtree-based compact pattern representation under window sliding of the data stream. This method can be achieved effectively with less memory and lowered execution time.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Chang, J., Lee, W.: A Sliding Window Method for Finding Recently Frequent Itemsets over Online Data Streams. Journal of Information Science and Engineering 20(4) (July 2004)
Manku, G.S., Motwani, R.: Approximate Frequency Counts Over Data Streams. In: Proceedings of the 28th International Conference on Very Large Data Bases, pp. 346–357 (2002)
Agrawal, R., Srikant, R.: Fast Algorithms for Mining Association Rules. In: Conf. of the 20th VLDB conference, pp. 487–499 (1994)
Li., H.F., Lee, S.Y., Shan, M.K.: An Efficient Algorithm for Mining Frequent Itemsets over the Entire History of Data Streams. In: Proceedings of First International Workshop on Knowledge Discovery in Data Streams 9IWKDDS (2004)
Li., H.F., Lee, S.Y., Shan, M.K.: Online Mining (Recently) Maximal Frequent Itemsets over Data Streams. In: Proceedings of the 15th IEEE International Workshop on Research Issues on Data Engineering, RIDE (2005)
Chi, Y., Wang, H., Yu, P.S., Muntz, R.R.: Moment: Maintaining Closed Frequent Itemsets over a Stream Sliding Window. In: Proceedings of the Fourth IEEE International Conference on Data Mining (ICDM 2004) (2004)
Lee, C.H., Lin, C.R., Chen, M.S.: Sliding Window Filtering: An Efficient Method for Incremental Mining on a Time-variant Database. Information Systems 30, 227–244 (2005)
Lin, C.H., Chiu, D.Y., Wu, Y.H., Chen, A.L.P.: Mining Frequent Itemsets from Data Streams with a Time-sensitive Sliding Window. In: Proc. SIAM Int’l. Conference on Data Mining, pp. 68–79. SIAM, Philadelphia (2005)
Han, J., Pei, J., Yin, Y.: Mining Frequent Patterns without Candidate Generation. In: Proc. of 2000 ACM SIGMOD, pp. 1–12 (2000)
Li, H.F., Lee, S.Y.: Mining Frequent Itemsets over Data Streams using Efficient Window Sliding Techniques. Expert Systems with Applications (2008)
Li, H.F., Ho, C.C., Shan, M.K., Lee, S.Y.: Efficient Maintenance and Mining of Frequent Itemsets over Online Data Streams with a Sliding Window. In: IEEE SMC 2006 (2006)
Chu, C.J., Tseng, V.S., Liang, T.: An Efficient Algorithm for Mining Temporal High Utility Itemsets from Data Streams. The Journal of System and Software 81, 1105–1117 (2008)
Guo, Y., et al.: A FP-tree based method for inverse frequent set mining. In: Bell, D.A., Hong, J. (eds.) BNCOD 2006. LNCS, vol. 4042, pp. 152–163. Springer, Heidelberg (2006)
Leung, C.K.S., et al.: A tree-based approach for frequent pattern mining from uncertain data. In: Washio, T., Suzuki, E., Ting, K.M., Inokuchi, A. (eds.) PAKDD 2008. LNCS (LNAI), vol. 5012, pp. 653–661. Springer, Heidelberg (2008)
Leung, C.K.S., et al.: CanTree: a canonical-order tree for incremental frequent-pattern mining. KAIS 11(3), 287–311 (2007)
Han, J., Pei, J., Yin, Y., Mao, R.: Mining Frequent Patterns without Candidate Generation: A Frequent-Pattern Tree Approach. Data Mining and Knowledge Discovery 8, 53–87 (2004)
Zhu, X.D., Huang, Z.Q.: Conceptual modeling rules extracting for data streams. Knowledge-Based Systems, 1–7 (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kim, Y., Kim, U. (2009). WSFI-Mine: Mining Frequent Patterns in Data Streams. In: Yu, W., He, H., Zhang, N. (eds) Advances in Neural Networks – ISNN 2009. ISNN 2009. Lecture Notes in Computer Science, vol 5552. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-01510-6_95
Download citation
DOI: https://doi.org/10.1007/978-3-642-01510-6_95
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-01509-0
Online ISBN: 978-3-642-01510-6
eBook Packages: Computer ScienceComputer Science (R0)