MMFI_DSSW – A New Method to Incrementally Mine Maximal Frequent Itemsets in Transaction Sensitive Sliding Window

Feng, Jiayin; Ren, Jiadong

doi:10.1007/978-3-540-76719-0_47


Part of the book series: Lecture Notes in Computer Science (LNAI, volume 4798)

Included in the following conference series:

International Conference on Knowledge Science, Engineering and Management


Abstract

Because streaming data are unbounded in length and change rapidly over time, limiting memory usage is essential when mining data streams. The maximal frequent itemsets are a subset of the frequent itemsets; they represent the important information in the frequent itemsets at low computational cost. In this paper, we propose an algorithm, MMFI_DSSW (Mining Maximal Frequent Itemsets in Data Streams Sliding Window), that mines maximal frequent itemsets in a sliding window using a novel summary data structure, MFI_BVT (Maximal Frequent Itemsets Binary Vector Table). MFI_BVT first builds a binary vector for each itemset. MMFI_DSSW then performs logical AND operations on the vectors in MFI_BVT to mine all maximal frequent itemsets in a single-pass scan of the incoming data. Finally, the mining result can be updated incrementally. Experiments show that MMFI_DSSW is efficient and scalable in both memory usage and CPU running time.
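The abstract describes the approach only at a high level, so the following is a minimal sketch of the general idea rather than the authors' MFI_BVT structure or the MMFI_DSSW algorithm itself. It assumes one bit vector per item (stored as a Python integer, one bit per transaction in the window), a window that slides by one transaction at a time, and a simple level-wise search for maximal itemsets. The class name `BitVectorWindow` and its methods are hypothetical names introduced for illustration.

```python
from itertools import combinations


class BitVectorWindow:
    """Hypothetical sliding window keeping one bit vector per item.

    Bit 0 marks the newest transaction; bits shifted past the window
    size are discarded, which removes the oldest transaction.
    """

    def __init__(self, window_size):
        self.mask = (1 << window_size) - 1  # keeps only the last window_size bits
        self.vectors = {}                   # item -> int used as a bit vector

    def add(self, transaction):
        # Slide the window: shift every vector and drop the oldest bit.
        for item in list(self.vectors):
            shifted = (self.vectors[item] << 1) & self.mask
            if shifted:
                self.vectors[item] = shifted
            else:
                del self.vectors[item]      # item no longer occurs in the window
        # Record the new transaction in the lowest bit.
        for item in set(transaction):
            self.vectors[item] = self.vectors.get(item, 0) | 1

    def support(self, itemset):
        # Logical AND of the item vectors; set bits are transactions
        # in the window that contain every item of the itemset.
        acc = self.mask
        for item in itemset:
            acc &= self.vectors.get(item, 0)
        return bin(acc).count("1")

    def maximal_frequent(self, min_support):
        # Level-wise search (an assumption of this sketch): grow frequent
        # k-itemsets into (k+1)-candidates, then keep only those frequent
        # itemsets that have no frequent proper superset.
        frequent = []
        level = [frozenset([i]) for i in self.vectors
                 if self.support([i]) >= min_support]
        while level:
            frequent.extend(level)
            candidates = {a | b for a, b in combinations(level, 2)
                          if len(a | b) == len(a) + 1}
            level = [c for c in candidates if self.support(c) >= min_support]
        return [s for s in frequent if not any(s < t for t in frequent)]


window = BitVectorWindow(window_size=3)
for t in (["a", "b", "c"], ["a", "b"], ["a", "c"]):
    window.add(t)
print(window.maximal_frequent(min_support=2))
```

Each new transaction costs one shift per tracked item, and the support of any itemset comes from AND operations on the stored vectors, so the raw transactions never need to be rescanned. This sketch recomputes the maximal itemsets from scratch on each call; it does not attempt the incremental result update that the abstract mentions.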



Author information

Authors and Affiliations

College of Information Science and Engineering, Yanshan University, QinHuangdao 066004, China
Jiayin Feng & Jiadong Ren


Editor information

Zili Zhang, Jörg Siekmann


About this paper

Cite this paper

Feng, J., Ren, J. (2007). MMFI_DSSW – A New Method to Incrementally Mine Maximal Frequent Itemsets in Transaction Sensitive Sliding Window. In: Zhang, Z., Siekmann, J. (eds) Knowledge Science, Engineering and Management. KSEM 2007. Lecture Notes in Computer Science, vol 4798. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-76719-0_47


DOI: https://doi.org/10.1007/978-3-540-76719-0_47
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-76718-3
Online ISBN: 978-3-540-76719-0
eBook Packages: Computer Science, Computer Science (R0)
