FIA: Frequent Itemsets Mining Based on Approximate Counting in Data Streams

Kim, Younghee; Ryu, Joonsuk; Kim, Ungmo

doi:10.1007/978-3-642-10677-4_35

Younghee Kim¹⁹,
Joonsuk Ryu¹⁹ &
Ungmo Kim¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 5863))

Included in the following conference series:

International Conference on Neural Information Processing

Abstract

In this paper, we consider the problem of frequent elements over data stream seeks the set of items whose frequency exceeds σN for a given threshold parameter σ. We refer to this model as the sliding window model. We also use a user specified error parameter, ε, to control the accuracy of the mining result. We also propose an FIA (Frequent Itemsets mining based on an Approximate counting) algorithm based on the Chernoff bound with a guarantee of the output quality and also a bound on the memory usage. The proposed algorithm show that runs significantly faster and consumes less memory than do existing algorithms for mining approximate frequent itemsets.

This work was supported by the Korea Science and Engineering Foundation (KOSEF) grant funded by the Korea government(MEST) (No. 2009-0075771).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Frequent Itemset Mining over Data Streams

Mining Data Streams with Dynamic Confidence Intervals

Time-weighted counting for recently frequent pattern mining in data streams

Article 22 March 2017

References

Datar, M., Gionis, A., Indyk, P., Motwani, R.: Maintaining stream statistics over sliding windows. SIAM Journal on Computing 31(6), 1794–1813 (2002)
Article MATH MathSciNet Google Scholar
Manku, G.S., Motwani, R.: Approximate Frequency Counts Over Data Streams. In: Proceedings of the 28^th International Conference on VLDB, pp. 346–357 (2002)
Google Scholar
Yu, J.X., Chong, Z., Lu, H., Zhang, Z., Zhou, A.: False positive or false negative: mining frequent itemsets from high speed transactional data streams. In: Proc, VLDB (2004)
Google Scholar
Chang, J., Lee, W.: A Sliding Window Method for Finding Recently Frequent Itemsets over Online Data Streams. Journal of Information Science and Engineering 20 (2004)
Google Scholar
Lee, C.H., Lin, C.R., Chen, M.S.: Sliding window filtering: An efficient method for incremental mining on a time-variant database. Information Systems 30, 227–244 (2005)
Article Google Scholar
Lin, C.-H., Chiu, D.-Y., Wu, Y.-H., Chen, A.L.P.: Mining frequent itemsets from data streams with a time-sensitive sliding window. In: Proc, SIAM Int’l Conference on Data Mining, pp. 68–79 (2005)
Google Scholar
Giannella, C., Han, J., Pei, J., Yan, X., Yu, P.S.: Mining frequent patterns in data streams at multiple time granularities. In: Data Mining, Next Generation Challenges and Futures Directions, pp. 191–212. AAAI/MIT Press (2004)
Google Scholar
Li, H.F., Lee, S.Y.: Mining frequent itemsets over data streams using efficient window sliding techniques. Expert Systems with Applications (2008)
Google Scholar
Li, H.F., Ho, C.C., Shan, M.K., Lee, S.Y.: Efficient Maintenance and Mining of Frequent Itemsets over Online Data Streams with a Sliding Window. In: IEEE SMC 2006 (2006)
Google Scholar

Download references

Author information

Authors and Affiliations

School of Information and Communication Engineering, Sungkyunkwan University, 300 Chunchun-dong, Suwon, Gyeonggi-Do, 440-746, Republic of Korea
Younghee Kim, Joonsuk Ryu & Ungmo Kim

Authors

Younghee Kim
View author publications
You can also search for this author in PubMed Google Scholar
Joonsuk Ryu
View author publications
You can also search for this author in PubMed Google Scholar
Ungmo Kim
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Electronic Engineering, City University of Hong Kong, Hong Kong,
Chi Sing Leung
School of Electrical Engineering and Computer Science, Kyungpook National University, 1370 Sankyuk-Dong, Puk-Gu, 702-701, Taegu, Korea
Minho Lee
School of Information Technology, King Mongkut’s University of Technology Thonburi, 126 Pracha-U-Thit Rd., Bangmod, Thungkru, 10140, Bangkok, Thailand
Jonathan H. Chan

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kim, Y., Ryu, J., Kim, U. (2009). FIA: Frequent Itemsets Mining Based on Approximate Counting in Data Streams. In: Leung, C.S., Lee, M., Chan, J.H. (eds) Neural Information Processing. ICONIP 2009. Lecture Notes in Computer Science, vol 5863. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-10677-4_35

Download citation

DOI: https://doi.org/10.1007/978-3-642-10677-4_35
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-10676-7
Online ISBN: 978-3-642-10677-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics