Mining Recent Frequent Itemsets in Data Streams by Radioactively Attenuating Strategy

Jia, Lifeng; Wang, Zhe; Zhou, Chunguang; Xu, Xiujuan

doi:10.1007/11527503_95

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 3584)

Included in the following conference series:

International Conference on Advanced Data Mining and Applications

Abstract

We propose a novel approach for mining recent frequent itemsets in data streams. The approach makes three key contributions. First, it is a single-scan algorithm that exploits a special property of suffix trees to guarantee that all frequent itemsets are mined. The suffix trees serve as the data structure that stores summary information about the data, and they do not need to be traversed during the itemset growth phase. Second, the algorithm adopts a novel method for itemset growth, comprising two special kinds of growth operations, that avoids generating any candidate itemsets. Third, we devise a new regressive strategy modeled on the natural attenuation of radioactive elements and apply it in the algorithm to distinguish the influence of the latest transactions from that of obsolete ones. Detailed experiments evaluating the algorithm confirm that the new method scales well and delivers better quality and efficiency.
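The abstract does not give the attenuation formula, so the following is only a minimal sketch of the general idea behind the third contribution: weighting each transaction by an exponential, half-life-style decay so that recent transactions count more than obsolete ones. It is not the authors' algorithm. It uses a plain dictionary rather than suffix trees, and it enumerates small subsets of each transaction, which the paper's growth method specifically avoids. The class name DecayedItemsetCounter and the parameters half_life and max_size are hypothetical and chosen for illustration.

```python
from collections import defaultdict
from itertools import combinations


class DecayedItemsetCounter:
    """Illustrative decayed support counter (assumption: exponential decay,
    measured in number of transactions; not the paper's actual strategy)."""

    def __init__(self, half_life, max_size=2):
        # A transaction's weight halves every `half_life` later transactions.
        self.decay = 0.5 ** (1.0 / half_life)
        self.max_size = max_size          # largest itemset size tracked
        self.counts = defaultdict(float)  # itemset -> decayed count at last update
        self.last_seen = {}               # itemset -> clock value of last update
        self.total = 0.0                  # decayed number of transactions
        self.clock = 0                    # transactions processed so far

    def add(self, transaction):
        """Process one transaction in a single pass over the stream."""
        self.clock += 1
        self.total = self.total * self.decay + 1.0
        items = sorted(set(transaction))
        for k in range(1, self.max_size + 1):
            for itemset in combinations(items, k):
                # Lazily apply the decay accumulated since the last update.
                age = self.clock - self.last_seen.get(itemset, self.clock)
                self.counts[itemset] = self.counts[itemset] * self.decay ** age + 1.0
                self.last_seen[itemset] = self.clock

    def support(self, itemset):
        """Decayed support of an itemset relative to the decayed stream size."""
        key = tuple(sorted(set(itemset)))
        if key not in self.last_seen:
            return 0.0
        age = self.clock - self.last_seen[key]
        return self.counts[key] * self.decay ** age / self.total

    def frequent(self, min_support):
        """Tracked itemsets whose decayed support reaches min_support."""
        return {k for k in self.last_seen if self.support(k) >= min_support}


counter = DecayedItemsetCounter(half_life=1)
for t in (["a", "b"], ["a"], ["c"]):
    counter.add(t)
recent = counter.frequent(0.4)
```

In this usage the item "c", seen only once but most recently, ends up with a higher decayed support than "b", which was seen once at the start of the stream. A shorter half_life forgets old transactions faster; a longer one approaches ordinary support counting.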

This work was supported by the Natural Science Foundation of China (Grant No. 60433020) and the Key Science-Technology Project of the National Education Ministry of China (Grant No. 02090).

Author information

Authors and Affiliations

College of Computer Science, Jilin University, Key Laboratory of Symbol Computation and Knowledge Engineering of the Ministry of Education, Changchun, 130012, China
Lifeng Jia, Zhe Wang, Chunguang Zhou & Xiujuan Xu

Editor information

Editors and Affiliations

School of Information Technology and Electrical Engineering, The University of Queensland, 4072, Brisbane, Queensland, Australia
Xue Li
The State Key Laboratory for Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, 430072, Wuhan, China
Shuliang Wang
School of ITEE, The Univ of Queensland, St. Lucia, 4072, QLD, Australia
Zhao Yang Dong

About this paper

Cite this paper

Jia, L., Wang, Z., Zhou, C., Xu, X. (2005). Mining Recent Frequent Itemsets in Data Streams by Radioactively Attenuating Strategy. In: Li, X., Wang, S., Dong, Z.Y. (eds) Advanced Data Mining and Applications. ADMA 2005. Lecture Notes in Computer Science, vol 3584. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11527503_95

DOI: https://doi.org/10.1007/11527503_95
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-27894-8
Online ISBN: 978-3-540-31877-4
eBook Packages: Computer Science, Computer Science (R0)
