SuffixMiner: Efficiently Mining Frequent Itemsets in Data Streams by Suffix-Forest

Jia, Lifeng; Zhou, Chunguang; Wang, Zhe; Xu, Xiujuan

doi:10.1007/11540007_72

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 3614)

Included in the following conference series:

International Conference on Fuzzy Systems and Knowledge Discovery

Abstract

We propose SuffixMiner, a new algorithm for finding all frequent itemsets in data streams. SuffixMiner eliminates the need for multiple passes over the data, exploits the special property of the suffix-tree to avoid both generating candidate itemsets and traversing each suffix-tree during itemset growth, and uses a new itemset growth method to mine all frequent itemsets. Experimental results show that SuffixMiner not only scales well when mining frequent itemsets over data streams, but also outperforms the Apriori and FP-Growth algorithms.
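To make the single-pass idea concrete, here is a minimal sketch of counting itemset support with a forest of suffix tries. It is not the SuffixMiner algorithm itself, whose details are not given in this abstract. The class names `Node` and `SuffixForest`, the lexicographic item order, and the query-based `support` method are all assumptions made for illustration.

```python
class Node:
    """One trie node: how many inserted suffixes pass through it."""
    __slots__ = ("count", "children")

    def __init__(self):
        self.count = 0
        self.children = {}


class SuffixForest:
    """Hypothetical suffix forest: one trie per item that starts a suffix."""

    def __init__(self):
        self.roots = {}           # item -> root Node of that item's trie
        self.n_transactions = 0

    def add(self, transaction):
        """Insert every suffix of the sorted transaction (one pass per transaction)."""
        items = sorted(set(transaction))
        self.n_transactions += 1
        for start in range(len(items)):
            node = self.roots.setdefault(items[start], Node())
            node.count += 1
            for item in items[start + 1:]:
                node = node.children.setdefault(item, Node())
                node.count += 1

    def support(self, itemset):
        """Number of transactions seen so far that contain every item in itemset."""
        target = sorted(set(itemset))
        if not target:
            return self.n_transactions
        root = self.roots.get(target[0])
        if root is None:
            return 0
        return self._count(root, target, 1)

    def _count(self, node, target, i):
        # All target items matched: every suffix through this node qualifies.
        if i == len(target):
            return node.count
        total = 0
        for item, child in node.children.items():
            if item == target[i]:
                total += self._count(child, target, i + 1)
            elif item < target[i]:
                # Skip an item that is not part of the query.
                total += self._count(child, target, i)
            # item > target[i]: target[i] cannot appear deeper in a sorted path.
        return total


forest = SuffixForest()
for t in (["a", "b", "c"], ["a", "c"], ["b", "c"], ["a", "b"]):
    forest.add(t)              # each transaction is read exactly once
print(forest.support(["a", "c"]))
```

Each transaction is consumed once as it arrives, so no second pass over the stream is needed. The sketch stores every suffix explicitly, which costs quadratic work per transaction; a practical miner would need pruning or compression that this illustration omits.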

This work was supported by the Natural Science Foundation of China (Grant No. 60433020) and the Key Science-Technology Project of the National Education Ministry of China (Grant No. 02090).

Author information

Authors and Affiliations

Key Laboratory of Symbol Computation and Knowledge Engineering of the Ministry of Education, College of Computer Science, Jilin University, Changchun, 130012, China
Lifeng Jia, Chunguang Zhou, Zhe Wang & Xiujuan Xu

Editor information

Editors and Affiliations

School of Electrical and Electronic Engineering, Nanyang Technological University, Block S1, Nanyang Avenue, 639798, Singapore
Lipo Wang
Honda Research Institute Europe GmbH, Offenbach/Main, Germany
Yaochu Jin

Cite this paper

Jia, L., Zhou, C., Wang, Z., Xu, X. (2005). SuffixMiner: Efficiently Mining Frequent Itemsets in Data Streams by Suffix-Forest. In: Wang, L., Jin, Y. (eds) Fuzzy Systems and Knowledge Discovery. FSKD 2005. Lecture Notes in Computer Science, vol 3614. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11540007_72

DOI: https://doi.org/10.1007/11540007_72
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28331-7
Online ISBN: 978-3-540-31828-6
eBook Packages: Computer Science, Computer Science (R0)
