Discovering Frequent Itemsets Using Transaction Identifiers

Chai, Duckjin; Choi, Heeyoung; Hwang, Buhyun

doi:10.1007/11539506_147

Duckjin Chai²⁰,
Heeyoung Choi²⁰ &
Buhyun Hwang²⁰

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3613))

Included in the following conference series:

International Conference on Fuzzy Systems and Knowledge Discovery

1450 Accesses

Abstract

In this paper, we propose an efficient algorithm which generates frequent itemsets by only one database scan. A frequent itemset is a set of common items that are included in at least as many transactions as a given minimum support. While scanning the database of transactions, our algorithm generates a table having 1-frequent items and a list of transactions per each 1-frequent item, and generates 2-frequent itemsets by using a hash technique. k(k≥3)-frequent itemsets can be simply found by checking whether for all (k–1)-frequent itemsets used to generate a k-candidate itemset, the number of common transactions in their lists is greater than or equal to the minimum support. The experimental analysis of our algorithm has shown that it can generate frequent itemsets more efficiently than FP-growth algorithm.

This work was supported by Institute of Information Assessment(ITRC).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 119.00; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Adrians, P., Zantige, D.: Data Mining. Addison-Wesley, Reading (1996)
Google Scholar
Agrawal, R., Aggarwal, C., Prasad, V.V.V.: A tree projection algorithm for generation of frequent itemsets. J. Parallel and Distributed Computing (2000)
Google Scholar
Agrawal, R., Srikant, R.: Fast algorithms for mining association rules. In: VLDB, pp. 487–499 (1994)
Google Scholar
Berry, M.J.A., Linoff, G.: Data Mining Techniques-For marketing, Sales, and Customer Support. Wiley Computer Publishing, Chichester (1997)
Google Scholar
Grahne, G., Lakshmanan, L., Wang, X.: Efficient mining of constrained correlated sets. In: ICDE (2000)
Google Scholar
Han, J., Pei, J., Yin, Y.: Mining frequent patterns without candidate generation. In: ACM SIGMOD, pp. 1–12 (2000)
Google Scholar
Lent, B., Swami, A., Widom, J.: Clustering association rules. In: ICDE, pp. 220–231 (1997)
Google Scholar
Liu, B., Hsu, W., Ma, Y.: Mining association rules with multiple minimum supports. In: ACM SIGKDD, pp. 337–341 (1999)
Google Scholar
Ng, R., Lakshmanan, L.V.S., Han, J., Pang, A.: Exploratory mining and pruning optimizations of constrained associations rules. In: SIGMOD, pp. 13–24 (1998)
Google Scholar
Park, J.S., Chen, M.S., Yu, P.S.: An effective hash-based algorithm for mining association rules. In: ACM SIGMOD, pp. 175–186 (1995)
Google Scholar
Simoudis, E.: Reality Check for Data Mining. IEEE Expert: Intelligent Systems and Their Applications 11(5) (October 1996)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, Chonnam National University, 300 Yongbong-dong, Kwangju, Korea
Duckjin Chai, Heeyoung Choi & Buhyun Hwang

Authors

Duckjin Chai
View author publications
You can also search for this author in PubMed Google Scholar
Heeyoung Choi
View author publications
You can also search for this author in PubMed Google Scholar
Buhyun Hwang
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Electrical and Electronic Engineering, Nanyang Technological University, Block S1, Nanyang Avenue, 639798, Singapore
Lipo Wang
Honda Research Institute Europe GmbH, Offenbach/Main, Germany
Yaochu Jin

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Chai, D., Choi, H., Hwang, B. (2005). Discovering Frequent Itemsets Using Transaction Identifiers. In: Wang, L., Jin, Y. (eds) Fuzzy Systems and Knowledge Discovery. FSKD 2005. Lecture Notes in Computer Science(), vol 3613. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11539506_147

Download citation

DOI: https://doi.org/10.1007/11539506_147
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28312-6
Online ISBN: 978-3-540-31830-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics