Abstract
In this paper, we propose an efficient algorithm which generates frequent itemsets by only one database scan. A frequent itemset is a set of common items that are included in at least as many transactions as a given minimum support. While scanning the database of transactions, our algorithm generates a table having 1-frequent items and a list of transactions per each 1-frequent item, and generates 2-frequent itemsets by using a hash technique. k(k≥3)-frequent itemsets can be simply found by checking whether for all (k–1)-frequent itemsets used to generate a k-candidate itemset, the number of common transactions in their lists is greater than or equal to the minimum support. The experimental analysis of our algorithm has shown that it can generate frequent itemsets more efficiently than FP-growth algorithm.
This work was supported by Institute of Information Assessment(ITRC).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Adrians, P., Zantige, D.: Data Mining. Addison-Wesley, Reading (1996)
Agrawal, R., Aggarwal, C., Prasad, V.V.V.: A tree projection algorithm for generation of frequent itemsets. J. Parallel and Distributed Computing (2000)
Agrawal, R., Srikant, R.: Fast algorithms for mining association rules. In: VLDB, pp. 487–499 (1994)
Berry, M.J.A., Linoff, G.: Data Mining Techniques-For marketing, Sales, and Customer Support. Wiley Computer Publishing, Chichester (1997)
Grahne, G., Lakshmanan, L., Wang, X.: Efficient mining of constrained correlated sets. In: ICDE (2000)
Han, J., Pei, J., Yin, Y.: Mining frequent patterns without candidate generation. In: ACM SIGMOD, pp. 1–12 (2000)
Lent, B., Swami, A., Widom, J.: Clustering association rules. In: ICDE, pp. 220–231 (1997)
Liu, B., Hsu, W., Ma, Y.: Mining association rules with multiple minimum supports. In: ACM SIGKDD, pp. 337–341 (1999)
Ng, R., Lakshmanan, L.V.S., Han, J., Pang, A.: Exploratory mining and pruning optimizations of constrained associations rules. In: SIGMOD, pp. 13–24 (1998)
Park, J.S., Chen, M.S., Yu, P.S.: An effective hash-based algorithm for mining association rules. In: ACM SIGMOD, pp. 175–186 (1995)
Simoudis, E.: Reality Check for Data Mining. IEEE Expert: Intelligent Systems and Their Applications 11(5) (October 1996)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Chai, D., Choi, H., Hwang, B. (2005). Discovering Frequent Itemsets Using Transaction Identifiers. In: Wang, L., Jin, Y. (eds) Fuzzy Systems and Knowledge Discovery. FSKD 2005. Lecture Notes in Computer Science(), vol 3613. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11539506_147
Download citation
DOI: https://doi.org/10.1007/11539506_147
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28312-6
Online ISBN: 978-3-540-31830-9
eBook Packages: Computer ScienceComputer Science (R0)