The Item-Set Tree: A Data Structure for Data Mining

Hafez, Alaaeldin; Deogun, Jitender; Raghavan, Vijay V.

doi:10.1007/3-540-48298-9_20

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 1676)

Included in the following conference series:

International Conference on Data Warehousing and Knowledge Discovery

Abstract

Enhancements in data capturing technology have led to exponential growth in the amount of data stored in information systems. This growth has in turn motivated researchers to seek new techniques for extracting knowledge that is implicit or hidden in the data. In this paper, we motivate the need for an incremental data mining approach based on a data structure called the item-set tree. The proposed approach is shown to be effective for solving problems related to the efficiency of handling data updates, the accuracy of data mining results, processing input transactions, and answering user queries. We present efficient algorithms to insert transactions into the item-set tree and to count frequencies of itemsets for queries about the strength of association among items. We prove that the expected complexity of inserting a transaction is ≈ O(1), and that of frequency counting is O(n), where n is the cardinality of the domain of items.
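The abstract does not spell out the insertion and counting algorithms, so the following Python sketch is only an illustration of the general idea, not the authors' item-set tree. It assumes a plain prefix tree over lexically sorted transactions in which every node stores how many transactions pass through it. The names `Node`, `PrefixItemsetTree`, `insert`, and `support` are hypothetical and chosen for this example.

```python
from dataclasses import dataclass, field
from typing import Dict, Iterable, Tuple


@dataclass
class Node:
    # Number of inserted transactions whose sorted form passes through this node.
    count: int = 0
    children: Dict[int, "Node"] = field(default_factory=dict)


class PrefixItemsetTree:
    """Simplified stand-in for an item-set tree (assumption: plain prefix tree)."""

    def __init__(self) -> None:
        self.root = Node()

    def insert(self, transaction: Iterable[int]) -> None:
        # Incremental update: touch one node per distinct item in the transaction.
        node = self.root
        node.count += 1
        for item in sorted(set(transaction)):
            node = node.children.setdefault(item, Node())
            node.count += 1

    def support(self, itemset: Iterable[int]) -> int:
        # Count the transactions that contain every item of `itemset`.
        target = tuple(sorted(set(itemset)))
        if not target:
            return self.root.count
        return self._count(self.root, target)

    def _count(self, node: Node, target: Tuple[int, ...]) -> int:
        total = 0
        first = target[0]
        for item, child in node.children.items():
            if item > first:
                # Items increase along every path, so `first` cannot occur below.
                continue
            if item == first:
                rest = target[1:]
                total += child.count if not rest else self._count(child, rest)
            else:
                total += self._count(child, target)
        return total


tree = PrefixItemsetTree()
for transaction in ([1, 2, 3], [1, 3], [2, 3], [1, 2]):
    tree.insert(transaction)

print(tree.support([1, 3]))  # transactions containing both item 1 and item 3
```

In this sketch a query prunes every branch whose next item is already larger than the smallest item still being searched for, which is what the sorted order of the paths makes possible. The complexity bounds quoted in the abstract refer to the authors' structure and are not claimed for this simplified version.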

This research was supported in part by the U.S. Department of Energy, Grant No. DE-FG02-97ER1220, and by the Army Research Office, Grant No. DAAH04-96-1-0325, under the DEPSCoR program of the Advanced Research Projects Agency, Department of Defense.

Author information

Authors and Affiliations

The Center for Advanced Computer Studies, University of SW Louisiana, Lafayette, LA, 70504, USA
Alaaeldin Hafez & Vijay V. Raghavan
The Department of Computer Science, University of Nebraska, Lincoln, NE, 68588, USA
Jitender Deogun

Editor information

Editors and Affiliations

School of Computer and Information Science, University of South Australia, The Levels, Adelaide, Australia, 05
Mukesh Mohania
IFS, Technical University of Vienna, Resselgasse 3, A-1040, Vienna, Austria
A Min Tjoa

About this paper

Cite this paper

Hafez, A., Deogun, J., Raghavan, V.V. (1999). The Item-Set Tree: A Data Structure for Data Mining. In: Mohania, M., Tjoa, A.M. (eds) Data Warehousing and Knowledge Discovery. DaWaK 1999. Lecture Notes in Computer Science, vol 1676. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48298-9_20

DOI: https://doi.org/10.1007/3-540-48298-9_20
Published: 01 March 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-66458-1
Online ISBN: 978-3-540-48298-7
eBook Packages: Springer Book Archive
