Skip to main content

The Item-Set Tree: A Data Structure for Data Mining

  • Conference paper
  • First Online:
Book cover DataWarehousing and Knowledge Discovery (DaWaK 1999)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1676))

Included in the following conference series:

Abstract

Enhancements in data capturing technology have lead to exponential growth in amounts of data being stored in information systems. This growth in turn has motivated researchers to seek new techniques for extraction of knowledge implicit or hidden in the data. In this paper, we motivate the need for an incremental data mining approach based on data structure called the itemset tree. The motivated approach is shown to be effective for solving problems related to efficiency of handling data updates, accuracy of data mining results, processing input transactions, and answering user queries. We present efficient algorithms to insert transactions into the item-set tree and to count frequencies of itemsets for queries about strength of association among items. We prove that the expected complexity of inserting a transaction is ≈ O(1), and that of frequency counting is O(n), where n is the cardinality of the domain of items.

This research was supported in part by the U.S. Department of Energy, Grant No. DE-FG02- 97ER1220, and by the Army Research Office, Grant No. DAAH04-96-1-0325, under DEPSCoR program of Advanced Research Projects Agency, Department of Defense.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. R. Agrawal, T. Imilienski, and A. Swami, “Mining Association Rules between Sets of Items in Large Databases,” Proc. of the ACM SIGMOD Int’l Conf. On Management of data, May 1993.

    Google Scholar 

  2. R. Agrawal, and R. Srikant, “Fast Algorithms for Mining Association Rules,” Proc. Of the 20th VLDB Conference, Santiago, Chile, 1994.

    Google Scholar 

  3. R. Agrawal, J. Shafer, “Parallel Mining of Association Rules,” IEEE Transactions on Knowledge and Data Engineering, Vol. 8, No. 6, Dec. 1996.

    Google Scholar 

  4. C. Agrawal, and P. Yu, “Mining Large Itemsets for Association Rules,” Bulletin of the IEEE Computer Society Technical Committee on Data Engineering, 1997.

    Google Scholar 

  5. S. Brin, R. Motwani, J. Ullman, and S. Tsur, “Dynamic Itemset Counting and Implication Rules for Market Basket Data,” SIGMOD Record (SCM Special Interset Group on Management of Data), 26,2, 1997.

    Google Scholar 

  6. S. Chaudhuri, “Data Mining and Database Systems: Where is the Intersection,” Bulletin of the IEEE Computer Society Technical Committee on Data Engineering, 1997.

    Google Scholar 

  7. H. Mannila, H. Toivonen, and A. Verkamo, “Efficient Algorithms for Discovering Association Rules,” AAAI Workshop on Knowledge Discovery in databases (KDD-94), July 1994.

    Google Scholar 

  8. M. Zaki, S. Parthasarathy, M. Ogihara, and W. Li, “ New Algorithms for Fast Discovery of Association Rules,” Proc. Of the 3rd Int’l Conf. On Knowledge Discovery and data Mining (KDD-97), AAAI Press, 1997.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1999 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Hafez, A., Deogun, J., Raghavan, V.V. (1999). The Item-Set Tree: A Data Structure for Data Mining. In: Mohania, M., Tjoa, A.M. (eds) DataWarehousing and Knowledge Discovery. DaWaK 1999. Lecture Notes in Computer Science, vol 1676. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48298-9_20

Download citation

  • DOI: https://doi.org/10.1007/3-540-48298-9_20

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-66458-1

  • Online ISBN: 978-3-540-48298-7

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics