Skip to main content

Performance Evaluation and Optimization of Join Queries for Association Rule Mining

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1676))

Abstract

The explosive growth in data collection in business organizations introduces the problem of turning these rapidly expanding data stores into nuggets of actionable knowledge. The state-of-the-art data mining tools available for this integrate loosely with data stored in DMBSSs, typically through a cursor interface. In this paper, we consider several formulations of association rule mining (a typical data mining problem) using SQL-92 queries and study the performance of different join orders and join methods for executing them. We analyze the cost of the different execution plans which provides a basis to incorporate the semantics of association rule mining into future query optimizers. Based on them we identify certain optimizations and develop the Set-oriented Apriori approach. This work is an initial step towards developing “SQL-aware” mining algorithms and exploring the enhancements to current relational DBMSs to make them “mining-aware” thereby bridging the gap between the two.

This work was supported in part by the Office of Naval Research and the Spawar System Center — San Diego, by the Rome Laboratory, DARPA, and the NSF Grant IRI-9528390

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. R. Agrawal, T. Imielinski, and A. Swami. Mining association rules between sets of items in large databases. In Proc. of the ACM SIGMOD Conference, pages 207–216, Washington, D.C., May 1993.

    Google Scholar 

  2. R. Agrawal and K. Shim. Developing tightly-coupled data mining applications on a RDBMS. In Proc. of the KDD Conference, Portland, Oregon, August 1996.

    Google Scholar 

  3. R. Agrawal and R. Srikant. Fast Algorithms for Mining Association Rules. In Proc. of the VLDB Conference, Santiago, Chile, September 1994.

    Google Scholar 

  4. J. Han, Y. Fu, K. Koperski, W. Wang, and 0. Zaiane. DMQL: A data mining query language for relational datbases. In Proc. of the 1996 SIGMOD DMKD workshop, Montreal, Canada, May 1996.

    Google Scholar 

  5. M. Houtsma and A. Swami. Set-oriented mining of association rules. In Int’l Conference on Data Engineering, Taipei, Taiwan, March 1995.

    Google Scholar 

  6. T. Imielinski, A. Virmani, and A. Abdulghani. Discovery Board Application Programming Interface and Query Language for Database Mining. In Proc. of the KDD Conference, Portland, Oregon, August 1996.

    Google Scholar 

  7. R. Meo, G. Psaila, and S. Ceri. A new SQL like operator for mining association rules. In Proc. of the VLDB Conference, Bombay, India, Sep 1996.

    Google Scholar 

  8. PostgreSQL Organization. PostgreSQL 6.3 User Manual, February 1998. http://ww.postgresql.org.

  9. S. Sarawagi, S. Thomas, and R. Agrawal. Integrating Association Rule Mining with Relational Database Systems: Alternatives and Implications. In Proc. of the ACM SIGMOD Confewace, Seattle, Washington, June 1998.

    Google Scholar 

  10. S. TSUT, et al. Query Flocks: A Generalization of Association Rule Mining. In Proc. of the ACM SIGMOD Conference, Seattle, Washington, June 1998.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1999 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Thomas, S., Chakravarthy, S. (1999). Performance Evaluation and Optimization of Join Queries for Association Rule Mining. In: Mohania, M., Tjoa, A.M. (eds) DataWarehousing and Knowledge Discovery. DaWaK 1999. Lecture Notes in Computer Science, vol 1676. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48298-9_26

Download citation

  • DOI: https://doi.org/10.1007/3-540-48298-9_26

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-66458-1

  • Online ISBN: 978-3-540-48298-7

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics