An Extension to SQL for Mining Association Rules

Meo, Rosa; Psaila, Giuseppe; Ceri, Stefano

doi:10.1023/A:1009774406717

Published: June 1998

Volume 2, pages 195–224 (1998)
Data Mining and Knowledge Discovery

Abstract

Data mining evolved as a collection of application problems, each rather specific, together with efficient algorithms for solving them, all focused on discovering relevant information hidden in very large databases. One of the most investigated topics is the discovery of association rules.

This work proposes a unifying model that enables a uniform description of the problem of discovering association rules. The model provides a SQL-like operator, named MINE RULE, which can express all the association-rule mining problems presented so far in the literature. We demonstrate the expressive power of the new operator through several examples: some are classical, while others are original and correspond to novel and unusual applications. We also present the operational semantics of the operator by means of an extended relational algebra.
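
The abstract does not spell out what an association rule is or how the operator evaluates one. As a rough illustration of the underlying problem only, and not of the authors' operator or its relational-algebra semantics, the sketch below searches for rules of the form body => head over grouped data, using the usual support and confidence thresholds from the association-rule literature. The function name `mine_rules`, the sample `purchases` data, the single-item head, and the brute-force search are all assumptions made for this example.

```python
from itertools import combinations


def mine_rules(groups, min_support, min_confidence):
    """Brute-force search for rules body => head with a single-item head.

    Hypothetical helper for illustration; not the operator from the paper.
    groups: iterable of item collections (e.g. one set per transaction).
    Returns a list of (body, head, support, confidence) tuples.
    """
    groups = [frozenset(g) for g in groups]
    n = len(groups)
    items = sorted(set().union(*groups))

    def support(itemset):
        # Fraction of groups that contain every item of the itemset.
        return sum(1 for g in groups if itemset <= g) / n

    rules = []
    for size in range(1, len(items)):
        for body in combinations(items, size):
            body_set = frozenset(body)
            body_support = support(body_set)
            # Skip bodies that never occur or are too rare.
            if body_support == 0 or body_support < min_support:
                continue
            for head in items:
                if head in body_set:
                    continue
                rule_support = support(body_set | {head})
                if rule_support < min_support:
                    continue
                # Confidence: how often the head appears when the body does.
                confidence = rule_support / body_support
                if confidence >= min_confidence:
                    rules.append((body, head, rule_support, confidence))
    return rules


# Assumed sample data: four purchases, each a set of items.
purchases = [
    {"boots", "jacket"},
    {"boots", "jacket", "socks"},
    {"boots", "socks"},
    {"jacket"},
]
rules = mine_rules(purchases, min_support=0.5, min_confidence=0.6)
for body, head, sup, conf in rules:
    print(f"{set(body)} => {head}  support={sup:.2f} confidence={conf:.2f}")
```

The exhaustive enumeration is chosen for readability; it is exponential in the number of items and is not one of the efficient algorithms the abstract alludes to.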

Author information

Authors and Affiliations

Dip. Automatica e Informatica, Politecnico di Torino, C.so Duca degli Abruzzi 24, 10129 Torino, Italy
Rosa Meo, Giuseppe Psaila & Stefano Ceri

About this article

Meo, R., Psaila, G. & Ceri, S. An Extension to SQL for Mining Association Rules. Data Mining and Knowledge Discovery 2, 195–224 (1998). https://doi.org/10.1023/A:1009774406717
