Computation of Mining Queries: An Algebraic Approach

Diop, Cheikh Talibouya; Giacometti, Arnaud; Laurent, Dominique; Spyratos, Nicolas

doi:10.1007/11615576_6

Cheikh Talibouya Diop^21,24,
Arnaud Giacometti²¹,
Dominique Laurent²² &
…
Nicolas Spyratos²³

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3848))

307 Accesses

Abstract

Mining frequent queries often requires the repeated execution of some extraction algorithm for different values of the support, as well as for different source datasets. This is an expensive process, even if we use the best existing algorithms. Hence the need for iterative mining, whereby mining results already obtained are re-used to accelerate subsequent steps in the mining process.

In this paper, we present an approach for the iterative mining of frequent queries. Our approach is based on the notion of mining context, where a mining context is a set of queries over the same schema. We define operations on mining contexts, based on the standard relational algebra, and we also introduce new operators, one of which for computing frequent queries.

We first study the properties of the operators, then we consider particular mining contexts using biases for which frequent queries can be computed using any level-wise algorithm. Iterative mining is obtained by combining these particular contexts using our set of operations. We have implemented our approach and conducted experiments that show its efficiency in mining frequent queries.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Efficiently mining association rules based on maximum single constraints

Article Open access 31 May 2017

Computing Theoretically-Sound Upper Bounds to Expected Support for Frequent Pattern Mining Problems over Uncertain Big Data

Constrained pattern mining in the new era

Article 23 July 2015

References

Agrawal, R., Mannila, H., Srikant, R., Toivonen, H., Verkamo, A.I.: Fast discovery of association rules. In: Advances in Knowledge Discovery and Data Mining, pp. 309–328. AAAI-MIT Press (1996)
Google Scholar
Botta, M., Meo, R., Sapino, M.L.: Incremental execution of the mine rule operator. Technical Report RT66-2002, University of Turin, Turin (May 2002)
Google Scholar
Chandra, A.K., Merlin, P.M.: Optimal implementation of conjunctive queries in relational databases. In: Ninth ACM Symposium on Theory of Computing, pp. 77–90 (1977)
Google Scholar
Dehaspe, L., Toivonen, H.: Discovery of frequent datalog patterns. In: Data Mining and Knowledge Discovery, vol. 3, pp. 7–36. Kluwer Academic Publishers, Dordrecht (1999)
Google Scholar
Diop, C.T.: Etude et mise en oeuvre des aspects itératifs de l’extraction de règles d’association dans une base de données. PhD thesis, Université de Tours, France (2003)
Google Scholar
Diop, C.T., Giacometti, A., Laurent, D., Spyratos, N.: Composition of mining contexts for efficient extraction of association rules. In: Jensen, C.S., Jeffery, K., Pokorný, J., Šaltenis, S., Bertino, E., Böhm, K., Jarke, M. (eds.) EDBT 2002. LNCS, vol. 2287, pp. 106–123. Springer, Heidelberg (2002)
Chapter Google Scholar
Giacometti, A., Laurent, D., Diop, C.T.: Condensed representations of sets of mining queries. In: Meo, R., Lanzi, P.L., Klemettinen, M. (eds.) Database Support for Data Mining Applications. LNCS (LNAI), vol. 2682, pp. 250–269. Springer, Heidelberg (2004)
Chapter Google Scholar
Giacometti, A., Laurent, D., Diop, C.T., Spyratos, N.: Mining from views: An incremental approach. International Journal Information Theories & Applications 9 (Techn. Report LI, Université de Tours, France) (2002)
Google Scholar
Han, J., Fu, Y., Wang, W., Koperski, K., Zaiane, O.: Dmql: A data mining query language for relational databases. In: SIGMOD Workshop DMKD 1996, pp. 27–34 (1996)
Google Scholar
Imielinski, T., Mannila, H.: A database perspective on knowledge discovery. Communications of the ACM 39, 58–64 (1996)
Article Google Scholar
Kamber, M., Han, J., Chiang, J.: Metarule-guided mining of multi-dimensional association rules using data cubes. In: International Conference on Data Mining and Knowledge Discovery (KDD 1997), Newport Beach, USA, pp. 207–210 (1997)
Google Scholar
Lee, S.D., De Raedt, L.: An algebra for inductive query evaluation. In: IEEE ICDM, pp. 147–154 (2003)
Google Scholar
Mannila, H., Toivonen, H.: Levelwise search and borders of theories in knowledge discovery. Data Mining and Knowledge Discovery 1(3), 241–258 (1997)
Article Google Scholar
Meo, R., Psaila, G., Ceri, S.: A new sql-like operator for mining association rules. In: 22nd VLDB Conf., pp. 122–133 (1996)
Google Scholar
Morzy, T., Wojciechowski, M., Zakrzewicz, M.: Materialized data mining views. In: Zighed, D.A., Komorowski, J., Żytkow, J.M. (eds.) PKDD 2000. LNCS (LNAI), vol. 1910, pp. 65–74. Springer, Heidelberg (2000)
Chapter Google Scholar
Nag, B., Deshpande, P., De Witt, D.J.: Using a knowledge cache for interactive discovery of association rules. In: 5th ACM SIGKDD Conference, pp. 244–253 (1999)
Google Scholar
Nag, B., Deshpande, P., De Witt, D.J.: Caching for multi-dimensional data mining queries. In: Systemics, Cybernetics and Informatics (SCI) (2001)
Google Scholar
De Raedt, L.: A perspective on inductive databases. SIGKDD Explor. Newsletter 4(2), 69–77 (2002)
Article Google Scholar
Tsur, S., Ullman, J.D., Abiteboul, S., Clifton, C., Motwani, R., Nestorov, S., Rosenthal, A.: Query flocks: A generalization of association-rule mining. In: ACM SIGMOD Conference, pp. 1–12 (1998)
Google Scholar
Ullman, J.D.: Principles of Databases and Knowledge-Base Systems, vol. 1. Computer Science Press, Rockville (1988)
Google Scholar

Download references

Author information

Authors and Affiliations

LI, Université de Tours, 41000, Blois, France
Cheikh Talibouya Diop & Arnaud Giacometti
LICP, Université de Cergy-Pontoise, 95 302 Cedex, Cergy-Pontoise, France
Dominique Laurent
LRI, Université Paris 11, 91405 Cedex, Orsay, France
Nicolas Spyratos
Université Gaston Berger, Saint-Louis, Senegal
Cheikh Talibouya Diop

Authors

Cheikh Talibouya Diop
View author publications
You can also search for this author in PubMed Google Scholar
Arnaud Giacometti
View author publications
You can also search for this author in PubMed Google Scholar
Dominique Laurent
View author publications
You can also search for this author in PubMed Google Scholar
Nicolas Spyratos
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

INSA-Lyon, LIRIS CNRS UMR5205, F-69621, Villeurbanne, France
Jean-François Boulicaut
Department of Computer Science, Katholieke Universiteit Leuven, Celestijnenlaan 200A, 3001, Heverlee, Belgium
Luc De Raedt
HIIT, Helsinki University of Technology and, University of Helsinki, Finland
Heikki Mannila

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Diop, C.T., Giacometti, A., Laurent, D., Spyratos, N. (2006). Computation of Mining Queries: An Algebraic Approach. In: Boulicaut, JF., De Raedt, L., Mannila, H. (eds) Constraint-Based Mining and Inductive Databases. Lecture Notes in Computer Science(), vol 3848. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11615576_6

Download citation

DOI: https://doi.org/10.1007/11615576_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-31331-1
Online ISBN: 978-3-540-31351-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics