Abstract
There is currently great interest in integrating knowledge discovery research into mainstream database systems. Such an enterprise is nontrivial because knowledge discovery and database systems are rooted in different paradigms, therefore foundational work needs to be carried out and a candidate unified syntax and semantics needs to be proposed. Elsewhere we have indeed carried out such foundational work and used it to propose a unified syntax and semantics for integrating query processing and knowledge discovery. We refer to the resulting class of database systems as combined inference database systems (CIDS), since they are a class of logic-based databases and the integration is anchored by a view of query answering as deductive inference and of knowledge discovery as inductive inference. The most important novel capability of CIDS is that of evaluating expressions which seamlessly compose query answering and knowledge discovery steps. This gives rise to increased flexibility, usability and expressiveness in user interactions with database systems, insofar as many relevant and challenging kinds of information needs can be catered for by CIDS that would be cumbersome to cater for by gluing together existing, state-of-the-art (but, syntactically and semantically, heterogeneous) components. In this paper, we provide an overview of CIDS, then we introduce two motivating applications, we show how CIDS elegantly support such challenging application needs, and we contrast our work with other attempts at integrating knowledge discovery and databases technology.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Agrawal, R., Srikant, R.: Fast algorithms for mining association rules. In: Proc. VLDB 1994, pp. 487–499. Morgan Kaufmann, San Francisco (1994)
Alpdemir, M., Mukherjee, A., Gounaris, A., Paton, N., Watson, P., Fernandes, A., Fitzgerald, D.: OGSA-DQP: A service for distributed querying on the grid. In: Bertino, E., Christodoulakis, S., Plexousakis, D., Christophides, V., Koubarakis, M., Böhm, K., Ferrari, E. (eds.) EDBT 2004. LNCS, vol. 2992, pp. 858–861. Springer, Heidelberg (2004)
Aragão, M.A.T., Fernandes, A.A.A.: Characterizing web service substitutivity with combined deductive and inductive engines. In: Yakhno, T. (ed.) ADVIS 2002. LNCS, vol. 2457, pp. 244–254. Springer, Heidelberg (2002)
Aragão, M.A.T., Fernandes, A.A.A.: Inductive-deductive databases for knowledge management. In: Proc. ECAI KM&OM 2002, pp. 11–19 (2002)
Aragão, M.A.T., Fernandes, A.A.A.: Combining query answering and knowledge discovery. Technical report, University of Machester (2003)
Bergadano, F.: Inductive database relations. IEEE TKDE 5(6), 969–971 (1993)
Ceri, S., Gottlob, G., Tanca, L.: Logic Programming and Databases (1990)
Christiansen, H.: Automated reasoning with a constraint-based metainterpreter. Journal of Logic Programming 37(1-3), 213–254 (1998)
Dehaspe, L., Raedt, L.D.: Dlab: A declarative language bias formalism. In: Michalewicz, M., Raś, Z.W. (eds.) ISMIS 1996. LNCS, vol. 1079, pp. 613–622. Springer, Heidelberg (1996)
Džeroski, S., Lavrač, N. (eds.): Relational Data Mining. Springer, Heidelberg (2001)
Fegaras, L., Maier, D.: Optimizing object queries using an effective calculus. ACM TODS 25(4), 457–516 (2000)
Han, J., Fu, Y., Wang, W., Chiang, J., Zaïane, O.R., Koperski, K.: DBMiner: interactive mining of multiple-level knowledge in relational databases. In: Proc. SIGMOD 1996, pp. 550–550. ACM Press, New York (1996)
Hand, D.J., Mannila, H., Smyth, P.: Principles of Data Mining. MIT, Cambridge (2001)
Imielinski, T., Virmani, A.: MSQL: A query language for database mining. DMKD 3(4), 373–408 (1999)
Lakshmanan, L.V.S., Shiri-Varnaamkhaasti, N.: A parametric approach to deductive databases with uncertainty. IEEE TKDE 13(4), 554–570 (2001)
Lee, S.D., de Raedt, L.: An algebra for inductive query evaluation. In: Proc. ICDM 2003 (2003)
Mannila, H.: Inductive databases and condensed representations for data mining. In: Džeroski, S., Lavrač, N. (eds.) ILP 1997. LNCS, vol. 1297, pp. 21–30. Springer, Heidelberg (1997)
Meo, R., Psaila, G., Ceri, S.: A new SQL-like operator for mining association rules. In: Proc. VLDB’96, pp. 122–133, 3–6 Morgan Kaufmann, San Francisco (1996)
Minker, J.: Logic and databases: A 20 year retrospective. In: Pedreschi, D., Zaniolo, C. (eds.) LID 1996. LNCS, vol. 1154, pp. 3–58. Springer, Heidelberg (1996)
Mowshowitz, A.: Virtual organizations. CACM 40(9), 30–37 (1997)
Netz, A., Chaudhuri, S., Fayyad, U.M., Bernhardt, J.: Integrating data mining with sql databases: OLE DB for data mining. In: Proc. ICDE 2001, pp. 379–387. IEEE Computer, Los Alamitos (2001)
Quinlan, J.R.: Learning decision tree classifiers. ACM Computing Surveys 28(1), 71–72 (1996)
Sampaio, S., Paton, N.W., Smith, J., Watson, P.: Validated cost models for parallel OQL query processing. In: Bellahsène, Z., Patel, D., Rolland, C. (eds.) OOIS 2002. LNCS, vol. 2425, pp. 60–75. Springer, Heidelberg (2002)
Stevens, R.D., Robinson, A.J., Goble, C.A.: myGrid: personalised bioinformatics on the information grid. Bioinformatics 19(1) (2003)
Ullman, J.D.: Bottom-up beats top-down for datalog. In: Proc. 8th ACM SIGACTSIGMOD- SIGART PODS 1989, pp.140–149. ACM Press, New York (1989)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Aragão, M.A.T., Fernandes, A.A.A. (2004). Logic-Based Integration of Query Answering and Knowledge Discovery. In: Christiansen, H., Hacid, MS., Andreasen, T., Larsen, H.L. (eds) Flexible Query Answering Systems. FQAS 2004. Lecture Notes in Computer Science(), vol 3055. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-25957-2_7
Download citation
DOI: https://doi.org/10.1007/978-3-540-25957-2_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22160-9
Online ISBN: 978-3-540-25957-2
eBook Packages: Springer Book Archive