skip to main content
10.1145/1247480.1247533acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
Article

Extending relational query optimization to dynamic schemas for information integration in multidatabases

Published: 11 June 2007 Publication History

Abstract

This paper extends relational processing and optimization to the FISQL/FIRA languages for dynamic schema queries over multidatabases. Dynamic schema queries involve the creation and restructuring of metadata at runtime. We present a full implementation of a FISQL/FIRA engine, which includes subqueries and all transformational capabilities of FISQL/FIRA on distributed, multidatabase platforms. An important application of the system is to enhance traditional information architectures by enabling the creation and maintenance of dynamic wrappers and mapping queries at source databases within GAV, LAV, GLAV, peer-to-peer, or other integration frameworks. In addition to fully supporting FISQL/FIRA on multidatabases, our implementation introduces a bi-level optimization paradigm where purely relational sub-fragments of queries are pushed into source engines. This paradigm shares features of canonical distributed database processing, but has a new dimension through the extension of the relational model to dynamic schemas. We present empirical results showing the feasibility of optimization in this context, and discuss tradeoffs involved. Our system is the first to extend relational databases with these capabilities on this scale.

References

[1]
Francois Barbancon and Daniel P. Miranker. Implementing Federated Database Systems by Compiling Schema SQL. IDEAS 2002.
[2]
Conor Cunningham, C&3233;sar A. Galindo-Legaria, and Goetz Graefe. PIVOT and UNPIVOT: Optimization and Execution Strategies in an RDBMS. VLDB 2004.
[3]
Keir B. Davis and Fereidoon Sadri. Optimization of Schema SQL Queries. IDEAS 2001.
[4]
R. Dhamanka, Y. Lee, A. Doan, A. Halevy, and P. Domingos. iMAP: Discovering Complex Semantic Matches Between Database Schema. SIGMOD 2004.
[5]
George H. L. Fletcher and Catharine M. Wyss. Data Mapping as Search. EDBT 2006.
[6]
H. Garcia-Molina, Y. Papakonstantinou, D. Quass, A. Rajaraman, Y. Sagiv, J. Ullman, V. Vassalos, and J. Widom. The TSIMMIS approach to mediation: Data models and Languages. Journal of Intelligent Information Systems (JIIS), March/April 1997. Vol. 8, no. 2, pp 117--132.
[7]
John Grant, Witold Litwin, Nick Roussopoulos, and Timos Sellis. Query Languages for Relational Multidatabases. VLDB Journal, vol2. pp 153--171, 1993.
[8]
Marc Gyssens, Laks V. S. Lakshmanan, and Iyer N. Subramanian. Tables as a Paradigm for Querying and Restructuring. PODS96, pp. 93--103.
[9]
Laura Haas, Renee Miller, Bartholomew Niswonger, Mary Tork Roth, Peter Schwarz, and Edward Wimmers. Transforming Heterogeneous Data with Database Middleware: Beyond Integration. IEEE Bulletin of the Technical Committee on Data Engineering, March 1999. Vol. 22, no. 1, pp 31--36.
[10]
Qi He and Tok Wang Ling. Extending and Inferring Functional Dependencies in Schema Transformation. CIKM 2005.
[11]
L. V. S. Lakshmanan, F. Sadri, and S. N. Subramanian. On Efficiently Implementing SchemaSQL on an SQL Database System. VLDB 1999.
[12]
L. V. S. Lakshmanan, F. Sadri, and S. N. Subramanian. SchemaSQL - an extension to SQL for multidatabase interoperability. TODS Vol. 26, No. 4, December 2001.
[13]
Maurizio Lenzerini. Data Integration: ATheoretical Perspective. PODS 2002: 233--246.
[14]
Lin Liu. SIGMOD Record: Special Issue on Semantic Integration. 33:4, December 2004.
[15]
Witold Litwin, M. Ketabchi, and Ravi Krishnamurthy. First Order Normal Form for Relational Databases and Multidatabases. SIGMOD RECORD, Vol. 20, No. 4, December 1991.
[16]
Peter McBrien and Alexandra Poulovassilis. Data Integration by Bi-Directional Schema Transformation Rules. ICDE 2003.
[17]
Renée J. Miller, Mauricio A. Hernàndez, Laura M. Haas, Lingling Yan, C. T. Howard Ho, Ronald Fagin, and Lucian Popa. The Clio Project: Managing Heterogeneity. SIGMOD Record, 30:1,March 2001.
[18]
Amit P. Sheth and James A. Larson. Federated Database Systems for Managing Distributed, Heterogeneous, and Autonomous Databases. ACM Computing Surveys, 22:3, 1990.
[19]
M. T. Ozsu and P. Valduriez. Principles of Distributed Database Systems, 2nd ed. Prentice Hall, 1999.
[20]
Catharine M. Wyss and Edward L. Robertson. Relational Languages for Metadata Integration. TODS, Vol. 30, No. 2, June 2005.
[21]
Catharine M. Wyss and Edward L. Robertson. A formal characterization of PIVOT/UNPIVOT. CIKM 2005: 602--608.
[22]
http://www.microsoft.com/net, http://www.microsoft.com/vcsharp
[23]
http://www.antlr.org
[24]
http://www.microsoft.com/sql

Cited By

View all
  • (2013)Bidirectional by Necessity: Data Persistence and Adaptability for Evolving Application DevelopmentGenerative and Transformational Techniques in Software Engineering IV10.1007/978-3-642-35992-7_6(219-270)Online publication date: 2013
  • (2010)Stream-Based Real World Information Integration FrameworkWireless Sensor Network Technologies for the Information Explosion Era10.1007/978-3-642-13965-9_6(173-204)Online publication date: 2010
  • (2009)An Algebraic Language for Semantic Data Integration on the Hidden WebProceedings of the 2009 IEEE International Conference on Semantic Computing10.1109/ICSC.2009.94(237-244)Online publication date: 14-Sep-2009
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGMOD '07: Proceedings of the 2007 ACM SIGMOD international conference on Management of data
June 2007
1210 pages
ISBN:9781595936868
DOI:10.1145/1247480
  • General Chairs:
  • Lizhu Zhou,
  • Tok Wang Ling,
  • Program Chair:
  • Beng Chin Ooi
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 11 June 2007

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. FIRA
  2. FISQL
  3. data integration
  4. dynamic schema query processing
  5. federated data model
  6. federated databases
  7. federated interoperable RA
  8. federated interoperable SQL
  9. metadata integration
  10. metadata query processing

Qualifiers

  • Article

Conference

SIGMOD/PODS07
Sponsor:

Acceptance Rates

Overall Acceptance Rate 785 of 4,003 submissions, 20%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)4
  • Downloads (Last 6 weeks)0
Reflects downloads up to 05 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2013)Bidirectional by Necessity: Data Persistence and Adaptability for Evolving Application DevelopmentGenerative and Transformational Techniques in Software Engineering IV10.1007/978-3-642-35992-7_6(219-270)Online publication date: 2013
  • (2010)Stream-Based Real World Information Integration FrameworkWireless Sensor Network Technologies for the Information Explosion Era10.1007/978-3-642-13965-9_6(173-204)Online publication date: 2010
  • (2009)An Algebraic Language for Semantic Data Integration on the Hidden WebProceedings of the 2009 IEEE International Conference on Semantic Computing10.1109/ICSC.2009.94(237-244)Online publication date: 14-Sep-2009
  • (2009)Evaluation of a Framework for Dynamic Source Selection in Stream Processing2009 International Conference on Complex, Intelligent and Software Intensive Systems10.1109/CISIS.2009.167(1106-1111)Online publication date: Mar-2009
  • (2009)Towards a General Framework for Effective Solutions to the Data Mapping ProblemJournal on Data Semantics XIV10.1007/978-3-642-10562-3_2(37-73)Online publication date: 15-Nov-2009
  • (2008)Data exchange with data-metadata translationsProceedings of the VLDB Endowment10.14778/1453856.14538881:1(260-273)Online publication date: 1-Aug-2008
  • (2008)A video stream management system for heterogeneous information integration environmentsProceedings of the 2nd international conference on Ubiquitous information management and communication10.1145/1352793.1352836(202-207)Online publication date: 31-Jan-2008
  • (2008)Dynamic information integration using object deputy model2008 8th IEEE International Conference on Computer and Information Technology10.1109/CIT.2008.4594663(137-142)Online publication date: Jul-2008

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media