dbRouter - A Scaleable and Distributed Query Optimization and Processing Framework

Tok, Wee Hyong; Bressan, Stéphane

doi:10.1007/3-540-46146-9_65

Wee Hyong Tok⁷ &
Stéphane Bressan⁷

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2453))

Included in the following conference series:

International Conference on Database and Expert Systems Applications

1414 Accesses

Abstract

In data integration systems, a central site often maintain a global catalog of all available data sources, and maintain statistics to allow the query optimizer to generate a good query plan. These statistics could be updated in a lazy manner during query execution time. A user query is often broken into several query fragments, and a centralized task scheduler schedules the execution of the respective query fragment, fetching data from the various data sources. This is then integrated at the central site and presented to the user. As data sources are introduced, there is a need to update the global catalog from time to time. However, due to the autonomous nature of the data sources, which are maintained by local administrators, it is dificult to ensure accurate statistics as well as the availability of the data sources. In addition, since the data are integrated at the central site, the central site could become a potential bottleneck. The unpredictable nature of the wide area environment further exacerbate the problem of query processing.

In this paper, we present our ongoing work on dbRouter, a distributed query optimization and processing framework for open environment. The dbRouter provides mechanisms to faciliate the discovery of new data sources, performs distributed query optimization, and manages the routing of data to its destination for processing.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Distributed secondo: an extensible and scalable database management system

Article 23 June 2017

Database Integration—Multidatabase Systems

Distributed SECONDO: A Highly Available and Scalable System for Spatial Data Processing

References

Laurent Amsaleg, Michael J. Franklin, and Anthony Tomasic. Dynamic query operator scheduling for wide-area remote access.
Google Scholar
Laurent Amsaleg, Michael J. Franklin, Anthony Tomasic, and Tolga Urhan. Scrambling query plans to cope with unexpected delays, 1996.
Google Scholar
Remzi H. Arpaci-Dusseau, Eric Anderson, Noah Treuhaft, David E. Culler, Joseph M. Hellerstein, David Patterson, and Kathy Yelic. Cluster i/o with river: Making the fast case common, 1999.
Google Scholar
R. Avnur and J. Hellerstein. Eddies: Continuously adaptive query processing, 2000.
Google Scholar
Philippe Bonnet, Johannes Gehrke, and Praveen Seshadri. Towards sensor database systems, Jan 2001.
Google Scholar
Sudarshan Chawathe, Hector Garcia-Molina, Joachim Hammer, Kelly Ireland, Yannis Papakonstantinou, Jeffrey D. Ullman, and Jennifer Widom. The TSIMMIS project: Integration of heterogeneous information sources. In 16th Meeting of the Information Processing Society of Japan, pages 7–18, Tokyo, Japan, 1994.
Google Scholar
L. Haas, D. Kossman, E. Wimmers, and J. Yang. Optimizing queries across diverse data sources, 1997.
Google Scholar
Tomasz Imielinski and Samir Goel. Dataspace-querying and monitoring deeply networked collections in physical space.
Google Scholar
Z. Ives, D. Florescu, M. Friedman, A. Levy, and D. Weld. An adaptive query execution system for data integration. Proceedings of ACM SIGMOD Conf., Philadelphia, PA, 1999., 1999.
Google Scholar
Z.G. Ives, A. Y. Levy, J. Madhavan, R. Pottinger, S. Saroiu, I. Tatarinov, S. Betzler, Q. Chen, E. Jaslikowska, J. Su, W. Tak, and T. Yeung. Self-organising data sharing communities with sagres.
Google Scholar
Michael Stillger, Johann K. Obermaier, and Johann Christoph Freytag. Aques: An agent-based query evaluation system, June 1997.
Google Scholar
M. Stonebraker, P.M. Aoki, R. Devine, W. Litwin, and M. Olson. Mariposa: A new architecure for distributed data, Feb 1994.
Google Scholar
A. Tomasic, L. Raschid, and P. Valduriez. Scaling access to heterogeneous data sources with disco, September/October 1998.
Google Scholar
Tolga Urhan and Michael J. Franklin. Xjoin: A reactively-scheduled pipelined join operator, 2000.
Google Scholar
Tolga Urhan, Michael J. Franklin, and Laurent Amsaleg. Cost-based query scrambling for initial delays, 1998.
Google Scholar

Download references

Author information

Authors and Affiliations

School of Computing, National University of Singapore, Singapore
Wee Hyong Tok & Stéphane Bressan

Authors

Wee Hyong Tok
View author publications
You can also search for this author in PubMed Google Scholar
Stéphane Bressan
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Université Paul Sabatier, IRIT, 118 route de Narbonne, 31062, Toulouse Cedex, France
Abdelkader Hameurlain
Département Informatique, Université Aix-Marseille II, IUT, 413 Avenue Gaston Berger, 13625, Aix-en-Provence Cedex 1, France
Rosine Cicchetti
Institute of Applied Computer Science, University of Linz, Altenbergerstr. 69, 4040, Linz, Austria
Roland Traunmüller

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Tok, W.H., Bressan, S. (2002). dbRouter - A Scaleable and Distributed Query Optimization and Processing Framework. In: Hameurlain, A., Cicchetti, R., Traunmüller, R. (eds) Database and Expert Systems Applications. DEXA 2002. Lecture Notes in Computer Science, vol 2453. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-46146-9_65

Download citation

DOI: https://doi.org/10.1007/3-540-46146-9_65
Published: 20 August 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44126-7
Online ISBN: 978-3-540-46146-3
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics

dbRouter - A Scaleable and Distributed Query Optimization and Processing Framework

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Distributed secondo: an extensible and scalable database management system

Database Integration—Multidatabase Systems

Distributed SECONDO: A Highly Available and Scalable System for Spatial Data Processing

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

dbRouter - A Scaleable and Distributed Query Optimization and Processing Framework

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Distributed secondo: an extensible and scalable database management system

Database Integration—Multidatabase Systems

Distributed SECONDO: A Highly Available and Scalable System for Spatial Data Processing

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation