
An Adaptive Distributed Query Processing Grid Service

  • Conference paper
Data Management in Grids (DMG 2005)

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 3836)


Abstract

Grid services provide an important abstraction layer on top of the heterogeneous components (hardware and software) that take part in a grid environment. We are developing a data grid service prototype that aims to give data-intensive scientific applications transparent use of grid resources. The prototype was designed to address three main issues: (1) dynamic scheduling and allocation of query execution engine modules to grid nodes; (2) adaptability of query execution to variations in environment conditions; and (3) support for special scientific operations. We propose a new node scheduling algorithm and show how it can be integrated into a simple distributed and parallel query optimization strategy. Our implementation demonstrates a speedup of 16.6 with 18 scheduled nodes and a steady throughput rate, obtained by applying a dynamic adaptive strategy.
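
To put the reported speedup in context, the short sketch below computes the parallel efficiency it implies, using the standard definition of efficiency as speedup divided by the number of nodes. The function name `parallel_efficiency` is a hypothetical helper introduced only for this illustration; the abstract itself reports just the speedup (16.6) and the node count (18).

```python
def parallel_efficiency(speedup: float, nodes: int) -> float:
    """Return speedup per node (standard definition of parallel efficiency)."""
    if nodes <= 0:
        raise ValueError("nodes must be a positive integer")
    return speedup / nodes


# Figures taken from the abstract: speedup of 16.6 on 18 scheduled nodes.
efficiency = parallel_efficiency(16.6, 18)
print(f"{efficiency:.3f}")  # prints 0.922
```

Under this definition, the reported figures correspond to roughly 92% of ideal linear speedup.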




Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Porto, F., da Silva, V.F.V., Dutra, M.L., Schulze, B. (2006). An Adaptive Distributed Query Processing Grid Service. In: Pierson, JM. (eds) Data Management in Grids. DMG 2005. Lecture Notes in Computer Science, vol 3836. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11611950_5


  • DOI: https://doi.org/10.1007/11611950_5

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-31212-3

  • Online ISBN: 978-3-540-32452-2

  • eBook Packages: Computer Science, Computer Science (R0)
