skip to main content
10.1145/1341771.1341789acmotherconferencesArticle/Chapter ViewAbstractPublication PagescomputeConference Proceedingsconference-collections
research-article

An integrated query optimization system for data grids

Published:18 January 2008Publication History

ABSTRACT

The disparate and geographically distributed data sources in an enterprise can be integrated using distributed computing technologies such as data grids. The real challenge involved in such data integration efforts is in the design and development of the distributed query processing engine that lie beneath such integrated systems. In the current literature, distributed query processing and optimization is carried out in three distinct phases namely, (1) creation of single node plan, (2) generation of parallel plan, and (3) optimal site selection for plan execution. As considering the three phases in isolation leads to sub-optimal plans, the paper proposes a new distributed query optimization model that integrates all the three phases of the query optimization. This paper also presents different heuristic approaches for solving the proposed integrated distributed query processing problem. Furthermore, the presented system is integrated with a data grid solution and several real-time experiments are conducted to demonstrate its usefulness.

References

  1. Apers, P., Hevner, A., and Yao, A., "Optimization algorithms for distributed queries", IEEE Transactions on Software Engineering, 9 (1), pp. 57--68, 1983 Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Bagul, S. S., Ranade, N., Sharma, A. et al (2006) A Grid based Approach for Dynamic Integration and Access of Distributed and Heterogeneous Information across an Enterprise, International Conference on Information Resources Management Association, (IRMA), 2006Google ScholarGoogle Scholar
  3. Bernstein, P., Goodman, N., Wong, E., Reeve, C., and Rothnie, J., "Query processing in a system for distributed databases (SDD-1)", ACM Transactions on Database Systems, 6 (4), pp. 602--625, 1981 Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Buyya, R., and Venugopal, S., "A gentle introduction to grid computing and technologies", Computer Society of India Communications, July 2005Google ScholarGoogle Scholar
  5. Fegaras, L., "A new heuristic for optimizing large queries", Proc. of DEXA 98, pp. 726--735, 1998 Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Gounaris, A., Sakellariou, R., Paton, N. W., and Fernandes, A. A. A., "A novel approach to resource scheduling for parallel query processing on computational grids", Distributed parallel databases, 19, pp. 87--106, 2006 Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Graefe, G., "Query evaluation techniques for large databases", ACM Computing Surveys, 25 (2), 1993. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Graefe, G., "Encapsulation of parallelism in the volcano query processing system", Proc. of the ACM SIGMOD Conf. on Management of Data, Atlantic City, NJ, USA, pp. 102--111, 1990 Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Haraty, R. A., and Fany, R. C., "Query acceleration in distributed database systems", Revista Comlombiana de Computación, 2 (1), pp. 19--34, 2001Google ScholarGoogle Scholar
  10. Ioannidis, Y., "Query optimization", ACM Computing Surveys, 28 (1), pp. 121--123, 1996 Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Kossmann, D., and Stocker, K., "Iterative dynamic programming: A new class of query optimization algorithms", ACM Transactions on Database Systems, 25 (1), pp. 43--82, 2000 Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Krishnamoorthy, S., (2007) Integrated Distributed Query Processor for the Data Grids, IADIS International Conference on WWW/Internet 2007, Oct 5--8, Vila Real, Portugal (accepted for publication)Google ScholarGoogle Scholar
  13. Labrinidis, A., Roussopoulos, N., "Exploring the tradeoffbetween performance and data freshness in database-driven web servers", The VLDB Journal, 13 (3), pp. 204--255, 2004 Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Liu, C., Chen, H., "A hash partitioning strategy for distributed query processing", Proc. of 5th Intl. Conf. on Extending Database Technology: Advances in Database Technology, LNCS Vol. 1057, pp. 373--387, 1996 Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Liu, L., Pu, C., and Richine, K., "Distributed query scheduling service: An architecture and its implementation", Intl. Journal of Cooperative Information Systems, 7 (2 & 3), 1998Google ScholarGoogle Scholar
  16. Lin, S., "Computer solutions of the traveling salesman problem", Bell System Technical Journal, 44, pp. 2245--2269, 1965Google ScholarGoogle ScholarCross RefCross Ref
  17. OGSA-DAI Project, http://www.ogsadai.org.uk/, last accessed 05-Dec-2006Google ScholarGoogle Scholar
  18. Or, I,., "Traveling Salesman-Type Combinatorial Problems and their Relation to the Logistics of Regional Blood Banking", Ph.D. thesis, Northwestern University, Evanston, Illinois, 1976Google ScholarGoogle Scholar
  19. Selinger, P. G., Astrahan, M. M.,. Chamberlin, D. D., Lorie, R. A., and Price, T. G., "Access path selection in a relational database management system", Proc. of the 1979 ACM SGMOD Intl. Conf. on the Management of Data, 1979 Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Yu, C. T., Chang, C., and Chang, Y., "Two surprising results in processing simple queries in distributed databases", Proc. of 6th IEEE Intl. Computer Software and Applications Conference, pp. 377--384, 1982Google ScholarGoogle Scholar
  21. Yu, C. T., and Chang, C. C., "Distributed query processing", ACM Computing Surveys, 16 (4), pp. 399--43, 1984 Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. An integrated query optimization system for data grids

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Other conferences
          COMPUTE '08: Proceedings of the 1st Bangalore Annual Compute Conference
          January 2008
          195 pages
          ISBN:9781595939500
          DOI:10.1145/1341771

          Copyright © 2008 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 18 January 2008

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article

          Acceptance Rates

          Overall Acceptance Rate114of622submissions,18%

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader