skip to main content
10.1145/1099554.1099730acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
Article

Selectivity-based partitioning: a divide-and-union paradigm for effective query optimization

Published:31 October 2005Publication History

ABSTRACT

Modern query optimizers select an efficient join ordering for a physical execution plan based essentially on the average join selectivity factors among the referenced tables. In this paper, we argue that this "monolithic" approach can miss important opportunities for the effective optimization of relational queries. We propose selectivity-based partitioning, a novel optimization paradigm that takes into account the join correlations among relation fragments in order to essentially enable multiple (and more effective) join orders for the evaluation of a single query. In a nutshell, the basic idea is to carefully partition a relation according to the selectivities of the join operations, and subsequently rewrite the query as a union of constituent queries over the computed partitions. We provide a formal definition of the related optimization problem and derive properties that characterize the set of optimal solutions. Based on our analysis, we develop a heuristic algorithm for computing efficiently an effective partitioning of the input query. Results from a preliminary experimental study verify the effectiveness of the proposed approach and demonstrate its potential as an effective optimization technique.

References

  1. S. Babu, R. Motwani, K. Munagala, I. Nishizawa, and J. Widom. Adaptive ordering of pipelined stream filters. In ACM SIGMOD, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. K. Chakrabarti, M. Garofalakis, R. Rastogi, and K. Shim. Approximate Query Processing Using Wavelets. In VLDB, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. S. Chaudhuri and K. Shim. Optimization of queries with user-defined predicates. In VLDB, 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. A.Dobra, M.Garofalakis, J.Gehrke, and R. Rastohi. Processing complex aggregate queries over data streams. In ACM SIGMOD, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. S. Cluet and G. Moerkotte. On the complexity of generating optimal left-deep processing trees with cross products. In ICDT, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. A. Deshpande and J. M. Hellerstein. Lifting the burden of history from adaptive query processing. In VLDB, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. D. J. DeWitt, R. H. Gerber, G. Graefe, M. L. Heytens, K. B. Kumar, and M. Muralikrishna. GAMMA - A High Performance Dataflow Database Machine. In VLDB, 1986. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. D. J. DeWitt and J. Gray. Parallel database systems: The future of high performance database systems. CACM, 35(6), 1992. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. G. Graefe and D. J. DeWitt. The exodus optimizer generator. In ACM SIGMOD, 1987. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. A. Halevy. Answering queries using views: A survey. Intl. Journal on Very Large Data Bases, 10(4), 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. K. A. Hua and C.Lee. An adaptive data placement scheme for parallel database computer systems. In VLDB, 1990. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. T. Ibaraki and T. Kameda. On the optimal nesting order for computing n-relational joins. ACM Transactions on Database Systems, 9(3):482--502, 1984. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Y. E. Ioannidis and Y. C. Kang. Randomized algorithms for optimizing large join queries. In ACM SIGMOD, 1990. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Y. E. Ioannidis and V. Poosala. Histogram-Based Approximation of Set-Valued Query Answers. In VLDB, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Y. E. Ioannidis. "Universality of Serial Histograms". In VLDB, 1993. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. N. Kabra and D. J. DeWitt. Efficient mid-query re-optimization of sub-optimal query execution plans. In SIGMOD, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. R. Krishnamurthy, B. Boral, and C. Zaniolo. Optimization of nonrecursive queries. In VLDB, 1986. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. H. Pirahesh, J. M. Hellerstein, and W. Hasan. Extensible/rule based query rewrite optimization in starburst. In ACM SIGMOD, 1992. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. P. G. Selinger, M. M. Astrahan, R. D. Chamberlin, R. A. Lorie, and T. G. Price. Access path selection in a relational database management system. In ACM SIGMOD, 1979. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. T. K. Sellis. Multiple-query optimization. ACM TODS, 13(1), 1988. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Selectivity-based partitioning: a divide-and-union paradigm for effective query optimization

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in
          • Published in

            cover image ACM Conferences
            CIKM '05: Proceedings of the 14th ACM international conference on Information and knowledge management
            October 2005
            854 pages
            ISBN:1595931406
            DOI:10.1145/1099554

            Copyright © 2005 ACM

            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 31 October 2005

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • Article

            Acceptance Rates

            CIKM '05 Paper Acceptance Rate77of425submissions,18%Overall Acceptance Rate1,861of8,427submissions,22%

            Upcoming Conference

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader