skip to main content
10.1145/872757.872804acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
Article

A characterization of the sensitivity of query optimization to storage access cost parameters

Published:09 June 2003Publication History

ABSTRACT

Most relational query optimizers make use of information about the costs of accessing tuples and data structures on various storage devices. This information can at times be off by several orders of magnitude due to human error in configuration setup, sudden changes in load, or hardware failure. In this paper, we attempt to answer the following questions:• Are inaccurate access cost estimates likely to cause a typical query optimizer to choose a suboptimal query plan?• If an optimizer chooses a suboptimal plan as a result of inaccurate access cost estimates, how far from optimal is this plan likely to be?To address these issues, we provide a theoretical, vector-based framework for analyzing the costs of query plans under various storage parameter costs. We then use this geometric framework to characterize experimentally a commercial query optimizer. We develop algorithms for extracting detailed information about query plans through narrow optimizer interfaces, and we perform the characterization using database statistics from a published run of the TPC-H benchmark and a wide range of storage parameters.We show that, when data structures such as tables, indexes, and sorted runs reside on different storage devices, the optimizer can derive significant benefits from having accurate and timely information regarding the cost of accessing storage devices.

References

  1. A. Brown and D. A. Patterson. Towards avilability benchmarks: A case study of software RAID systems. In Proceedings of the USENIX Technical Conference, pages 263--276, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. S. Chaudhuri and V. Narasayya. Automating statistics management for query optimizers. IEEE Transactions on Knowledge and Data Engineering, 13:7--20, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. C.-M. Chen and N. Roussopoulos. Adaptive selectivity estimation using query feedback. In R. T. Snodgrass and M. Winslett, editors, Proceedings of the 1994 ACM SIGMOD International Conference on Management of Data, Minneapolis, Minnesota, May 24--27, 1994, pages 161--172. ACM Press, 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. F. Chu, J. Halpern, and J. Gehrke. Least expected cost query optimization: What can we expect? In Proceedings of the ACM Symposium on Principles of Database Systems, pages 293--302, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. F. Chu, J. Halpern, and P. Seshadri. Least expected cost query optimization: An exercise in utility. In Proceedings of the ACM Symposium on Principles of Database Systems, pages 138--147, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. R. L. Cole and G. Graefe. Optimization of dynamic query evaluation plans. In Proceedings of the ACM SIGMOD Conference, pages 150--160, 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. I. B. M. Corporation. IBM DB2 Universal Database Command Reference. International Business Machines Corporation, 2001. http://www.elink.ibmlink.ibm.com/public/applications/publications/cgibin/pbi.cgi.Google ScholarGoogle Scholar
  8. I. B. M. Corporation. Autonomic computing: IBM's perspective on the state of information technology. Technical report, 2002. http://www.research.ibm.com/autonomic/manifesto/autonomic_computing.pdf.Google ScholarGoogle Scholar
  9. I. B. M. Corporation. DB2 universal database version 8.1 for Linux, 2002.Google ScholarGoogle Scholar
  10. I. B. M. Corporation. TPC benchmark H full disclosure report: IBM eServer xSeries 350 using IBM DB2 Universal Database 7.2. Technical report, Transaction Processing Performance Council, 2002. http://www.tpc.org/results/ FDR/tpch/x350_100GB_16proc_FDR.pdf.Google ScholarGoogle Scholar
  11. T. P. P. Council. dbgen. http://www.tpc.org/tpch/spec/20000511.tar.z.Google ScholarGoogle Scholar
  12. T. P. P. Council. TPC Benchmark H. Technical report, Transaction Processing Performance Council, 2002. http://www.tpc.org/tpch/spec/tpch150.pdf.Google ScholarGoogle Scholar
  13. A. Deshpande and J. Hellerstein. Decoupled query optimization for federated database systems. In ICDE, 2002.Google ScholarGoogle ScholarCross RefCross Ref
  14. S. Ganguly. Design and analysis of parametric query optimization algorithms. In Proceedings of the Very Large Database Conference, pages 228--238, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. S. Ganguly, W. Hasan, and R. Krishnamurthy. Query optimization for parallel execution. In Proceedings of the ACM SIGMOD Conference, pages 9--18, 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. P. Gassner, G. M. Lohman, K. B. Schiefer, and Y. Wang. Query optimization in the IBM DB2 family. Data Engineering Bulletin, 16:4--18, 1993.Google ScholarGoogle Scholar
  17. G. Golub and C. van Loan. Matrix Computations. Johns Hopkins Press, 1983.Google ScholarGoogle Scholar
  18. G. Graefe. The cascades framework for query optimization. Data Engineering Bulletin, 18:19--29, 1995.Google ScholarGoogle Scholar
  19. A. Hulgeri and S. Sudarshan. Parameteric query optimization. In Proceedings of the Very Large Database Conference, pages 167--178, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Y. E. Ioannidis. Query optimization. ACM Computing Surveys, 28:121--123, 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Y. E. Ioannidis, R. T. Ng, K. Shim, and T. K. Sellis. Parametric query processing. In Proceedings of the Very Large Database Conference, pages 103--114, 1992. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. M. Jarke and J. Koch. Query optimization in database systems. ACM Computing Surveys, 16:111--152, 1984. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. H. Lu, K.-L. Tan, and S. Dao. The fittest survives: An adaptive approach to query optimization. In Proceedings of the Very Large Database Conference, pages 251--262, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. D. A. Patterson et al. Recovery-Oriented Computing (ROC): Motivation, definition, techniques, and case studies. Technical Report UCB/CSD-02-1175, UC Berkeley Computer Science Department, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. V. G. V. Prasad. Parametric query optimization: A geometric approach. Master's thesis, Indian Institute of Technology, Kanpur, India, 1999.Google ScholarGoogle Scholar
  26. F. Preparata and M. Shamos. Computational Geometry: an Introduction. Springer-Verlag, 1988. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. W. H. Press, B. P. Flannery, S. A. Teukolsky, and W. T. Vetterling. Numerical Recipes in C. Cambridge University Press, New York, NY, 1988. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. S. V. U. M. Rao. Parametric query optimizationi: A non-geometric approach. Master's thesis, Indian Institute of Technology, Kanpur, India, 1999.Google ScholarGoogle Scholar
  29. C. Ruemmler and J. Wilkes. An introduction to disk drive modeling. IEEE Computer, 27:17--28, 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. P. G. Selinger, M. M. Astrahan, D. D. Chamberlin, R. A. Lorie, and T. G. Price. Access path selection in a relational database management system. In Proceedings of the ACM SIGMOD Conference, pages 23--34. ACM, 1979. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. T. Shirai, L. Dilworth, R. Reutlinger, S. Kumar, D. Bernabe, B. Wilkins, and B. Cassells. DB2 UDB V7.1 Performance Tuning Guide. IBM International Technical Support Organization, December 2000.Google ScholarGoogle Scholar
  32. M. Steinbrunn, G. Moerkotte, and A. Kemper. Heuristic and randomized optimization for the join ordering problem. VLDB Journal, 6:191--208, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. M. Stillger and J. C. Freytag. Testing the quality of a query optimizer. Data Engineering Bulletin, 18:41--48, 1995.Google ScholarGoogle Scholar
  34. M. Stillger, G. Lohman, V. Markl, and M. Kandil. LEO --- DB2's learning optimizer. In Proceedings of the Very Large Database Conference, pages 19--28, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. G. Strang. Linear Algebra and its Applications. Harcourt, Brace and Jovanovich, Inc., Orlando, FL, 1988.Google ScholarGoogle Scholar
  36. G. Weikum, A. Moenkeberg, C. Hasse, and P. Zabback. Self-tuning database technology and information services: from wishful thinking to viable engineering. In Proceedings of the Very Large Database Conference, pages 20--31, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. B. L. Worthington, G. R. Ganger, Y. N. Patt, and J. Wilkes. On-line extraction of scsi disk drive parameters. In ACM SIGMETRICS Performance Evaluation Review, pages 146--156, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. A characterization of the sensitivity of query optimization to storage access cost parameters

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in
          • Published in

            cover image ACM Conferences
            SIGMOD '03: Proceedings of the 2003 ACM SIGMOD international conference on Management of data
            June 2003
            702 pages
            ISBN:158113634X
            DOI:10.1145/872757

            Copyright © 2003 ACM

            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 9 June 2003

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • Article

            Acceptance Rates

            SIGMOD '03 Paper Acceptance Rate53of342submissions,15%Overall Acceptance Rate785of4,003submissions,20%

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader