skip to main content
10.1145/2247596.2247630acmotherconferencesArticle/Chapter ViewAbstractPublication PagesedbtConference Proceedingsconference-collections
research-article

Optimizing index deployment order for evolving OLAP

Published:27 March 2012Publication History

ABSTRACT

Many database applications deploy hundreds or thousands of indexes to speed up query execution. Despite a plethora of prior work on index selection, designing and deploying indexes remains a difficult task for database administrators. First, real-world businesses often require online index deployment, and the traditional off-line approach to index selection ignores intermediate workload performance during index deployment. Second, recent work on on-line index selection does not address effects of complex interactions that manifest during index deployment.

In this paper, we propose a new approach that incorporates transitional design performance into the overall problem of physical database design. We call our approach Incremental Database Design. As the first step in this direction, we study the problem of ordering index deployment. The benefits of a good index deployment order are twofold: (1) a prompt query runtime improvement and (2) a reduced total time to deploy the design. Finding an effective deployment order is difficult due to complex index interaction and a factorial number of possible solutions.

We formulate a mathematical model to represent the index ordering problem and demonstrate that Constraint Programming (CP) is a more efficient solution compared to other methods such as mixed integer programming and A * search. In addition to exact search techniques, we also study local search algorithms that make significant improvements over a greedy solution with minimal computational overhead.

Our empirical analysis using the TPC-H dataset shows that our pruning techniques can reduce the size of the search space by many orders of magnitude. Using the TPC-DS dataset, we verify that our local search algorithm is a highly scalable and stable method for quickly finding the best known solutions.

References

  1. S. Agrawal, S. Chaudhuri, and V. Narasayya. Automated selection of materialized views and indexes in sql databases. In VLDB, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. S. Agrawal, E. Chu, and V. Narasayya. Automatic physical design tuning: workload as a sequence. In SIGMOD, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. P. Baptiste, C. L. Pape, and W. Nuijten. Constraint-Based Scheduling - Applying Constraint Programming to Scheduling Problems. Springer, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. M. Blaschka, C. Sapia, and G. Hofling. On schema evolution in multidimensional databases. DataWarehousing and Knowledge Discovery, pages 153--164, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. N. Bruno and S. Chaudhuri. An online approach to physical design tuning. In ICDE, 2007.Google ScholarGoogle ScholarCross RefCross Ref
  6. N. Bruno and S. Chaudhuri. Physical design refinement: The merge-reduce approach. TODS, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. S. Chaudhuri and V. Narasayya. AutoAdmin what-if index analysis utility. In SIGMOD, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. S. Chaudhuri and V. R. Narasayya. An Efficient Cost-Driven Index Selection Tool for Microsoft SQL Server. In VLDB, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. D. Dash, N. Polyzotis, and A. Ailamaki. CoPhy: A Scalable, Portable, and Interactive Index Advisor for Large Workloads. In PVLDB, volume 4, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. S. Finkelstein, M. Schkolnick, and P. Tiberio. Physical database design for relational databases. TODS, 13(1):91--128, 1988. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. F. Glover and G. Kochenberger. Handbook of metaheuristics. Springer, 2003.Google ScholarGoogle ScholarCross RefCross Ref
  12. F. Glover and M. Laguna. Tabu Search. Kluwer, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. H. Gupta and I. Mumick. Selection of views to materialize under a maintenance cost constraint. ICDT, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. S. Idreos, M. L. Kersten, and S. Manegold. Database cracking. In CIDR, 2007.Google ScholarGoogle Scholar
  15. H. Kimura, C. Coffrin, A. Rasin, and S. Zdonik. A Constraint-Programming Approach To Optimize Index Deployment Order (Extended Version). Arxiv/CoRR abs arXiv:1107.3606, 2011.Google ScholarGoogle Scholar
  16. H. Kimura, G. Huo, A. Rasin, S. Madden, and S. B. Zdonik. Correlation Maps: A Compressed Access Method for Exploiting Soft Functional Dependencies. In VLDB, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. H. Kimura, G. Huo, A. Rasin, S. Madden, and S. B. Zdonik. CORADD: Correlation aware database designer for materialized views and indexes. VLDB, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. C. Mohan and I. Narang. Algorithms for creating indexes for very large tables without quiescing updates. In Proceedings of the 1992 ACM SIGMOD international conference on Management of data, SIGMOD '92, 1992. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. S. Papadomanolakis and A. Ailamaki. An Integer Linear Programming Approach to Database Design. In ICDE, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. K. Schnaitter, S. Abiteboul, T. Milo, and N. Polyzotis. On-Line Index Selection for Shifting Workloads. In Proceedings of the 2007 IEEE 23rd International Conference on Data Engineering Workshop, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. K. Schnaitter et al. Index interactions in physical design tuning: Modeling, analysis, and applications. VLDB, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. J. P. Sousa and L. A. Wolsey. A time indexed formulation of non-preemptive single machine scheduling problems. Mathematical Programming, 54:353--367, 1992. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. P. Van Hentenryck and L. Michel. Constraint-based local search. The MIT Press, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. D. C. Zilio et al. DB2 design advisor: integrated automatic physical database design. In VLDB, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in
  • Published in

    cover image ACM Other conferences
    EDBT '12: Proceedings of the 15th International Conference on Extending Database Technology
    March 2012
    643 pages
    ISBN:9781450307901
    DOI:10.1145/2247596

    Copyright © 2012 ACM

    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    • Published: 27 March 2012

    Permissions

    Request permissions about this article.

    Request Permissions

    Check for updates

    Qualifiers

    • research-article

    Acceptance Rates

    Overall Acceptance Rate7of10submissions,70%
  • Article Metrics

    • Downloads (Last 12 months)3
    • Downloads (Last 6 weeks)0

    Other Metrics

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader