Skip to main content

Two-Stage Stochastic View Selection for Data-Analysis Queries

  • Conference paper
Advances in Databases and Information Systems

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 186))

Abstract

We consider the problem of selecting an optimal set of views to answer a given collection of queries at the present time (stage 1) as well as several collections of queries in the future (stage 2), with a given probability of occurrence associated with each collection, so as to minimize the expected value of the corresponding query response time, while keeping the total size of the views within a given limit. We formulate this problem as a two-stage stochastic programming problem. We show that this model is equivalent to an integer programming (IP) model that can be solved via various commercial IP solvers. We also study the relationship between the queries and the views in this context and use this relationship to reduce the size of the corresponding IP model, hence increase the scalability of our proposed approach.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Agrawal, S., Bruno, N., Chaudhuri, S., Narasayya, V.R.: AutoAdmin: Self-tuning database systems technology. IEEE Data Eng. Bull. 29(3), 7–15 (2006)

    Google Scholar 

  2. Agrawal, S., Chaudhuri, S., Narasayya, V.R.: Automated selection of materialized views and indexes in SQL databases. In: VLDB, pp. 496–505 (2000)

    Google Scholar 

  3. Asgharzadeh, Z.T.: Exact and inexact methods for solving the view and index selection problem for OLAP performance improvement. Phd dissertation, North Carolina State University (2010)

    Google Scholar 

  4. Asgharzadeh, Z.T., Chirkova, R., Fathi, Y.: Exact and inexact methods for solving the problem of view selection for aggregate queries. International Journal of Business Intelligence and Data Mining 4(3/4), 391–415 (2009)

    Article  Google Scholar 

  5. Asgharzadeh, Z.T., Chirkova, R., Fathi, Y., Stallmann, M.: Exact and inexact methods for selecting views and indexes for OLAP performance improvement. In: EDBT, pp. 311–322 (2008)

    Google Scholar 

  6. Birge, J.R., Louveaux, F.: Introduction to Stochastic Programming. Springer (1997)

    Google Scholar 

  7. Bruno, N., Chaudhuri, S.: Interactive physical design tuning. In: ICDE, pp. 1161–1164 (2010)

    Google Scholar 

  8. Bruno, N., Chaudhuri, S., Weikum, G.: Database tuning using online algorithms. In: Encyclopedia of Database Systems, pp. 741–744. Springer US (2009)

    Google Scholar 

  9. Chaudhuri, S., Dayal, U., Narasayya, V.R.: An overview of business intelligence technology. Communications of the ACM 54(8), 88–98 (2011)

    Article  Google Scholar 

  10. Chaudhuri, S., Narasayya, V.R., Weikum, G.: Database tuning using combinatorial search. In: Encyclopedia of Database Systems, pp. 738–741. Springer US (2009)

    Google Scholar 

  11. Chaudhuri, S., Weikum, G.: Self-management technology in databases. In: Encyclopedia of Database Systems, pp. 2550–2555. Springer US (2009)

    Google Scholar 

  12. Duan, S., Franklin, P., Thummala, V., Zhao, D., Babu, S.: Shaman: A self-healing database system. In: ICDE, pp. 1539–1542 (2009)

    Google Scholar 

  13. Harinarayan, V., Rajaraman, A., Ullman, J.D.: Implementing data cubes efficiently. In: SIGMOD, pp. 205–216 (1996)

    Google Scholar 

  14. Huang, R., Chirkova, R., Fathi, Y.: Two-stage stochastic view selection for data analysis. Tech. Rep. TR-2011-22, NC State University (2011), ftp://ftp.ncsu.edu/pub/unity/lockers/ftp/csc_anon/tech/2011/TR-2011-22.pdf

  15. ILOG: CPLEX 11.0 software package (2007), http://www.ilog.com/products/cplex/

  16. Kalnis, P., Mamoulis, N., Papadias, D.: View selection using randomized search. DKE 42, 89–111 (2002)

    Article  MATH  Google Scholar 

  17. Kotidis, Y., Roussopoulos, N.: A case for dynamic view management. ACM TODS 26(4), 388–423 (2001)

    Article  MATH  Google Scholar 

  18. Lightstone, S.: Physical database design for relational databases. In: Encyclopedia of Database Systems, pp. 2108–2114. Springer US (2009)

    Google Scholar 

  19. Shukla, A., Deshpande, P., Naughton, J.F.: Materialized view selection for multidimensional datasets. In: VLDB, pp. 488–499 (1998)

    Google Scholar 

  20. Theodoratos, D., Ligoudistianos, S., Sellis, T.K.: View selection for designing the global data warehouse. Data Knowledge and Engineering 39(3), 219–240 (2001)

    Article  MATH  Google Scholar 

  21. Theodoratos, D., Sellis, T.K.: Incremental design of a data warehouse. Journal of Intelligent Information Systems 15(1), 7–27 (2000)

    Article  Google Scholar 

  22. TPC-H Revision 2.1.0: TPC Benchmark H (Decision Support), http://www.tpc.org/tpch/spec/tpch2.1.0.pdf

  23. Wolsey, L.A.: Integer Programming. Wiley (1998)

    Google Scholar 

  24. Yang, J., Karlapalem, K., Li, Q.: Algorithms for materialized view design in data warehousing environment. In: VLDB, pp. 136–145 (1997)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Rong Huang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Huang, R., Chirkova, R., Fathi, Y. (2013). Two-Stage Stochastic View Selection for Data-Analysis Queries. In: Morzy, T., Härder, T., Wrembel, R. (eds) Advances in Databases and Information Systems. Advances in Intelligent Systems and Computing, vol 186. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-32741-4_11

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-32741-4_11

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-32740-7

  • Online ISBN: 978-3-642-32741-4

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics