Abstract
View materialization is considered to be one of the most efficient ways to speed up decision support process and OLAP queries in data warehouse architecture. There are great varieties of research topics concerning view materialization, such as user query rewrite to transparently direct user query from base table to materialized views, or materialized views update as soon as base table changes, etc. Among most of these topics, a proper selection of views to be materialized is fundamental. While much research work has been done on view materialization selection in the central case, there are still no appropriate solutions to the problem of view selection in distributed data warehouse architecture, which is just the focus of this paper. We model the views in distributed warehouse nodes with derivation cube which is a concept widely used in central data warehouse, and make extensions in order to adapt it to distributed cases. Then, we propose a greedy-based selection algorithm under a storage cost constraint to perform selection process. Finally, a detailed experimental comparison is made to demonstrate the advantage of our solution over simply applying the central methods repeatedly on each warehouse nodes.
The work is supported in part by National Natural Science Foundation of China (NSFC) with grant No. 60473124, Shanghai Science and Technology Committee Key Project with Grant No.20051020d1sj05-A and Shanghai Development Foundation of Science and Technology with Grant No.036505001.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Alhajj, R., Elnagar, A.: Incremental Materialization of Object-Oriented Views. Data and Knowledge Eng. 29(2), 121–145 (1999)
Kimball, R.: The Data Warehouse Toolkit. John Wiley & Sons, Inc., Chichester (1996)
Gupta, H.: Selection of Views to Materialize in a Data Warehouse. In: Proceedings of the 6th International Conference on Database Theory, ICDT 1999, Delphi, January 8-10 (1999)
Albrecht, J., Guenzel, H., Lehner, L.: Set-Derivability of Multidimensional Aggregates. In: Proceedings of the 1st International Conference on Data Warehousing and Knowledge Discovery, DAWAK 2001, Florence, Italy, August 30-September 1 (2001)
Gray, J., Bosworth, A., Layman, A., Pirahesh, H.: Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross-Tab, and Sub-Total. In: Proceedings of the 12th International Conference on Data Engineering, ICDE 1998, New Orleans (LA), U.S.A., February 26-March 1 (1998)
Gupta, H., Mumick, I.: Selection of Views to Materialize Under a Maintenance Cost Constraint. In: Proceedings of the 7th International Conference on Database Theory, ICDT 2001, Jerusalem, Israel, January 10-12 (2001)
Gupta, H.: Selection of Views to Materialize in a Data Warehouse. IEEE Trans. on Knowledge and Data Engineering 17(1), 24–43 (2005)
Harinarayan, V., Rajaraman, A., Ullman, J.: Implementing Data Cubes Efficiently. In: Proc. ACM SIGMOD Int’l Conf. Management of Data (1998)
Gupta, H., Harinarayan, V., Rajaraman, A., Ullman, J.: Index Selection in OLAP. In: Proc. Int’l Conf. Data Engineering (1998)
Chirkova, R., Halevy, A., Suciu, D.: A Formal Perspective on the View Selection Problem. In: Proc. Int’l Conf. on Very Large Database Systems (2001)
Karloff, H., Mihail, M.: On the Complexity of the View-Selection Problem. In: Proc. Symp. on Principles of Database Systems (PODS) (1999)
Chirkova, R.: The View Selection Problem Has an Exponential Bound for Conjunctive Queries and Views. In: Proc. ACM Symp. on Principles of Database Systems (2004)
Yang, J., Karlapalem, K., Li, Q.: Algorithms for Materialized View Design in Data Warehousing Environment. In: Proc. Int’l Conf. on Very Large Database Systems (1999)
Baralis, E., Paraboschi, S., Teniente, E.: Materialized View Selection in a Multidimensional Database. In: Proc. Int’l Conf. on Very Large Database Systems (1997)
Theodoratos, D., Sellis, T.: Data Warehouse Configuration. In: Proc. Int’l Conf. on Very Large Database Systems (1999)
Shukla, A., Deshpande, P., Naughton, J.: Materialized View Selection for Multidimensional Datasets. In: Proc. Int’l Conf. on Very Large Database Systems (2001)
Deshpande, P., Ramasamy, K., Shukla, A., Naughton, J.: Caching Multidimensional Queries Using Chunks. In: 27th International Conference on the Management of Data, SIGMOD 1998, Seattle, USA, June 2-4 (1998)
Scheuermann, P., Shim, J., Vingralek, R.: WATCHMAN: A Data Warehouse Intelligent Cache Manager. In: 22nd International Conference on Very Large Data Bases, VLDB 1999, Bombay, India, September 3-6 (1999)
Kotidis, Y., Roussopoulos, N.: DynaMat: A Dynamic View Management System for Data Warehouses. In: Proceedings ACM International Conference on Management of Data, SIGMOD 2002, Philadelphia (PA), U.S.A., June 1-3 (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ye, W., Gu, N., Yang, G., Liu, Z. (2005). Extended Derivation Cube Based View Materialization Selection in Distributed Data Warehouse. In: Fan, W., Wu, Z., Yang, J. (eds) Advances in Web-Age Information Management. WAIM 2005. Lecture Notes in Computer Science, vol 3739. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11563952_22
Download citation
DOI: https://doi.org/10.1007/11563952_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29227-2
Online ISBN: 978-3-540-32087-6
eBook Packages: Computer ScienceComputer Science (R0)