Abstract
During data warehouse design, the designer frequently encounters the problem of choosing among different alternatives for the same design construct. The behavior of the chosen design in the presence of evolution events is an important parameter for this choice. This paper proposes metrics to assess the quality of the warehouse design from the viewpoint of evolution. We employ a graph-based model to uniformly abstract relations and software modules, like queries, views, reports, and ETL activities. We annotate the warehouse graph with policies for the management of evolution events. The proposed metrics are based on graph-theoretic properties of the warehouse graph to assess the sensi tivity of the graph to a set of possible events. We evaluate our metrics with experiments over alternative configurations of the same warehouse schema.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Allen, E.B.: Measuring Graph Abstractions of Software: An Information-Theory Approach. In: METRICS (2002)
Bellahsene, Z.: Schema evolution in data warehouses. Knowl. and Inf. Syst. 4(2) (2002)
Berenguer, G., et al.: A Set of Quality Indicators and their Corresponding Metrics for Conceptual Models of Data Warehouses. In: Tjoa, A.M., Trujillo, J. (eds.) DaWaK 2005. LNCS, vol. 3589. Springer, Heidelberg (2005)
Blaschka, M., Sapia, C., Höfling, G.: On Schema Evolution in Multidimensional Databases. In: Mohania, M., Tjoa, A.M. (eds.) DaWaK 1999. LNCS, vol. 1676. Springer, Heidelberg (1999)
Fan, H., Poulovassilis, A.: Schema Evolution in Data Warehousing Environments - A Schema Transformation-Based Approach. In: Atzeni, P., Chu, W., Lu, H., Zhou, S., Ling, T.-W. (eds.) ER 2004. LNCS, vol. 3288. Springer, Heidelberg (2004)
Favre, C., Bentayeb, F., Boussaid, O.: Evolution of Data Warehouses’ Optimization: A Workload Perspective. In: Song, I.-Y., Eder, J., Nguyen, T.M. (eds.) DaWaK 2007. LNCS, vol. 4654. Springer, Heidelberg (2007)
Golfarelli, M., Lechtenbörger, J., Rizzi, S., Vossen, G.: Schema versioning in data warehouses: Enabling cross-version querying via schema augmentation. Data Knowl. Eng. 59(2), 435–459 (2006)
Gupta, A., Mumick, I.S., Rao, J., Ross, K.: Adapting materialized views after redefinitions: Techniques and a performance study. Information Systems (26) (2001)
Levene, M., Loizou, G.: Why is the snowflake schema a good data warehouse design? Information Systems Journal 28(3), 225–240 (2003)
Nica, A., Lee, A.J., Rundensteiner, E.A.: The CSV algorithm for view synchronization in evolvable large-scale information systems. In: Schek, H.-J., Saltor, F., Ramos, I., Alonso, G. (eds.) EDBT 1998. LNCS, vol. 1377. Springer, Heidelberg (1998)
Papastefanatos, G., Anagnostou, F., Vassiliadis, P., Vassiliou, Y.: Hecataeus: A What-If Analysis Tool for Database Schema Evolution. In: CSMR (2008)
Papastefanatos, G., Vassiliadis, P., Simitsis, A., Vassiliou, Y.: What-if Analysis for Data Warehouse Evolution. In: Song, I.-Y., Eder, J., Nguyen, T.M. (eds.) DaWaK 2007. LNCS, vol. 4654. Springer, Heidelberg (2007)
Papastefanatos, G., et al.: Language Extensions for the Automation of Database Schema Evolution. In: ICEIS (2008)
The TPC BENCHMARKTM DS (April 2007), http://www.tpc.org/tpcds/spec/tpcds1.0.0.d.pdf
Vassiliadis, P., Simitsis, A., Skiadopoulos, S.: Modeling ETL activities as graphs. In: DMDW (2002)
Wedemeijer, L.: Defining Metrics for Conceptual Schema Evolution. In: FMLDO (2000)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Papastefanatos, G., Vassiliadis, P., Simitsis, A., Vassiliou, Y. (2008). Design Metrics for Data Warehouse Evolution. In: Li, Q., Spaccapietra, S., Yu, E., Olivé, A. (eds) Conceptual Modeling - ER 2008. ER 2008. Lecture Notes in Computer Science, vol 5231. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-87877-3_32
Download citation
DOI: https://doi.org/10.1007/978-3-540-87877-3_32
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-87876-6
Online ISBN: 978-3-540-87877-3
eBook Packages: Computer ScienceComputer Science (R0)