Abstract
As any product design, data warehouse applications follow a well-known life-cycle. Historically, it included only the physical phase, and had been gradually extended to include the conceptual and the logical phases. The management of phases either internally or intranally is dominated by rule-based approaches. More recently, a cost-based approach has been proposed to substitute rule-based approaches in the physical design phase in order to optimize queries. Unlike the traditional rule-based approach, it explores a huge search space of solutions (e.g., query execution plans), and then based on a cost-model, it selects the most suitable one(s). On the other hand, the logical design phase is still managed by rule-based approaches applied on the conceptual schema. In this paper, we propose to propagate the cost-based vision on the logical phase. As a consequence, the selection of a logical design of a given data warehouse schema becomes an optimization problem with a huge space search generated thanks to correlations (e.g. hierarchies) between data warehouse concepts. By the means of a cost model estimating the overall query processing cost, the best logical schema is selected. Finally, a case study using the Star Schema Benchmark is presented to show the effectiveness of our proposal.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Agrawal, R., Srikant, R.: Fast algorithms for mining association rules in large databases. In: VLDB, pp. 487–499 (1994)
Agrawal, S., Chaudhuri, S., Narasayya, V.R.: Automated selection of materialized views and indexes in sql databases. In: VLDB, pp. 496–505 (2000)
Anderlik, S., Neumayr, B., Schrefl, M.: Using domain ontologies as semantic dimensions in data warehouses. In: Atzeni, P., Cheung, D., Ram, S. (eds.) ER 2012. LNCS, vol. 7532, pp. 88–101. Springer, Heidelberg (2012)
Becker, L., Güting, R.H.: Rule-based optimization and query processing in an extensible geometric database system. ACM Trans. Database Syst. 17(2), 247–303 (1992)
Bellatreche, L., Boukhalfa, K., Richard, P., Woameno, K.Y.: Referential horizontal partitioning selection problem in data warehouses: Hardness study and selection algorithms. IJDWM 5(4), 1–23 (2009)
Bohannon, P., Fan, W., Geerts, F., Jia, X., Kementsietsidis, A.: Conditional functional dependencies for data cleaning. In: ICDE, pp. 746–755 (2007)
Brown, P.G., Hass, P.J.: Bhunt: Automatic discovery of fuzzy algebraic constraints in relational data. In: VLDB, pp. 668–679 (2003)
Codd, E.F.: A relational model of data for large shared data banks. Commun. ACM 13(6), 377–387 (1970)
Golfarelli, M., Rizzi, S.: Data warehouse testing: A prototype-based methodology. Information and Software Technology 53(11), 1183–1198 (2011)
Herbst, H.: Business Rule-Oriented Conceptual Modeling. Contributions to Management Science. Physica-Verlag HD (1997)
Hong, M., Riedewald, M., Koch, C., Gehrke, J., Demers, A.: Rule-based multi-query optimization. In: EDBT, pp. 120–131. ACM, New York (2009)
Kimura, H., Huo, G., Rasin, A., Madden, S., Zdonik, S.: Coradd: Correlation aware database designer for materialized views and indexes. PVLDB 3(1), 1103–1113 (2010)
Marchi, F.D., Hacid, M.-S., Petit, J.-M.: Some remarks on self-tuning logical database design. In: ICDE Workshops, p. 1219 (2005)
Martyn, T.: Reconsidering multi-dimensional schemas. SIGMOD Rec. 33(1), 83–88 (2004)
Petit, J.-M., Toumani, F., Boulicaut, J.-F., Kouloumdjian, J.: Towards the reverse engineering of denormalized relational databases. In: ICDE, pp. 218–227 (1996)
Ram, S., Khatri, V.: A comprehensive framework for modeling set-based business rules during conceptual database design. Inf. Syst. 30(2), 89–118 (2005)
Rasdorf, W., Ulberg, K., Baugh Jr., J.: A structure-based model of semantic integrity constraints for relational data bases. In: Proc. of Engineering with Computers, vol. 2, pp. 31–39 (1987)
Stöhr, T., Märtens, H., Rahm, E.: Multi-dimensional database allocation for parallel data warehouses. In: VLDB, pp. 273–284 (2000)
Tsichritzis, D., Klug, A.C.: The ansi/x3/sparc dbms framework report of the study group on dabatase management systems. Inf. Syst. 3(3), 173–191 (1978)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Bouarar, S., Bellatreche, L., Jean, S., Baron, M. (2014). Do Rule-Based Approaches Still Make Sense in Logical Data Warehouse Design?. In: Manolopoulos, Y., Trajcevski, G., Kon-Popovska, M. (eds) Advances in Databases and Information Systems. ADBIS 2014. Lecture Notes in Computer Science, vol 8716. Springer, Cham. https://doi.org/10.1007/978-3-319-10933-6_7
Download citation
DOI: https://doi.org/10.1007/978-3-319-10933-6_7
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-10932-9
Online ISBN: 978-3-319-10933-6
eBook Packages: Computer ScienceComputer Science (R0)