Skip to main content

Do Rule-Based Approaches Still Make Sense in Logical Data Warehouse Design?

  • Conference paper
Advances in Databases and Information Systems (ADBIS 2014)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 8716))

Abstract

As any product design, data warehouse applications follow a well-known life-cycle. Historically, it included only the physical phase, and had been gradually extended to include the conceptual and the logical phases. The management of phases either internally or intranally is dominated by rule-based approaches. More recently, a cost-based approach has been proposed to substitute rule-based approaches in the physical design phase in order to optimize queries. Unlike the traditional rule-based approach, it explores a huge search space of solutions (e.g., query execution plans), and then based on a cost-model, it selects the most suitable one(s). On the other hand, the logical design phase is still managed by rule-based approaches applied on the conceptual schema. In this paper, we propose to propagate the cost-based vision on the logical phase. As a consequence, the selection of a logical design of a given data warehouse schema becomes an optimization problem with a huge space search generated thanks to correlations (e.g. hierarchies) between data warehouse concepts. By the means of a cost model estimating the overall query processing cost, the best logical schema is selected. Finally, a case study using the Star Schema Benchmark is presented to show the effectiveness of our proposal.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Agrawal, R., Srikant, R.: Fast algorithms for mining association rules in large databases. In: VLDB, pp. 487–499 (1994)

    Google Scholar 

  2. Agrawal, S., Chaudhuri, S., Narasayya, V.R.: Automated selection of materialized views and indexes in sql databases. In: VLDB, pp. 496–505 (2000)

    Google Scholar 

  3. Anderlik, S., Neumayr, B., Schrefl, M.: Using domain ontologies as semantic dimensions in data warehouses. In: Atzeni, P., Cheung, D., Ram, S. (eds.) ER 2012. LNCS, vol. 7532, pp. 88–101. Springer, Heidelberg (2012)

    Chapter  Google Scholar 

  4. Becker, L., Güting, R.H.: Rule-based optimization and query processing in an extensible geometric database system. ACM Trans. Database Syst. 17(2), 247–303 (1992)

    Article  Google Scholar 

  5. Bellatreche, L., Boukhalfa, K., Richard, P., Woameno, K.Y.: Referential horizontal partitioning selection problem in data warehouses: Hardness study and selection algorithms. IJDWM 5(4), 1–23 (2009)

    Google Scholar 

  6. Bohannon, P., Fan, W., Geerts, F., Jia, X., Kementsietsidis, A.: Conditional functional dependencies for data cleaning. In: ICDE, pp. 746–755 (2007)

    Google Scholar 

  7. Brown, P.G., Hass, P.J.: Bhunt: Automatic discovery of fuzzy algebraic constraints in relational data. In: VLDB, pp. 668–679 (2003)

    Google Scholar 

  8. Codd, E.F.: A relational model of data for large shared data banks. Commun. ACM 13(6), 377–387 (1970)

    Article  MATH  Google Scholar 

  9. Golfarelli, M., Rizzi, S.: Data warehouse testing: A prototype-based methodology. Information and Software Technology 53(11), 1183–1198 (2011)

    Article  Google Scholar 

  10. Herbst, H.: Business Rule-Oriented Conceptual Modeling. Contributions to Management Science. Physica-Verlag HD (1997)

    Google Scholar 

  11. Hong, M., Riedewald, M., Koch, C., Gehrke, J., Demers, A.: Rule-based multi-query optimization. In: EDBT, pp. 120–131. ACM, New York (2009)

    Chapter  Google Scholar 

  12. Kimura, H., Huo, G., Rasin, A., Madden, S., Zdonik, S.: Coradd: Correlation aware database designer for materialized views and indexes. PVLDB 3(1), 1103–1113 (2010)

    Google Scholar 

  13. Marchi, F.D., Hacid, M.-S., Petit, J.-M.: Some remarks on self-tuning logical database design. In: ICDE Workshops, p. 1219 (2005)

    Google Scholar 

  14. Martyn, T.: Reconsidering multi-dimensional schemas. SIGMOD Rec. 33(1), 83–88 (2004)

    Article  Google Scholar 

  15. Petit, J.-M., Toumani, F., Boulicaut, J.-F., Kouloumdjian, J.: Towards the reverse engineering of denormalized relational databases. In: ICDE, pp. 218–227 (1996)

    Google Scholar 

  16. Ram, S., Khatri, V.: A comprehensive framework for modeling set-based business rules during conceptual database design. Inf. Syst. 30(2), 89–118 (2005)

    Article  Google Scholar 

  17. Rasdorf, W., Ulberg, K., Baugh Jr., J.: A structure-based model of semantic integrity constraints for relational data bases. In: Proc. of Engineering with Computers, vol. 2, pp. 31–39 (1987)

    Google Scholar 

  18. Stöhr, T., Märtens, H., Rahm, E.: Multi-dimensional database allocation for parallel data warehouses. In: VLDB, pp. 273–284 (2000)

    Google Scholar 

  19. Tsichritzis, D., Klug, A.C.: The ansi/x3/sparc dbms framework report of the study group on dabatase management systems. Inf. Syst. 3(3), 173–191 (1978)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Bouarar, S., Bellatreche, L., Jean, S., Baron, M. (2014). Do Rule-Based Approaches Still Make Sense in Logical Data Warehouse Design?. In: Manolopoulos, Y., Trajcevski, G., Kon-Popovska, M. (eds) Advances in Databases and Information Systems. ADBIS 2014. Lecture Notes in Computer Science, vol 8716. Springer, Cham. https://doi.org/10.1007/978-3-319-10933-6_7

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-10933-6_7

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-10932-9

  • Online ISBN: 978-3-319-10933-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics