Abstract
During data warehouse schema design, designers often encounter how to model big dimensions that typically contain a large number of attributes and records. To investigate effective approaches for modeling big dimensions is necessary in order to achieve better query performance, with respect to response time. In most cases, the big dimension modeling process is complicated since it usually requires accurate description of business semantics, multiple design revisions and comprehensive testings. In this paper, we present the design methods for modeling big dimensions, which include horizontal partitioning, vertical partitioning and their hybrid. We formalize the design methods, and propose an algorithm that describes the modeling process from an OWL ontology to a data warehouse schema. In addition, this paper also presents an effective ontology-based tool to automate the modeling process. The tool can automatically generate the data warehouse schema from the ontology of describing the terms and business semantics for the big dimension. In case of any change in the requirements, we only need to modify the ontology, and re-generate the schema using the tool. This paper also evaluates the proposed methods based on sample sales data mart.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Abello, A., Romero, O.: Using Ontologies to Discover Facts IDs. In: DOLAP, pp. 3–10 (2010)
Abadi, D.J., Madden, S.R., Hachem, N.: Column-Store vs. Row-Stores: How Different Are They Really? In: SIGMOD, pp. 1–14 (2008)
Agrawal, S., Narasayya, V.R., Yang, B.: Integrating Vertical and Horizontal Partitioning into Automated Physical Database Design. In: SIGMOD, pp. 359–370 (2004)
Astrova, I., Korda, N., Kalja, A.: Storing OWL Ontologies in SQL Relational Databases. ECSE 1(4), 167–172 (2007)
Eberhart, A.: Automatic Generation of Java/SQL based Inference Engines from RDF Schema and RuleML. In: Horrocks, I., Hendler, J. (eds.) ISWC 2002. LNCS, vol. 2342, pp. 102–116. Springer, Heidelberg (2002)
Gali, A., Chen, C.X., Claypool, K.T., Uceda-Sosa, R.: From Ontology to Relational Databases. In: ER Workshops, pp. 278–289 (2004)
Costa, M., Madeira, H.: Handling Big Dimensions in Distributed Data Warehouses using the DWS Technique. In: DOLAP, pp. 31–37 (2004)
Imhoff, C., Galemmo, N., Geiger, J.G.: Mastering Data Warehouse Design: Relational and Dimensional Techniques, pp. 285–317. John Wiley and Sons, NY (2003)
Kalyanpur, A., Pastor, D.J., Battle, S., Padget, J.: Automatic Mapping of OWL Ontologies into Java. In: SEKE, pp. 98–103 (2004)
Liu, X., Thomsen, C., Pedersen, T.B.: 3XL: Supporting Efficient Operations on Very Large OWL Lite Triple-stores. Information Systems 36(4), 765–781 (2011)
Moody, D.L., Kortink, M.A.R.: From Er Models to Dimensional Models II: Advanced Design Issues. Business Intelligence Journal 8(4) (2003)
Navathe, S.: A Mixed Fragmentation Methodology For Initial Distributed Database Design. Journal of Computer and Software Engineering 3(4), 395–426 (1995)
Owl Description, www.w3.org/TR/2004/REC-owl-features-20040210 (September 20, 2012)
Romero, O., Abello, A.: Automating Multidimensional Design from Ontologies. In: DOLAP, pp. 1–8 (2007)
Silverston, L., Inmon, W.H., Graziano, K.: The Data Model Resource Book: A Library of Logical Data Models and Data Warehouse Designs. John Wiley and Sons, NY (1997)
TPC-H, http://tpc.org/tpch/ (September 20, 2012)
Vysniauskas, E., Nemuraite, L.: Transforming Ontology Representation from OWL to Relational Database. Information Technology and Control 35(3A), 333–343 (2006)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Liu, X., Iftikhar, N. (2013). Ontology-Based Big Dimension Modeling in Data Warehouse Schema Design. In: Abramowicz, W. (eds) Business Information Systems. BIS 2013. Lecture Notes in Business Information Processing, vol 157. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-38366-3_7
Download citation
DOI: https://doi.org/10.1007/978-3-642-38366-3_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-38365-6
Online ISBN: 978-3-642-38366-3
eBook Packages: Computer ScienceComputer Science (R0)