Abstract
Star schemas describe the structure and properties of multidimensional sources such as data marts and data warehouses. They have a simple structure and a predictable topology. We propose StarMod a representation of Star schema model described in UML and infer its instances from relational schemas. StarMod includes a comprehensive set of properties specific to multidimensional data with a view to its application in matching Star schemas. This paper demonstrates that in comparison to using the relational model, the quality of matching between Star schemas is improved if they are described using a more precise model such as StarMod, even if these Star properties are inferred from the relational schema. We demonstrate that StarMod can be also effective for matching arbitrary non-Star relational schemas.
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
http://www.omg.org/spec/CWM/ (2003)
Abelló, A., Samos, J., Saltor, F.: YAM2: a multidimensional conceptual model extending UML. Information Systems 31, 541–567 (2005)
Banek, M., Vrdoljak, B., Tjoa, M.: Automating the schema matching process for hetrogeneous data warehouses. Int. J. of Data Warehousing and Mining 4(4), 1–21 (2008)
Bernstein, P., Melnik, S., Petropoulos, M., Quix, C.: Industrial-strength schema matching. SIGMOD Record 33(4), 38–43 (2004)
Chen, Y.T., Hsu, P.Y.: A grain preservation translation algorithm: From ER diagram to multidimensional model. Inf. Sci. 177, 3679–3695 (2007)
Do, H.H., Rahm, E.: COMA: a system for flexible combination of schema matching approaches. In: Proceedings of the 28th VLDB, pp. 610–621 (2002)
Franconi, E., Sattler, U.: A datawarehouse conceptual datamodel for multidimensional aggregation: a preliminary report. In: Proceedings of the Workshop on Design and Management of Datawarehouses (DMDW 1999), Heidelberg, Germany (1999)
Giovinazzo, W.: Object Oriented Data Warehouse Design, Building a Star Schema. Prentice Hall, Inc., New Jersey (2000)
Haas, L., Hernández, M., Ho, H., Popa, L., Roth, M.: Clio grows up: from research prototype to industrial tool. In: Proceedings of ACM SIGMOD, pp. 805–810 (2005)
Imhoff, C., Galemmo, N., Geiger, J.: Mastering Data Warehouse Design. Wiley Publishing, Inc., Indianapolis (2003)
Kamble, A.S.: A conceptual model for multidimensional data. In: Proceedings of the Fifth Asia-Pacific conference on Conceptual Modelling, pp. 29–38 (2008)
Kimball, R., Ross, M.: The Data Warehouse Toolkit. Wiley Publishing, Inc., Indianapolis (2000)
de Laborda, C.P., Conrad, S.: Relational.OWL: a data and schema representation format based on OWL. In: Proceedings of the 2nd Asia-Pacific Conference on Conceptual Modelling, pp. 89–96 (2005)
Li, L., Yang, L.: Automatic schema matching for data warehouses. In: 5th World Congress on Intelligent Control and Automation, pp. 3939–3943 (2004)
Luján-Mora, S., Trujillo, J., Song, I.Y.: A UML profile for multidimensional modeling in data warehouses. Data Knowl. Eng. 59(3), 725–769 (2006)
Melnik, S., Garcia-Molina, H., Rahm, E.: Similarity flooding: a versatile graph matching algorithm and its application to schema matching. In: Proceedings of 18th International Conference on Data Engineering, pp. 117–118 (2002)
Pavel, S., Euzenat, J.: A survey of schema-based matching approaches. Tech. rep., Informaticae Telecomunicazioni, University of Trento (2004)
Riazati, D., Thom, J.A., Zhang, X.: Inferring aggregation hierarchies for integration of data marts. In: Bringas, P.G., Hameurlain, A., Quirchmayr, G. (eds.) DEXA 2010. LNCS, vol. 6262, pp. 96–110. Springer, Heidelberg (2010)
Torlone, R.: Two approaches to the integration of heterogeneous data warehouses. Distrib. Parallel Databases 23(1), 69–97 (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Riazati, D., Thom, J.A. (2011). Matching Star Schemas. In: Hameurlain, A., Liddle, S.W., Schewe, KD., Zhou, X. (eds) Database and Expert Systems Applications. DEXA 2011. Lecture Notes in Computer Science, vol 6861. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23091-2_36
Download citation
DOI: https://doi.org/10.1007/978-3-642-23091-2_36
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-23090-5
Online ISBN: 978-3-642-23091-2
eBook Packages: Computer ScienceComputer Science (R0)