Abstract
Integration of multiple heterogeneous data sources continues to be a critical problem for many application domains and a challenge for researchers worldwide. Schema matching, a fundamental aspect of integration, has been a well-studied problem. However, researchers have, for the most part, concentrated on developing different schema matching algorithms and on their performance with respect to the number of matches produced. To the best of our knowledge, current research in schema matching does not address the quality of matching. We believe that quality of match is an important measure that can not only provide a basis for comparing multiple matches, but can also serve as a metric to compare and optimize existing match algorithms. In this paper, we define the Quality of Match (QoM) metric, and provide qualitative and quantitative analysis techniques to evaluate the QoM of two given schemata. In particular, we introduce a taxonomy of schema matches as a qualitative analysis technique, and a weight-based match model that, in concert with the taxonomy, provides a quantitative measure of the QoM. We show, via examples, how QoM can be used to distinguish the “goodness” of one match in comparison with other matches.
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Tansalarak, N., Claypool, K. (2003). QoM: Qualitative and Quantitative Schema Match Measure. In: Song, IY., Liddle, S.W., Ling, TW., Scheuermann, P. (eds) Conceptual Modeling - ER 2003. ER 2003. Lecture Notes in Computer Science, vol 2813. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-39648-2_6
DOI: https://doi.org/10.1007/978-3-540-39648-2_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-20299-8
Online ISBN: 978-3-540-39648-2
eBook Packages: Springer Book Archive