Skip to main content

QoM: Qualitative and Quantitative Schema Match Measure

  • Conference paper
Conceptual Modeling - ER 2003 (ER 2003)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2813))

Included in the following conference series:

Abstract

Integration of multiple heterogeneous data sources continues to be a critical problem for many application domains and a challenge for researchers world-wide. Schema matching, a fundamental aspect of integration, has been a well-studied problem. However researchers have, for the most part, concentrated on the development of different schema matching algorithms, and their performance with respect to the number of matches produced. To the best of our knowledge, current research in schema matching does not address the issue of quality of matching. We believe that quality of match is an important measure that can not only provide a basis for comparing multiple matches, but can also be used as a metric to compare as well as optimize existing match algorithms. In this paper, we define the Quality of Match (QoM) metric, and provide qualitative and quantitative analysis techniques to evaluate the QoM of two given schemata. In particular, we introduce a taxonomy of schema matches as a qualitative analysis technique, and a weight-based match model that in concert with the taxonomy provides a quantitative measure of the QoM. We show, via examples, how QoM can be used to distinguish the “goodness” of one match in comparison with other matches.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Bergamaschi, S., Castano, S., Vincini, M., Beneventano, D.: Semantic integration of heterogeneous information sources. Data and Knowledge Engineering 36(3), 215–249 (2001)

    Article  MATH  Google Scholar 

  2. Bright, M.W., Hurson, A.R., Pakzad, S.H.: Automated Resolution of Semantic Heterogeneity in Multidatabases. TODS 19(2), 212–253 (1994)

    Article  Google Scholar 

  3. Berlin, J., Motro, A.: AutoPlex: Automated Discovery of Content for Virtual Databases. In: Batini, C., Giunchiglia, F., Giorgini, P., Mecella, M. (eds.) CoopIS 2001. LNCS, vol. 2172, pp. 108–122. Springer, Heidelberg (2001)

    Chapter  Google Scholar 

  4. Booch, G.: Object-Oriented Analysis and Design. Benjamin Cummings Pub. (1994)

    Google Scholar 

  5. Conrad, R., Scheffner, D., Freitag, J.C.: XML conceptual modeling using UML. In: Laender, A.H.F., Liddle, S.W., Storey, V.C. (eds.) ER 2000. LNCS, vol. 1920, pp. 558–571. Springer, Heidelberg (2000)

    Chapter  Google Scholar 

  6. Doan, A.H., Domingos, P., Halevy, A.: Reconciling Schemas of Disparate Data Sources: A Machine-Learning Approach. In: sigmod (2001)

    Google Scholar 

  7. Do, H.H., Rahm, E.: COMA-ASystem for Flexible Combination of Schema Matching Approaches. In: vldb (2002)

    Google Scholar 

  8. Haas, L.M., Miller, R.J., Niswonger, B., Roth, M.T., Schwarz, P., Wimmers, E.L.: Transforming Heterogeneous Data with Database Middleware: Beyond Integration. IEEE Data Engineering Bulletin 22(1), 31–36 (1999)

    Google Scholar 

  9. Jeng, J.-J., Cheng, B.H.C.: Specification matching for software reuse:A foundation*. In: Proceedings of the 1995 Symposium on Software reusability. ACM Press, New York (1995)

    Google Scholar 

  10. Madhavan, J., Bernstein, P., Rahm, E.: Generic Schema Matching with Cupid. In: vldb, pp. 49–58 (2001)

    Google Scholar 

  11. Miller, R.J., Ioannidis, Y.E., Ramakrishnan, R.: The Use of Information Capacity in Schema Integration and Translation. In: Int. Conference on Very Large Data Bases, pp. 120–133 (1993)

    Google Scholar 

  12. Rosenthal, A., Reiner, D.: Theoretically Sound Transformations for Practical Database Design. In: March, S.T. (ed.) Entity-Relationship Approach, Proceedings of the Sixth International Conference on Entity-Relationship Approach, New York, USA, November 9-11, pp. 115–131 (1987)

    Google Scholar 

  13. Sanders, G.L.: Data Modeling. Boyd and Fraser Publishing Company (1995)

    Google Scholar 

  14. Zaremski, A.M., Wing, J.M.: Signature matching: a tool for using software libraries. ACM Transactions on Software Engineering and Methodology (TOSEM), ACM Press (1995)

    Google Scholar 

  15. Zaremski, A.M., Wing, J.M.: Specification matching of software components. ACM Transactions on Software Engineering and Methodology (TOSEM), ACM Press (1997)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2003 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Tansalarak, N., Claypool, K. (2003). QoM: Qualitative and Quantitative Schema Match Measure. In: Song, IY., Liddle, S.W., Ling, TW., Scheuermann, P. (eds) Conceptual Modeling - ER 2003. ER 2003. Lecture Notes in Computer Science, vol 2813. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-39648-2_6

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-39648-2_6

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-20299-8

  • Online ISBN: 978-3-540-39648-2

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics