SASMINT System for Database Interoperability in Collaborative Networks

  • Conference paper

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 4275)

Abstract

In most systems proposed to enable interoperability and collaboration among heterogeneous databases, schema matching and integration are performed manually. The SASMINT system introduced in this paper takes a (semi-)automated approach to the following tasks: 1) identifying the syntactic, semantic, and structural similarities between the donor and recipient schemas in order to resolve their heterogeneities; 2) suggesting corresponding mappings among the pairs of matched components; 3) facilitating the user interaction with the system that is needed to validate and enhance the results; and 4) generating a proposed integrated schema, together with a set of derivation rules for each of its components, to support query processing against the integrated sources. Unlike other systems, which typically apply one specific algorithm, SASMINT applies a hybrid approach to schema matching that combines a selection of algorithms from NLP and graph theory. Furthermore, SASMINT exploits the user-validated schema matching results in its semi-automatic generation of the integrated schema and its necessary derivations.
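To make the idea of a hybrid matcher more concrete, the following is a minimal sketch of how two syntactic similarity measures could be combined into a single score for pairs of schema element names. It is an illustration only, not SASMINT's implementation: the abstract does not name the algorithms or weights it uses, so the choice of edit-distance similarity and Jaccard token overlap, the equal weights, the threshold, and all function names (`combined_similarity`, `suggest_matches`) are assumptions made for this example.

```python
from typing import List, Set, Tuple


def levenshtein(a: str, b: str) -> int:
    """Edit distance via dynamic programming, keeping one row at a time."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution
        prev = curr
    return prev[-1]


def edit_similarity(a: str, b: str) -> float:
    """Normalize edit distance to a similarity in [0, 1]."""
    longest = max(len(a), len(b))
    return 1.0 if longest == 0 else 1.0 - levenshtein(a, b) / longest


def tokens(name: str) -> Set[str]:
    """Split an element name such as 'customer_name' into lowercase tokens."""
    return {t for t in name.lower().replace("-", "_").split("_") if t}


def jaccard(a: str, b: str) -> float:
    """Token-set overlap between two element names."""
    ta, tb = tokens(a), tokens(b)
    if not ta and not tb:
        return 1.0
    return len(ta & tb) / len(ta | tb)


def combined_similarity(a: str, b: str,
                        w_edit: float = 0.5, w_token: float = 0.5) -> float:
    """Weighted sum of the two metrics (weights are illustrative assumptions)."""
    return w_edit * edit_similarity(a.lower(), b.lower()) + w_token * jaccard(a, b)


def suggest_matches(donor: List[str], recipient: List[str],
                    threshold: float = 0.4) -> List[Tuple[str, str, float]]:
    """For each donor element, propose the best-scoring recipient element.

    The result is only a list of candidates; a user would still have to
    validate or correct them before any integration step.
    """
    suggestions = []
    for d in donor:
        best = max(recipient, key=lambda r: combined_similarity(d, r))
        score = combined_similarity(d, best)
        if score >= threshold:
            suggestions.append((d, best, score))
    return suggestions


if __name__ == "__main__":
    for d, r, s in suggest_matches(["customer_name", "order_date"],
                                   ["cust_name", "date_of_order", "order_id"]):
        print(f"{d} -> {r} (score {s:.2f})")
```

A weighted sum is only one way to combine metrics. A fuller matcher in the spirit of the abstract would add semantic measures (for example, a lexical database lookup) and structural, graph-based measures on top of name similarity, and it would feed the user-validated matches into the generation of the integrated schema.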

An erratum to this chapter can be found at http://dx.doi.org/10.1007/11914853_71.

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Unal, O., Afsarmanesh, H. (2006). SASMINT System for Database Interoperability in Collaborative Networks. In: Meersman, R., Tari, Z. (eds) On the Move to Meaningful Internet Systems 2006: CoopIS, DOA, GADA, and ODBASE. OTM 2006. Lecture Notes in Computer Science, vol 4275. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11914853_7

  • DOI: https://doi.org/10.1007/11914853_7

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-48287-1

  • Online ISBN: 978-3-540-48289-5

  • eBook Packages: Computer Science, Computer Science (R0)
