Abstract
In most suggested systems aiming to enable interoperability and collaboration among heterogeneous databases, schema matching and integration is performed manually. The SASMINT system introduced in this paper proposes a (semi-) automated approach to tackle the following: 1) identification of the syntactic/semantic/structural similarities between the donor and recipient schemas to resolve their heterogeneities, 2) suggestion of corresponding mappings among the pairs of matched components, 3) facilitation of user-interaction with the system, necessary for validation/enhancement of results, and 4) generation of a proposed integrated schema, and a set of derivation rules for each of its components to support query processing against integrated sources. Unlike other systems that typically apply one specific algorithm, SASMINT applies a hybrid approach for schema matching that combines a selection of algorithms from NLP and graph theory. Furthermore, SASMINT exploits the user-validated schema matching results in its semi-automatic generation of the integrated schema and its necessary derivations.
An erratum to this chapter can be found at http://dx.doi.org/10.1007/11914853_71.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Camarinha-Matos, L.M., Afsarmanesh, H., Ollus, M.: ECOLEAD: A Holistic Approach to Creation and Management of Dynamic Virtual Organizations. In: Proc. of PRO-VE 2005, pp. 3–16 (2005)
Afsarmanesh, H., Camarinha-Matos, L.M.: A Framework for Management of Virtual Organizations Breeding Environments. In: Proc. of PRO-VE 2005, pp. 35–48 (2005)
Sheth, A., Larson, J.: Federated Database Systems for Managing Distributed, Heterogeneous, and Autonomous Databases. ACM Computing Surveys 22(3), 183–236 (1990)
Tuijnman, F., Afsarmanesh, H.: Management of Shared Data in Federated Cooperative PEER Environment. International Journal of Intelligent & Cooperative Information Systems 2(4), 451–473 (1993)
Unal, O., Afsarmanesh, H.: Interoperability in Collaborative Network of Biodiversity Organizations. In: Proc. of PRO-VE 2006 (accepted for Publication, 2006)
Hammer, J., McLeod, D.: An Approach to Resolving Semantic Heterogeneity in a Federation of Autonomous, Heterogeneous Database Systems. International Journal of Intelligent & Cooperative Information Systems 2(1), 51–83 (1993)
Bergamaschi, S., et al.: A Semantic Approach to Information Integration: the MOMIS project. In: Proc. of Sesto Convegno della Associazione Italiana per l’Intelligenza Artificiale, AI*IA 1998 (1998)
Bayardo, R.J., et al.: InfoSleuth: Agent-Based Semantic Integration of Information in Open and Dynamic Environments. In: Proc. of ACM SIGMOD International Conference on Management of Data (1997)
Arens, Y., Knoblock, C.A., Shen, W.-M.: Query Reformulation for Dynamic Information Integration. Journal of Intelligent Information Systems, 99–130 (1996)
Mena, E., et al.: OBSERVER: An Approach for Query Processing in Global Information Systems based on Interoperation across Pre-existing Ontologies. Distributed and Parallel Databases Journal 8(2), 223–271 (2000)
Madhavan, J., Bernstein, P.A., Rahm, E.: Generic Schema Matching with Cupid. In: Proc. of VLDB, pp. 49–58 (2001)
Do, H.H., Rahm, E.: COMA - A System for Flexible Combination of Schema Matching Approaches. In: Proc. of VLDB, pp. 610–621 (2002)
Melnik, S., Garcia-Molina, H., Rahm, E.: Similarity Flooding: A Versatile Graph Matching Algorithm and its Application to Schema Matching. In: Proc. of ICDE (2002)
Mitra, P., Wiederhold, G., Decker, S.: A scalable framework for the interoperation of information sources. In: Proc. of. International Semantic Web Working Symposium (2001)
Doan, A., et al.: Learning to Map between Ontologies on the Semantic Web. In: Proc. of World-Wide Web Conf. (WWW-2002) (2002)
Miller, R.J., Haas, L.M., Hernandez, M.A.: Schema Mapping as Query Discovery. In: Proc. of VLDB, pp. 77–88 (2000)
Embley, D.W., Xu, L., Ding, Y.: Automatic direct and indirect schema mapping: Experiences and lessons learned. ACM SIGMOD Record 33(4), 14–19 (2004)
Bernstein, P.A., et al.: Industrial-Strength Schema Matching. ACM SIGMOD Record 33(4), 38–43 (2004)
Rahm, E., Do, H.-H., Maßmann, S.: Matching Large XML Schemas. ACM SIGMOD Record 33(4), 26–31 (2004)
Levenshtein, V.I.: Binary codes capable of correcting deletions, insertions, and reversals. Cybernetics and Control Theory 10(8), 707–710 (1966)
Monge, A.E., Elkan, C.: The Field Matching Problem: Algorithms and Applications. In: Second International Conference on Knowledge Discovery and Data Mining, pp. 267–270 (1996)
Jaro, M.A.: Probabilistic linkage of large public health. Statistics in Medicine 14, 491–498 (1995)
Salton, G., Yang, C.S.: On the specification of term values in automatic indexing. Journal of Documentation (29), 351–372 (1973)
Jaccard, P.: The distribution of flora in the alpine zone. The New Phytologist 11(2), 37–50 (1912)
Cleverdon, C.W., Keen, E.M.: Factors determining the performance of indexing systems, volume 2: test results, Aslib Cranfield Research Project. Cranfield Institute of Technology (1966)
Rijsbergen, C.J.v.: Information Retrieval. Butterworths, London (1979)
Wu, Z., Palmer, M.: Verb Semantics and Lexical Selection. In: 32nd Annual Meeting of the Association for Computational Linguistics (1994)
Fellbaum, C.: An Electronic Lexical Database. MIT press, Cambridge (1998)
Lesk, M.: Automatic Sense Disambiguation Using Machine Readable Dictionaries: How to Tell a Pine Code from an Ice Cream Cone. In: Proc. of 5th International Conference on Systems Documentation, pp. 24–26 (1986)
Blondel, V., et al.: A Measure of Similarity between Graph Vertices: Applications to Synonym Extraction and Web Searching. Journal of SIAM Review 46(4), 647–666 (2004)
Afsarmanesh, H., et al.: The PEER Information Management Language User Manual. Technical Report CS-94-14, Department of Computer Systems, University of Amsterdam (1994)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Unal, O., Afsarmanesh, H. (2006). SASMINT System for Database Interoperability in Collaborative Networks. In: Meersman, R., Tari, Z. (eds) On the Move to Meaningful Internet Systems 2006: CoopIS, DOA, GADA, and ODBASE. OTM 2006. Lecture Notes in Computer Science, vol 4275. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11914853_7
Download citation
DOI: https://doi.org/10.1007/11914853_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-48287-1
Online ISBN: 978-3-540-48289-5
eBook Packages: Computer ScienceComputer Science (R0)