Abstract
In a conventional information mediation scenario it is assumed that all sources, including their schemas, are known before the integrated view is defined. We have found this assumption to be unrealistic for scientific information integration - new relevant sources are discovered quite frequently, and need to be integrated incrementally with an existing federation. In this paper, we address the issue of source registration, the mechanism by which a new information source “registers” its semantics with the mediator, such that not only new views can be defined with the newly joining source, but existing views can benefit from the source without any redefinition. We approach the problem in the framework of semantic (a.k.a. knowledge-based or model-based) mediation, a version of information integration where the sources cannot be integrated solely based on their own logical schema, but need additional domain knowledge at the mediator to “glue” them together. We solve the problem by introducing a process called contextualization, whereby a source specifies a set of axioms to express its own conceptual model relative to the mediator’s knowledge base. To this end, we present a context specification language CSL that allows the user to specify this mapping, and illustrate how the mediator interprets a CSL specification to update its knowledge schema and preexisting views. The examples are derived from a real-world scenario involving an ongoing collaboration with several neuroscience groups.
Work partially supported by NIH BIRN-CC 3 P41 RR08605-08S1, NSF/NPACI Neuroscience Thrust ASC-975249, and DOE SciDAC/SDM DE-FC02-01ER25486.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
A. Artale, E. Franconi, N. Guarino, and L. Pazzi. Part-whole Relations in Object-Centered Systems: An Overview. Data & Knowledge Engineering, 20:347–383, 1996.
D. Beneventano and S. Bergamaschi. Extensional Knowledge for semantic query optimization in a mediator based system. In Int. Workshop on Foundations of Models for Info. Integ. (FMII-2001), 2001.
B. Benn, Y. Chen, and I. Gringer. A rule-based strategy for schema integration in a heterogeneous information environment, 1996.
S. Bergamaschi, S. Castano, and M. Vincini. Semantic Integration of Semistructured and Structured Data Sources. SIGMOD Record, 28(1):54–59, 1999.
C. Baru, A. Gupta, B. Ludäscher, R. Marciano, Y. Papakonstantinou, P. Velikhov, and V. Chu. XML-Based Information Mediation with MIX. In Intl. Conf. on Management of Data (SIGMOD), pp. 597–599, 1999.
P. A. Bernstein, A. Y. Halevy, and R. A. Pottinger. A vision for management of complex models. SIGMOD Record, 29(4):55–63, 2000.
D. Calvanese, S. Castano, F. Guerra, D. Lembo, M. Melchiori, G. Ter-racina, D. Ursino, M. Vincini. Towards a Comprehensive Methodological Framework for Semantic Integration of Heterogeneous Data Sources. Intl. Workshop on Knowledge Representation meets Databases (KRDB), 2001.
P. N. Creasy and G. Ellis. A Conceptual Graph Approach to Conceptual Schema Integration. In Conceptual Graphs for Knowledge Representation: ICCS, pp. 126–141, Quebec, Canada, 1993.
H. Chalupsky. OntoMorph: A Translation System for Symbolic Knowledge. In Principles of Knowledge Representation and Reasoning, 2000.
L. Ekenberg and P. Johannesson. Conflictfreeness as a Basis for Schema Integration. In Conference on Information Systems and Management of Data (CISMOD), pp. 1–13, 1995.
D. Florescu, A. Levy, and A. Mendelzon. Database Techniques for the World-Wide Web: A Survey. SIGMOD Record, 27(3), September 1998.
A. Gupta, B. Ludäscher, and M. E. Martone. Knowledge-Based Integration of Neuroscience Data Sources. In Intl. Conference on Scientific and Statistical Database Management (SSDBM), 2000.
J. Grant and J. Minker. A Logic-Based Approach to Data Integration. Theory and Practice of Logic Programming (TPLP), 2(3):323–368, 2002.
H. Garcia-Molina, Y. Papakonstantinou, D. Quass, A. Rajaraman, Y. Sagiv, J. Ullman, and J. Widom. The TSIMMIS Approach to Mediation: Data Models and Languages. In Next Generation Information Technologies and Systems, 1995.
C. Goble, R. Stevens, G. Ng, S. Bechhofer, N. Paton, P. Baker, M. Peim, and A. Brass. Transparent Access to Multiple Bioinformatics Information Sources. IBM Systems Journal, 40(2):534–551, 2001.
A. Y. Halevy. Answering Queries Using Views: A Survey. VLDB Journal, 10(4):270–294, 2001.
L.M. Haas, D. Kossmann, E. L. Wimmers, and J. Yang. Optimizing Queries Across Diverse Data Sources. In Intl. Conf. on Very Large Databases (VLDB), pp. 276–285, Athens, Greece, 1997.
E. Hovy. A Standard for Large Ontologies. In Workshop on Research & Development Opportunities in Federal Information Services, 1997.
M. Kifer, G. Lausen, and J. Wu. Logical Foundations of Object-Oriented and Frame-Based Languages. Journal of the ACM, 42(4):741–843, 1995.
C.A. Knoblock, S. Minton, J.L. Ambite, P.J.M.N. Ashish, I. Muslea, A. G. Philpot, and S. Tejada. Modeling Web Sources for Information Integration. In 15 th National Conference on Artificial Intelligence, 1998.
B. Ludäscher, A. Gupta, and M. E. Martone. Model-Based Information Integration in a Neuroscience Mediator System. In Intl. Conf. on Very Large Data Bases (VLDB), pp. 639–642, Cairo, Egypt, 2000.
B. Ludäscher, A. Gupta, and M. E. Martone. Model-Based Mediation with Domain Maps. In 17th Intl. Conf. on Data Engineering (ICDE), Heidelberg, Germany, 2001.
A.Y. Levy, A. Rajaraman, and J. J. Ordille. Querying Heterogeneous Information Sources Using Source Descriptions. In Intl. Conference on Very Large Data Bases (VLDB), pp. 251–262, 1996.
D. L. McGuinness, R. Fikes, J. Rice, S. Wilder. The Chimaera Ontology Environment. 17 th Natl. Conf. on Artificial Intelligence (AAAI), 2000.
M.E. Martone, A. Gupta, M. Wong, X. Qian, G. Sosinsky, S. Lamont, B. Ludäscher, and M. H. Ellisman. A Cell-Centered Database for Electron Tomographic Data. Journal of Structural Biology, 2002. to appear; see also http://ncmir.ucsd.edu/CCDB/.
P. Mitra, G. Wiederhold, and M. L. Kersten. A Graph-Oriented Model for Articulation of Ontology Interdependencies. In Extending Database Technology, pp. 86–100, 2000.
N. F. Noy and M. A.Musen. PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment. In 17th National Conference on Artificial Intelligence (AAAI), pp. 450–455, 2000.
M. Peim, E. Franconi, N. Paton, and C. Goble. Query Processing with Description Logic Ontologies Over Object-Wrapped Databases. In Intl. Conf. on Scientific and Statistical Database Management (SSDBM), 2002.
C. Parent and S. Spaccapietra. Issues and Approaches of Database Integration. Communications of the A CM, 41(5):166–178, 1998.
L. Palopoli, G. Terracina, and D. Ursino. The System DIKE: Towards the Semi-Automatic Synthesis of Cooperative Information Systems and Data Warehouses. In Proc. ADBIS-DASFAA Symposium, pp. 108–117, 2000.
V. Ramesh and S. Ram. Integrity Constraint Integration in Heterogeneous Databases: An Enhanced Methodology for Schema Integration. Information Systems, 22(8):423–446, 1997.
D. Rosaci, G. Terracina, and D. Ursino. A Semi-automatic Technique for Constructing a Global Representation of Information Sources Having Different Formats and Structure. In DEXA, pp. 734–743, 2001.
A. P. Sheth and J. A. Larson. Federated Database Systems for Managing Distributed, Heterogeneous, and Autonomous Databases. ACM Computing Surveys, 22(3):183–236, 1990.
G. Stumme and A. Maedche. FCA-MERGE: Bottom-Up Merging of Ontologies. In IJCAI, pp. 225–234, 2001.
C. Türker. Semantic Integrity Constraints in Federated Database Schemata. DISDBIS 63, infix-Verlag, 1999. Ph.D. thesis, Fakultät für Informatik, Universität Magdeburg.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Gupta, A., Ludäscher, B., Martone, M.E. (2002). Registering Scientific Information Sources for Semantic Mediation. In: Spaccapietra, S., March, S.T., Kambayashi, Y. (eds) Conceptual Modeling — ER 2002. ER 2002. Lecture Notes in Computer Science, vol 2503. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45816-6_23
Download citation
DOI: https://doi.org/10.1007/3-540-45816-6_23
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44277-6
Online ISBN: 978-3-540-45816-6
eBook Packages: Springer Book Archive