Abstract
In a multidatabase system that consists of object databases, the same real-world entity can be stored as objects in different databases with incompatible object identifiers. How to identify and integrate these objects representing the same entities such that (a) object duplication in the query result can be avoided, (b) information for the entity can be gathered, and (c) the specialization of multiple classes can be built is an important issue to provide a well structured global object schema and a more informative query result. In this paper, we extend our results on probabilistic query processing and joining relations on incompatible keys to solve the problem. Various data and schema conflicts such as missing data, inconsistent data and domain mismatch which may exist in classes from different databases are considered in the process of identification.
Similar content being viewed by others
References
J. Banerjee, W. Kim, and K.C. Kim. “Queries in Object-Oriented Databases,” in Proc. IEEE International Conference on Data Engineering, 1988, pp. 31–38.
C.Batini, M.Lenzerini, and S.B.Navathe, “A Comparative Analysis of Methodologies for Database Schema Integration,” ACM Computing Surveys, vol. 18, pp. 323–364, 1986.
E. Bertino, M. Damiani, P. Randi, and L. Spampinato, “An Advanced Information Management System,” in Proc. IEEE International Workshop on Research Issues on Data Engineering: Interoperability in Multidatabase Systems, 1993, pp. 61–64.
E.Bertino, M.Negri, G.Pelagatti, and L.Spampinato, “Applications of Object-Oriented Technology to the Integration of Heterogeneous Database Systems,” Distributed and Parallel Databases: An International Journal, vol 2, pp. 343–370, 1994.
Y. Breitbart, P.L. Olson, and G.R. Thompson, “Database Integration in a Distributed Heterogeneous Database System,” in Proc. IEEE International Conference on Data Engineering, 1986, pp. 301–310.
Y.Breitbart, “Multidatabase Interoperability,” SIGMOD RECORD, vol. 19, pp. 53–60, 1990.
A.Chatterjee and A.Segev, “Data Manipulation in Heterogeneous Databases,” SIGMOD RECORD, vol. 20, pp. 64–68, 1991.
A.L.P. Chen, J.L. Koh, T. Kuo, and C.C. Liu, “Schema Integration and Query Processing for Multiple Object Databases,” Journal of Integrated Computer-Aided Engineering: Special Issue on Multidatabase and Interoperable Systems, Wiley Interscience (to appear).
A.L.P. Chen, “A Localized Approach to Distributed Query Processing,” in Proc. International Conference on Extending Data Base Technology (EDBT), 1990.
E.F.Codd, “Missing Information (Applicable and Inapplicable) in Relational Databases,” SIGMOD RECORD, vol. 15, pp. 53–78, 1986.
B. Czejdo, M. Rusinkiewicz, and D.W. Embley, “An Approach to Schema Integration and Query Formulation in Federated Database Systems,” in Proc. IEEE International Conference on Data Engineering, 1987, pp. 477–484.
C.J. Date, “The Outer Join,” in Proc. 2nd International Conference on Databases, 1983.
U.Dayal and H.Y.Hwang, “View Definition and Generalization for Database Integration in a Multidatabase System,” IEEE Transactions on Software Engineering, vol. 10, pp. 628–644, 1984.
S.M.Deen, R.R.Amin, and M.C.Taylor, “Data Integration in Distributed Databases,” IEEE Transactions on Software Engineering, vol. 13, pp. 860–864, 1987.
L.G.DeMichiel, “Resolving Database Incompatibility: An Approach to Performing Relational Operations over Mismatched Domains,” IEEE Transactions on Knowledge and Data Engineering, vol. 1, pp. 485–493, 1989.
P. Drew, R. King, D. McLeod, M. Rusinkiewicz, and A. Silberschatz, “Report of the Workshop on Semantic Heterogeneity and Interoperation in Multidatabase Systems,” SIGMOD Record, pp. 47–56, 1993.
W.Gotthard, P.C.Lockemann, and A.Neufeld, “System-Guided View Integration for Object-Oriented Databases,” IEEE Transactions on Knowledge and Data Engineering, vol. 4, pp. 1–22, 1992.
S. Hayne and S. Ram, “Multi-User View Integration System (MUVIS): An Expert System for View Integration,” in Proc. IEEE International Conference on Data Engineering, 1990, pp. 402–409.
V. Kashyap and A. Sheth, “Semantics-based Information Brokering,” in Proc. International Conference on Information and Knowledge Management (CIKM), 1994.
V. Kashyap and A. Sheth, “Semantic and Schematic Similarities between Objects in Databases: A Contextbased Approach,” Technical Report TR-CS-95-001, LSDIS Lab., Dept. of CS, Univ. of GA, 1995.
M. Kaul, K. Drosten, and E.J. Neuhold, “View System: Integrating Heterogeneous Information Bases by Object-Oriented Views,” in Proc. IEEE International Conference on Data Engineering, 1992, pp. 2–10.
W. Kent, R. Ahmed, J. Albert, A. Ketabchi, and M.C. Shan, “Object Identification in Multidatabase Systems,” in Proc. the IFIP TC2/WG2.6 Conference on Semantics of Interoperable Database Systems, DS-5, 1992.
W. Kim, “A Model of Queries for Object-Oriented Databases,” in Proc. International Conference on Very Large Data Bases, 1989, pp. 423–432.
J.L. Koh and A.L.P. Chen, “Integration of Heterogeneous Object Schemas,” in Proc. International Conference on Entity-Relationship Approach, 1993, pp. 289–300.
J.L. Koh and A.L.P. Chen, “A Mapping Strategy for Querying Multiple Object Databases with a Global Object Schema,” in Proc. IEEE International Workshop on Research Issues on Data Engineering: Interoperability in Multidatabase Systems, 1995.
J.L. Koh and A.L.P. Chen, “Query Execution Strategies for Missing Data in Integrated Multiple Object Databases,” submitted for publication, 1995.
W.S. Li and C. Clifton, “Semantic Integration in Heterogeneous Databases Using Neural Networks,” in Proc. International Conference on Very Large Data Bases, 1994, pp. 1–12.
E.P. Lim, J. Srivastava, S. Prabhakar, and J. Richardson, “Entity Identification in Database Integration,” in Proc. IEEE International Conference on Data Engineering, 1993, pp. 294–301.
W.Litwin, A.Abdellatif, A.Zeroual, and B.Nicolas, “MSQL: A Multidatabase Language,” Information Science, vol. 48, pp. 59–101, 1989.
W.Litwin, L.Mark, and N.Roussopoulos, “Interoperability of Multiple Autonomous Databases,” ACM Computing Surveys, vol. 22, pp. 267–293, 1990.
C.C. Liu and A.L.P. Chen, “Object View Derivation and Object Query Transformation,” in Proc. Eighteenth Annual International Computer Software & Applications Conference (COMPSAC), 1994, pp. 157–162.
J.M.Morrissey, “Imprecise Information and Uncertainty in Information Systems,” ACM Transactions on Information Systems, vol. 8, pp. 159–180, 1990.
A.Motro, “Superviews: Virtual Integration of Multiple Databases,” IEEE Transactions on Software Engineering, vol. 13, pp. 785–798, 1987.
A. Motro, “Sources of Uncertainty in Information Systems,” in Proc. Workshop on Uncertainty Management in Information Systems, 1992, pp. 1–18.
C. Pu, “Key Equivalence in Heterogeneous Databases,” in Proc. International Workshop on Interoperability in Multidatabase Systems, 1991, pp. 314–316.
E.A. Rundensteiner, “MiltiView: A Methodology for Supporting Multiple Views in Object-Oriented Databases,” in Proc. International Conference on Very Large Data Bases, 1992, pp. 187–198.
A. Sheth, J. Larson, A. Cornelio, and S. Navathe, “A Tool for Integrating conceptual Schemas and User Views,” in Proc. IEEE International Conference on Data Engineering, 1988, pp. 176–183.
A. Sheth and V. Kashyap, “So Far (Semantically) yet So Near (Semantically),” in Proc. the IFIP TC2/WG2.6 Conference on Semantics of Interoperable Database Systems, DS-5, 1992.
S. Spaccapietra, C. Parent, and Y. Dupont, “Model Independent Assertions for Integration of Heterogeneous Schemas,” The VLDB Journal, pp. 81–126, 1992.
P.S.M. Tsai and A.L.P. Chen, “Querying Uncertain Data in Heterogeneous Databases,” in Proc. IEEE International Workshop on Research Issues on Data Engineering: Interoperability in Multidatabase Systems, 1993, pp. 161–168.
F.S.C. Tseng, A.L.P. Chen, and W.P. Yang, “Answering Heterogeneous Database Queries with Degrees of Uncertainty,” Distributed and Parallel Databases: An International Journal, pp. 281–302, 1993.
Y.R. Wang and S.E. Madnick, “The Inter-Database Instance Identification Problem in Integrating Autonomous Systems,” in Proc. IEEE International Conference on Data Engineering, 1989, pp. 46–55.
Author information
Authors and Affiliations
Additional information
Recommended by: Amit Sheth
Rights and permissions
About this article
Cite this article
Chen, A.L.P., Tsai, P.S.M. & Koh, JL. Identifying object isomerism in multidatabase systems. Distrib Parallel Databases 4, 143–168 (1996). https://doi.org/10.1007/BF00204905
Received:
Accepted:
Issue Date:
DOI: https://doi.org/10.1007/BF00204905