Abstract
This paper presents an approach to query decomposition in a multidatabase environment. The unique aspect of this approach is that it is based on performing transformations over an object algebra that can be used as the basis for a global query language. In the paper, we first present our multidatabase environment and semantic framework, where a global conceptual schema based on the Object Data Management Group standard encompasses the information from heterogeneous data sources that include relational databases as well as object-oriented databases and flat file sources. The meta-data about the global schema is enhanced with information about virtual classes as well as virtual relationships and inheritance hierarchies that exist between multiple sources. The AQUA object algebra is used as the formal foundation for manipulation of the query expression over the multidatabase. AQUA is enhanced with distribution operators for dealing with data distribution issues. During query decomposition we perform an extensive analysis of traversals for path expressions that involve virtual relationships and hierarchies for access to several heterogeneous sources. The distribution operators defined in algebraic terms enhance the global algebra expression with semantic information about the structure, distribution, and localization of the data sources relevant to the solution of the query. By using an object algebra as the basis for query processing, we are able to define algebraic transformations and exploit rewriting techniques during the decomposition phase. Our use of an object algebra also provides a formal and uniform representation for dealing with an object-oriented approach to multidatabase query processing. As part of our query processing discussion, we include an overview of a global object identification approach for relating semantically equivalent objects from diverse data sources, illustrating how knowledge about global object identity is used in the decomposition and assembly processes.
Similar content being viewed by others
References
R. Ahmed, P. De Smedt, W. Du, W. Kent, M. Ketabchi, W. Litwin, A. Rafii, and M. Shan, “The pegasus heterogeneous multidatabase system,” IEEE Computer, pp. 19–27, 1991.
E. Bertino, “Integration of heterogeneous data repositories by using object-oriented views,” in Proceedings of the First International Workshop on Interoperability in Multidatabase Systems, Kyoto, Japan, IEEE Computer Society, April 1991, pp. 22–29.
E. Bertino, M. Negri, G. Pelagatti, and L. Sbattella, “Integration of heterogeneous database applications through an object-oriented interface,” Information Systems, vol. 14, no.5, pp. 407–420, 1989.
A. Bouguettaya, B. Benatallah, and A. Elmagarmid, “An overview of multidatabase systems: Past and present,” in [16], pp. 1–32.
M. Bright, A. Hurson, and S. Pakzad, “A taxonomy and current issues in multidatabase systems,” IEEE Computer, vol. 25, no.3, pp. 50–60, 1992.
O. Bukhres and A.K. Elmagarmid (Eds.), Object-Oriented Multidatabase Systems, A Solution for Advanced Applications, Prentice Hall: Englewood Cliffs, NJ, 1996.
D. Callen and S.D. Urban, “Consolidation of query results in a multidatabase environment: An object oriented approach,” in Proceedings of the IEEE Computer Society Computer Software and Applications Conference, Seoul, Korea, IEEE Computer Society Press, Aug. 1996, pp. 367–372.
M. Castellanos, F. Saltor, and M. Garcia-Solaco, “A canonical model for the interoperability among object-oriented and relational databases,” in [46], pp. 309–314.
R. Cattell and D.K. Barry (Eds.), The Object Database Standard ODMG-2.0, Morgan-Kaufmann: San Mateo, CA, 1997.
R. Cattell, D.K. Barry, M. Berler, J. Eastman, D. Jordan, C. Russell, O. Schadow, T. Torsten Stanienda, and F. Velez, The Object Database Standard ODMG-3.0, Morgan-Kaufmann: San Mateo, CA, 2000.
M. Cherniack, S.B. Zdonik, and M.H. Nodine, “To form a more perfect union (intersection, difference),” in Proceedings of the Database Programming Languages Conference, Springer-Verlag: Berlin, 1995, pp. 1–14.
B. Demuth, A. Geppert, and T. Gorchs, “Algebraic query optimization in the CoOMS structurally object-oriented database system,” in [17], pp. 121–142.
A. Dogac, C. Dengi, E. Kilic, G. Ozhan, S. Nural, C. Evrendilek, U. Halici, B. Arpinar, P. Koukal, N. Kesim, and S. Mancuhan, “METU interoperable database system,” Technical Report from Middle East Technical University, TR-6-1, June 1995.
F. Eliassen, “Managing identity in global object view,” in Proceedings of the Fifth International Workshop on Research Issues on Data Engineering: Distributed Object Management, IEEE Computer Society Press, 1995, pp. 70–77.
A. Elmagarmid, W. Du, and R. Ahmed, “Local autonomy and its effects on multidatabase systems,” in [16], pp. 33–52.
A. Elmagarmid, M. Rusinkiewicz, and A. Sheth (Eds.), Management of Heterogeneous and Autonomous Database Systems, Morgan Kaufmann: San Francisco, CA, 1999.
J.C. Freytang, D. Maier, and G. Vossen. (Eds.), Query Processing for Advanced Database Systems, Morgan Kaufmann: San Mateo, CA, 1994.
H. Garcia-Molina, J. Hammer, K. Ireland, Y. Papakonstantinou, J. Ullman, and J. Widom, “Integrating and accessing heterogeneous information systems in TSIMMIS,” in Proceedings of the AAAI Symposium on Information Gathering, Stanford, California, March 1995, pp. 61–64.
H. Garcia-Molina, Y. Papakonstantinou, D. Quass, A. Rajaraman, Y. Sagiv, J. Ullman, V. Vassalos, and J. Widom, “The TSIMMIS approach to mediation: Data models and languages,” Journal of Intelligent Information Systems, vol. 8, no.2, pp. 117–132, 1997.
H. Garcia-Molina, Y. Papakonstantinou, D. Quass, A. Rajamaran, Y. Sagiv, J. Ullman, and J. Widom, “The TSIMMIS approach to mediation: Data models and languages (extended abstract),” in Proceedings of the Workshop on Next Generation Information Technology and Systems, Naharia, Israel, Springer-Verlag: Berlin, June 1995.
M. Garcia-Solaco, F. Saltor, and M. Castellanos, “Semantic heterogeneity in multidatabase systems,” in [6], pp. 129–202.
G. Gardarin, B. Finance, and P. Fankhauser, “Federating object-oriented and relational databases: The IRO-DB experience,” in Proceedings of the Second International Conference on Cooperative Information Systems CoopIS 97, IEEE Computer Society Press, June 1997, pp. 2–13.
G. Gardarin, F. Sha, and Z-H. Tang, “Calibrating the query optimizer cost model of IRO-DB, an object-oriented federated database system,” in Proceedings of the 22nd VLDB Conference, Mumbai (Bombay) India, Morgan Kaufman, 1996, pp. 378–389.
M. Hartig and K. Dittrich, “An object-oriented integration framework for building heterogeneous database systems,” in Interoperable Database Systems (DS-5), D.K. Hsiao, E.J. Neuhold, and R. Sack-Davies (Eds.), Elsevier Science Publishers B.V. (North-Holland), 1993, pp. 33–53.
L.M. Hass, D. Kossmann, E.L. Wimmers, J. Yang, and L. Raschid, “Optimizing queries across diverse data sources,” in Proceedings of the 23rd VLDB Conference, Athens, Greece, ACM Press, 1997, pp. 267–285.
R. Hull, “Managing semantic heterogeneity in databases: A theoretical perspective,” in Tutorial at 16th ACM Symposium in Principles of Database Systems (SIGMOD/PODS 97), Tucson, Arizona, May 1997.
“InfoSleuth Project at MCC,” available at http://www.mccc.com/projects/infosleuth.
V. Kashyap and A. Sheth, “Semantic similarities between objects in multiple databases,” in [16], pp. 57–89.
A. Kemper and G. Moerkotte, “Query optimization in object Bases: Exploiting relational techniques,” in [17], pp. 63–98.
W. Kent, R. Ahmed, T. J. Alber, M. Ketabchi, and M. Shan, “Object identification in multidatabase systems,” in Interoperable Database Systems (DS-5), D.K. Hsiao, E.J. Neuhold, and R. Sack-Davies (Eds.), Elsevier Science Publishers B.V. (North-Holland), 1993, pp. 313–330.
W. Kim (Ed.), Modern Database Systems: The Object Model, Interoperability, and Beyond, ACM Press: New York, 1995.
W. Kim and W. Kelley, “On view support in object-oriented database systems,” in [31], pp. 108–129.
J. Lavariega, “Object-oriented query processing in a multidatabase environment: Integrating structural and operational approaches,” Ph.D. Dissertation, Department of Computer Science and Engineering, Arizona State University, December 1999.
J. Lavariega and S. Urban, “Donaji, a semantic architecture for multidatabase systems,” in Workshop on Intelligent Information Integration, 13th Biennial European Conference on Artificial Intelligence (ECAI-98), clBrighton, UK, TZ1-Report no. 10-1998, Universitat Bremen, Aug. 25, 1998, pp. 39–54.
J. Lavariega and S. Urban, “Global object identification in the Donají multidatabase environment” in Proceedings of the Business Information Systems Conference, Poznan, Poland, Springer-Verlag, April 11-13, 2000, pp. 147–171.
T.W. Leung, G. Mitchel, B. Subramanian, B. Vance, S.L. Vandenberg, and S.B. Zdonik, “The AQUA data model and algebra,” in Proceedings of the 4th International Workshop on Database Programming Languages (DBPL), Manhattan, New York City, Springer-Verlag, August 1993.
A.Y. Levy, D. Srivastava, and T. Kirk, “Data model and query evaluation in global information systems,” Journal of Intelligent Information Systems, vol. 5, no.2, pp. 121–143, 1995.
Q. Li and D. McLeod, “An object-oriented approach to federated databases,” in Proceedings of the First International Workshop on Interoperability in Multidatabase Systems, Kyoto, Japan, IEEE Computer Society, 1991, pp. 64–70.
L. Liu, C. Pu, and Y. Lee, “An adaptive approach to query mediation across heterogeneous information sources,” in Proceedings of the International Conference on Cooperative Information Systems, Brussels, Belgium, IEEE Computer Society Press, June 1996, pp. 144–156.
F. Manola and S. Heiler, “An approach to interoperable object models,” in [46], pp. 27–35.
U. Merz and R. King, “DIRECT: A query facility for multiple databases,” ACM Transactions on Information Systems, vol. 12, no.4, pp. 339–359, 1994.
J. Murphy and J. Grimson, “The Jupiter system: A prototype for multidatabase interoperability,” in Directions in Databases, 12th British National Conference on Databases, BNCOD12, Guildford, United Kingdom, LNCS 526, Springer Verlag, July 1994, pp. 205–220.
S. Navathe and A. Savasere, “A schema integration facility using object-oriented data model,” in [6], pp. 105–128.
M.H. Nodine and A. Unruh, “Facilitating open communication in agent systems: The InfoSleuth infrastructure,” Microelectronics and Computer Technology Corporation (MCC), 1997.
M.H. Nodine and S.B. Zdonik, “The impact of transaction management on object-oriented multidatabase views,” in [6], pp. 57–104.
M.T. Ozsu, U. Dayal, and P. Valduriez (Eds.), Distributed Object Management, Morgan Kaufmann: San Mateo, CA, 1994.
M.T. Ozsu and P. Valduriez, Principles of Distributed Database Systems, 2nd Edition, Prentice Hall, 1999.
Y. Papakonstantinou, S. Abiteboul, and H. Garcia-Molina, “Object fusion in mediator systems,” in Proceedings of the 22nd VLDB Conference, Mumbai (Bombay) India, ACM Press, 1996, pp. 413–424.
Y. Papakonstantinou, H. Garcia-Molina, and J. Widom, “Object exchange across heterogeneous information sources,” in Proceedings of the 11th International Conference on Data Engineering, Taipei, Taiwan, 1995, IEEE Computer Society Press, March 6-10, pp. 251–260.
E. Pitoura, O. Bukhres, and A. Elmagarmid, “Object orientation in multidatabase systems,” ACM Computing Surveys, vol. 27, no.2, pp. 141–195, 1995.
M.T. Roth and P. Schwarz, “Don't scrap it, wrap it! A wrapper architecture for legacy data sources,” in Proceedings of the 23rd VLDB Conference, Athens, Greece, ACM Press, 1997, pp. 266–275.
A.P. Sheth and J.A. Larson, “Federated database systems for managing distributed heterogeneous, and autonomous databases,” ACM Computing Surveys, vol. 22, no.3, pp. 183–236, 1990.
A.R. Simon, Strategic Database Technology: Management for the Year 2000, Morgan Kaufmann: San Francisco, CA, 1995.
Z. Tari, J. Stokes, and S. Spaccapietra, “Object normal forms and dependency constraints for object-oriented schemata,” ACM Transactions on Database Systems, vol. 22, no.4, pp. 513-569, 1997.
A. Tomasic, L. Raschid, and P. Valduriez, “Scaling access to heterogeneous data sources with DISCO,” IEEE Transactions on Knowledge and Data Engineering, vol. 10, no.5, pp. 808–823, 1998.
S.D. Urban and T. Ben Abdellatif, “Object-oriented query language access to relational databases: A semantic framework for query translation,” Journal of Systems Integration, vol. 5, no.2, pp. 123–256, 1995.
G. Wiederhold and M. Genesereth, “The basis for mediation,” in Proceedings of the Conference on Cooperative Information Systems CoopIS 95, Vienna, Austria, IEEE Computer Society Press, May 1995, pp. 138–155.
G. Wiederhold and M. Genesereth, “The conceptual basis for mediation services,” IEEE Expert, vol. 12, no.8, pp. 38–47, 1997.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Lavariega, J.C., Urban, S.D. An Object Algebra Approach to Multidatabase Query Decomposition in Donají . Distributed and Parallel Databases 12, 27–71 (2002). https://doi.org/10.1023/A:1015630231324
Issue Date:
DOI: https://doi.org/10.1023/A:1015630231324