Skip to main content
Log in

An Object Algebra Approach to Multidatabase Query Decomposition in Donají

  • Published:
Distributed and Parallel Databases Aims and scope Submit manuscript

Abstract

This paper presents an approach to query decomposition in a multidatabase environment. The unique aspect of this approach is that it is based on performing transformations over an object algebra that can be used as the basis for a global query language. In the paper, we first present our multidatabase environment and semantic framework, where a global conceptual schema based on the Object Data Management Group standard encompasses the information from heterogeneous data sources that include relational databases as well as object-oriented databases and flat file sources. The meta-data about the global schema is enhanced with information about virtual classes as well as virtual relationships and inheritance hierarchies that exist between multiple sources. The AQUA object algebra is used as the formal foundation for manipulation of the query expression over the multidatabase. AQUA is enhanced with distribution operators for dealing with data distribution issues. During query decomposition we perform an extensive analysis of traversals for path expressions that involve virtual relationships and hierarchies for access to several heterogeneous sources. The distribution operators defined in algebraic terms enhance the global algebra expression with semantic information about the structure, distribution, and localization of the data sources relevant to the solution of the query. By using an object algebra as the basis for query processing, we are able to define algebraic transformations and exploit rewriting techniques during the decomposition phase. Our use of an object algebra also provides a formal and uniform representation for dealing with an object-oriented approach to multidatabase query processing. As part of our query processing discussion, we include an overview of a global object identification approach for relating semantically equivalent objects from diverse data sources, illustrating how knowledge about global object identity is used in the decomposition and assembly processes.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. R. Ahmed, P. De Smedt, W. Du, W. Kent, M. Ketabchi, W. Litwin, A. Rafii, and M. Shan, “The pegasus heterogeneous multidatabase system,” IEEE Computer, pp. 19–27, 1991.

  2. E. Bertino, “Integration of heterogeneous data repositories by using object-oriented views,” in Proceedings of the First International Workshop on Interoperability in Multidatabase Systems, Kyoto, Japan, IEEE Computer Society, April 1991, pp. 22–29.

    Google Scholar 

  3. E. Bertino, M. Negri, G. Pelagatti, and L. Sbattella, “Integration of heterogeneous database applications through an object-oriented interface,” Information Systems, vol. 14, no.5, pp. 407–420, 1989.

    Google Scholar 

  4. A. Bouguettaya, B. Benatallah, and A. Elmagarmid, “An overview of multidatabase systems: Past and present,” in [16], pp. 1–32.

  5. M. Bright, A. Hurson, and S. Pakzad, “A taxonomy and current issues in multidatabase systems,” IEEE Computer, vol. 25, no.3, pp. 50–60, 1992.

    Google Scholar 

  6. O. Bukhres and A.K. Elmagarmid (Eds.), Object-Oriented Multidatabase Systems, A Solution for Advanced Applications, Prentice Hall: Englewood Cliffs, NJ, 1996.

    Google Scholar 

  7. D. Callen and S.D. Urban, “Consolidation of query results in a multidatabase environment: An object oriented approach,” in Proceedings of the IEEE Computer Society Computer Software and Applications Conference, Seoul, Korea, IEEE Computer Society Press, Aug. 1996, pp. 367–372.

    Google Scholar 

  8. M. Castellanos, F. Saltor, and M. Garcia-Solaco, “A canonical model for the interoperability among object-oriented and relational databases,” in [46], pp. 309–314.

  9. R. Cattell and D.K. Barry (Eds.), The Object Database Standard ODMG-2.0, Morgan-Kaufmann: San Mateo, CA, 1997.

    Google Scholar 

  10. R. Cattell, D.K. Barry, M. Berler, J. Eastman, D. Jordan, C. Russell, O. Schadow, T. Torsten Stanienda, and F. Velez, The Object Database Standard ODMG-3.0, Morgan-Kaufmann: San Mateo, CA, 2000.

    Google Scholar 

  11. M. Cherniack, S.B. Zdonik, and M.H. Nodine, “To form a more perfect union (intersection, difference),” in Proceedings of the Database Programming Languages Conference, Springer-Verlag: Berlin, 1995, pp. 1–14.

    Google Scholar 

  12. B. Demuth, A. Geppert, and T. Gorchs, “Algebraic query optimization in the CoOMS structurally object-oriented database system,” in [17], pp. 121–142.

  13. A. Dogac, C. Dengi, E. Kilic, G. Ozhan, S. Nural, C. Evrendilek, U. Halici, B. Arpinar, P. Koukal, N. Kesim, and S. Mancuhan, “METU interoperable database system,” Technical Report from Middle East Technical University, TR-6-1, June 1995.

  14. F. Eliassen, “Managing identity in global object view,” in Proceedings of the Fifth International Workshop on Research Issues on Data Engineering: Distributed Object Management, IEEE Computer Society Press, 1995, pp. 70–77.

  15. A. Elmagarmid, W. Du, and R. Ahmed, “Local autonomy and its effects on multidatabase systems,” in [16], pp. 33–52.

  16. A. Elmagarmid, M. Rusinkiewicz, and A. Sheth (Eds.), Management of Heterogeneous and Autonomous Database Systems, Morgan Kaufmann: San Francisco, CA, 1999.

    Google Scholar 

  17. J.C. Freytang, D. Maier, and G. Vossen. (Eds.), Query Processing for Advanced Database Systems, Morgan Kaufmann: San Mateo, CA, 1994.

    Google Scholar 

  18. H. Garcia-Molina, J. Hammer, K. Ireland, Y. Papakonstantinou, J. Ullman, and J. Widom, “Integrating and accessing heterogeneous information systems in TSIMMIS,” in Proceedings of the AAAI Symposium on Information Gathering, Stanford, California, March 1995, pp. 61–64.

  19. H. Garcia-Molina, Y. Papakonstantinou, D. Quass, A. Rajaraman, Y. Sagiv, J. Ullman, V. Vassalos, and J. Widom, “The TSIMMIS approach to mediation: Data models and languages,” Journal of Intelligent Information Systems, vol. 8, no.2, pp. 117–132, 1997.

    Google Scholar 

  20. H. Garcia-Molina, Y. Papakonstantinou, D. Quass, A. Rajamaran, Y. Sagiv, J. Ullman, and J. Widom, “The TSIMMIS approach to mediation: Data models and languages (extended abstract),” in Proceedings of the Workshop on Next Generation Information Technology and Systems, Naharia, Israel, Springer-Verlag: Berlin, June 1995.

    Google Scholar 

  21. M. Garcia-Solaco, F. Saltor, and M. Castellanos, “Semantic heterogeneity in multidatabase systems,” in [6], pp. 129–202.

  22. G. Gardarin, B. Finance, and P. Fankhauser, “Federating object-oriented and relational databases: The IRO-DB experience,” in Proceedings of the Second International Conference on Cooperative Information Systems CoopIS 97, IEEE Computer Society Press, June 1997, pp. 2–13.

  23. G. Gardarin, F. Sha, and Z-H. Tang, “Calibrating the query optimizer cost model of IRO-DB, an object-oriented federated database system,” in Proceedings of the 22nd VLDB Conference, Mumbai (Bombay) India, Morgan Kaufman, 1996, pp. 378–389.

    Google Scholar 

  24. M. Hartig and K. Dittrich, “An object-oriented integration framework for building heterogeneous database systems,” in Interoperable Database Systems (DS-5), D.K. Hsiao, E.J. Neuhold, and R. Sack-Davies (Eds.), Elsevier Science Publishers B.V. (North-Holland), 1993, pp. 33–53.

    Google Scholar 

  25. L.M. Hass, D. Kossmann, E.L. Wimmers, J. Yang, and L. Raschid, “Optimizing queries across diverse data sources,” in Proceedings of the 23rd VLDB Conference, Athens, Greece, ACM Press, 1997, pp. 267–285.

    Google Scholar 

  26. R. Hull, “Managing semantic heterogeneity in databases: A theoretical perspective,” in Tutorial at 16th ACM Symposium in Principles of Database Systems (SIGMOD/PODS 97), Tucson, Arizona, May 1997.

  27. “InfoSleuth Project at MCC,” available at http://www.mccc.com/projects/infosleuth.

  28. V. Kashyap and A. Sheth, “Semantic similarities between objects in multiple databases,” in [16], pp. 57–89.

  29. A. Kemper and G. Moerkotte, “Query optimization in object Bases: Exploiting relational techniques,” in [17], pp. 63–98.

  30. W. Kent, R. Ahmed, T. J. Alber, M. Ketabchi, and M. Shan, “Object identification in multidatabase systems,” in Interoperable Database Systems (DS-5), D.K. Hsiao, E.J. Neuhold, and R. Sack-Davies (Eds.), Elsevier Science Publishers B.V. (North-Holland), 1993, pp. 313–330.

    Google Scholar 

  31. W. Kim (Ed.), Modern Database Systems: The Object Model, Interoperability, and Beyond, ACM Press: New York, 1995.

    Google Scholar 

  32. W. Kim and W. Kelley, “On view support in object-oriented database systems,” in [31], pp. 108–129.

  33. J. Lavariega, “Object-oriented query processing in a multidatabase environment: Integrating structural and operational approaches,” Ph.D. Dissertation, Department of Computer Science and Engineering, Arizona State University, December 1999.

  34. J. Lavariega and S. Urban, “Donaji, a semantic architecture for multidatabase systems,” in Workshop on Intelligent Information Integration, 13th Biennial European Conference on Artificial Intelligence (ECAI-98), clBrighton, UK, TZ1-Report no. 10-1998, Universitat Bremen, Aug. 25, 1998, pp. 39–54.

  35. J. Lavariega and S. Urban, “Global object identification in the Donají multidatabase environment” in Proceedings of the Business Information Systems Conference, Poznan, Poland, Springer-Verlag, April 11-13, 2000, pp. 147–171.

    Google Scholar 

  36. T.W. Leung, G. Mitchel, B. Subramanian, B. Vance, S.L. Vandenberg, and S.B. Zdonik, “The AQUA data model and algebra,” in Proceedings of the 4th International Workshop on Database Programming Languages (DBPL), Manhattan, New York City, Springer-Verlag, August 1993.

    Google Scholar 

  37. A.Y. Levy, D. Srivastava, and T. Kirk, “Data model and query evaluation in global information systems,” Journal of Intelligent Information Systems, vol. 5, no.2, pp. 121–143, 1995.

    Google Scholar 

  38. Q. Li and D. McLeod, “An object-oriented approach to federated databases,” in Proceedings of the First International Workshop on Interoperability in Multidatabase Systems, Kyoto, Japan, IEEE Computer Society, 1991, pp. 64–70.

    Google Scholar 

  39. L. Liu, C. Pu, and Y. Lee, “An adaptive approach to query mediation across heterogeneous information sources,” in Proceedings of the International Conference on Cooperative Information Systems, Brussels, Belgium, IEEE Computer Society Press, June 1996, pp. 144–156.

    Google Scholar 

  40. F. Manola and S. Heiler, “An approach to interoperable object models,” in [46], pp. 27–35.

  41. U. Merz and R. King, “DIRECT: A query facility for multiple databases,” ACM Transactions on Information Systems, vol. 12, no.4, pp. 339–359, 1994.

    Google Scholar 

  42. J. Murphy and J. Grimson, “The Jupiter system: A prototype for multidatabase interoperability,” in Directions in Databases, 12th British National Conference on Databases, BNCOD12, Guildford, United Kingdom, LNCS 526, Springer Verlag, July 1994, pp. 205–220.

  43. S. Navathe and A. Savasere, “A schema integration facility using object-oriented data model,” in [6], pp. 105–128.

  44. M.H. Nodine and A. Unruh, “Facilitating open communication in agent systems: The InfoSleuth infrastructure,” Microelectronics and Computer Technology Corporation (MCC), 1997.

  45. M.H. Nodine and S.B. Zdonik, “The impact of transaction management on object-oriented multidatabase views,” in [6], pp. 57–104.

  46. M.T. Ozsu, U. Dayal, and P. Valduriez (Eds.), Distributed Object Management, Morgan Kaufmann: San Mateo, CA, 1994.

    Google Scholar 

  47. M.T. Ozsu and P. Valduriez, Principles of Distributed Database Systems, 2nd Edition, Prentice Hall, 1999.

  48. Y. Papakonstantinou, S. Abiteboul, and H. Garcia-Molina, “Object fusion in mediator systems,” in Proceedings of the 22nd VLDB Conference, Mumbai (Bombay) India, ACM Press, 1996, pp. 413–424.

    Google Scholar 

  49. Y. Papakonstantinou, H. Garcia-Molina, and J. Widom, “Object exchange across heterogeneous information sources,” in Proceedings of the 11th International Conference on Data Engineering, Taipei, Taiwan, 1995, IEEE Computer Society Press, March 6-10, pp. 251–260.

  50. E. Pitoura, O. Bukhres, and A. Elmagarmid, “Object orientation in multidatabase systems,” ACM Computing Surveys, vol. 27, no.2, pp. 141–195, 1995.

    Google Scholar 

  51. M.T. Roth and P. Schwarz, “Don't scrap it, wrap it! A wrapper architecture for legacy data sources,” in Proceedings of the 23rd VLDB Conference, Athens, Greece, ACM Press, 1997, pp. 266–275.

    Google Scholar 

  52. A.P. Sheth and J.A. Larson, “Federated database systems for managing distributed heterogeneous, and autonomous databases,” ACM Computing Surveys, vol. 22, no.3, pp. 183–236, 1990.

    Google Scholar 

  53. A.R. Simon, Strategic Database Technology: Management for the Year 2000, Morgan Kaufmann: San Francisco, CA, 1995.

    Google Scholar 

  54. Z. Tari, J. Stokes, and S. Spaccapietra, “Object normal forms and dependency constraints for object-oriented schemata,” ACM Transactions on Database Systems, vol. 22, no.4, pp. 513-569, 1997.

    Google Scholar 

  55. A. Tomasic, L. Raschid, and P. Valduriez, “Scaling access to heterogeneous data sources with DISCO,” IEEE Transactions on Knowledge and Data Engineering, vol. 10, no.5, pp. 808–823, 1998.

    Google Scholar 

  56. S.D. Urban and T. Ben Abdellatif, “Object-oriented query language access to relational databases: A semantic framework for query translation,” Journal of Systems Integration, vol. 5, no.2, pp. 123–256, 1995.

    Google Scholar 

  57. G. Wiederhold and M. Genesereth, “The basis for mediation,” in Proceedings of the Conference on Cooperative Information Systems CoopIS 95, Vienna, Austria, IEEE Computer Society Press, May 1995, pp. 138–155.

    Google Scholar 

  58. G. Wiederhold and M. Genesereth, “The conceptual basis for mediation services,” IEEE Expert, vol. 12, no.8, pp. 38–47, 1997.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Lavariega, J.C., Urban, S.D. An Object Algebra Approach to Multidatabase Query Decomposition in Donají . Distributed and Parallel Databases 12, 27–71 (2002). https://doi.org/10.1023/A:1015630231324

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1023/A:1015630231324

Navigation