DOI: 10.1145/276304.276321

Your mediators need data conversion!

Published: 01 June 1998

ABSTRACT

Due to the development of the World Wide Web, the integration of heterogeneous data sources has become a major concern of the database community. Appropriate architectures and query languages have been proposed. Yet the problem of data conversion, which is essential for the development of mediator/wrapper architectures, has remained largely unexplored.

In this paper, we present the YAT system for data conversion. This system provides tools for the specification and the implementation of data conversions among heterogeneous data sources. It relies on a middleware model, a declarative language, a customization mechanism and a graphical interface.

The model is based on named trees with ordered and labeled nodes. Like semistructured data models, it is simple enough to facilitate the representation of any data. Its main originality is that it allows reasoning at various levels of representation. The YAT conversion language (called YATL) is declarative and rule-based, and it features enhanced pattern matching facilities and powerful restructuring primitives. It can preserve or reconstruct the order of collections. The customization mechanism relies on program instantiations: an existing program may be instantiated into a more specific one, and then easily modified. We also present the architecture, implementation and practical use of the YAT prototype, currently under evaluation within the OPAL project.
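
To make the ideas of ordered labeled trees and rule-based restructuring concrete, here is a minimal Python sketch. It is not YATL syntax and does not reproduce the YAT implementation: the names `Node`, `Rule`, `convert`, `leaf_value` and `by_year`, as well as the sample data, are hypothetical and chosen only for illustration. Each rule pairs a pattern (a predicate on a node) with a restructuring step, and one rule reconstructs the order of a collection by sorting it on a field.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class Node:
    """A labeled tree node; `children` is a list, so sibling order is kept."""
    label: str
    children: List["Node"] = field(default_factory=list)


@dataclass
class Rule:
    """Hypothetical rule: `matches` is the pattern, `build` the restructuring."""
    matches: Callable[[Node], bool]
    build: Callable[[Node, Callable[[Node], Node]], Node]


def convert(node: Node, rules: List[Rule]) -> Node:
    """Apply the first matching rule; otherwise copy the node and recurse."""
    def recurse(n: Node) -> Node:
        return convert(n, rules)
    for rule in rules:
        if rule.matches(node):
            return rule.build(node, recurse)
    return Node(node.label, [recurse(c) for c in node.children])


def leaf_value(node: Node, name: str) -> str:
    """Return the label of the first grandchild under the child named `name`."""
    for child in node.children:
        if child.label == name:
            return child.children[0].label
    raise KeyError(name)


def by_year(row: Node) -> int:
    """Sort key: the row's `year` field as an integer."""
    return int(leaf_value(row, "year"))


rules = [
    # table -> works, with the rows reordered by year (order reconstructed)
    Rule(lambda n: n.label == "table",
         lambda n, rec: Node("works", [rec(r) for r in sorted(n.children, key=by_year)])),
    # row -> work, with the fields kept in their source order (order preserved)
    Rule(lambda n: n.label == "row",
         lambda n, rec: Node("work", [rec(c) for c in n.children])),
]

# Made-up source data: a relational-style table encoded as an ordered tree.
source = Node("table", [
    Node("row", [Node("title", [Node("B")]), Node("year", [Node("1998")])]),
    Node("row", [Node("title", [Node("A")]), Node("year", [Node("1995")])]),
])

result = convert(source, rules)
titles = [leaf_value(w, "title") for w in result.children]
print(result.label, titles)  # prints: works ['A', 'B']
```

In this sketch, a more specific conversion could be obtained by copying the rule list and replacing or adding rules, which loosely mirrors the idea of instantiating an existing program and then modifying it.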

Published in

SIGMOD '98: Proceedings of the 1998 ACM SIGMOD international conference on Management of data
June 1998
599 pages
ISBN: 0897919955
DOI: 10.1145/276304

Copyright © 1998 ACM

Publisher: Association for Computing Machinery, New York, NY, United States

Overall Acceptance Rate: 785 of 4,003 submissions, 20%