ABSTRACT
Due to the development of the World Wide Web, the integration of heterogeneous data sources has become a major concern of the database community. Appropriate architectures and query languages have been proposed. Yet the problem of data conversion, which is essential for the development of mediator/wrapper architectures, has remained largely unexplored.
In this paper, we present the YAT system for data conversion. This system provides tools for the specification and the implementation of data conversions among heterogeneous data sources. It relies on a middleware model, a declarative language, a customization mechanism and a graphical interface.
The model is based on named trees with ordered and labeled nodes. Like semistructured data models, it is simple enough to facilitate the representation of any data. Its main originality is that it allows reasoning at various levels of representation. The YAT conversion language (called YATL) is declarative and rule-based, and features enhanced pattern matching facilities and powerful restructuring primitives. It can preserve or reconstruct the order of collections. The customization mechanism relies on program instantiations: an existing program may be instantiated into a more specific one, and then easily modified. We also present the architecture, implementation and practical use of the YAT prototype, currently under evaluation within the OPAL project.
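To make the idea of rule-based conversion over ordered, labeled trees more concrete, the following is a minimal Python sketch. It is not YATL syntax and not the YAT implementation: the `Node` class, the `$`-prefixed variable convention, and the `match`, `instantiate` and `convert` helpers are all hypothetical names introduced here for illustration only. The sketch assumes a rule is a pair of trees (a pattern and a template) and that variables appear only as leaves.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Node:
    """An ordered, labeled tree node (illustrative; not the YAT model itself)."""
    label: str
    children: List["Node"] = field(default_factory=list)


def match(pattern: Node, tree: Node, bindings: Dict[str, Node]) -> bool:
    """Match `tree` against `pattern`.

    Assumption: a pattern label starting with '$' is a variable that binds
    the whole subtree found at that position.
    """
    if pattern.label.startswith("$"):
        bindings[pattern.label] = tree
        return True
    if pattern.label != tree.label:
        return False
    if len(pattern.children) != len(tree.children):
        return False
    # Children are compared position by position, so order matters.
    return all(match(p, t, bindings)
               for p, t in zip(pattern.children, tree.children))


def instantiate(template: Node, bindings: Dict[str, Node]) -> Node:
    """Build an output tree by replacing variables with their bound subtrees."""
    if template.label.startswith("$"):
        return bindings[template.label]
    return Node(template.label,
                [instantiate(c, bindings) for c in template.children])


def convert(tree: Node, pattern: Node, template: Node) -> List[Node]:
    """Apply one rule to each child of `tree`, preserving the source order.

    Children that do not match the pattern are skipped.
    """
    out = []
    for child in tree.children:
        bindings: Dict[str, Node] = {}
        if match(pattern, child, bindings):
            out.append(instantiate(template, bindings))
    return out


# Hypothetical source data: a relational-style table encoded as a tree.
source = Node("table", [
    Node("row", [Node("title", [Node("Waterlilies")]),
                 Node("year", [Node("1897")])]),
    Node("row", [Node("title", [Node("Impression")]),
                 Node("year", [Node("1872")])]),
])

# Rule: row[title[$t], year[$y]]  ->  work[$t, $y]
pattern = Node("row", [Node("title", [Node("$t")]),
                       Node("year", [Node("$y")])])
template = Node("work", [Node("$t"), Node("$y")])

result = convert(source, pattern, template)
```

Because both matching and output construction walk children in list order, the converted collection keeps the order of the source rows. A more specific rule can be obtained by replacing a variable in `pattern` with a concrete subtree, which loosely mirrors the idea of instantiating a general program into a specialized one.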
Index Terms
- Your mediators need data conversion!