ABSTRACT
In the paper we discuss the problem of data integration in a P2P environment. In such setting each peer stores schema of its local data, mappings between the schema and schemas of some other peers (peer's partners), and schema constraints. The goal of the integration is to answer queries formulated against arbitrarily chosen peers. The answer consists of data stored in the queried peer as well as data of its direct and indirect partners. We focus on defining and using mappings, schema constraints, query propagation across the P2P system, and query reformulation in such scenario. A special attention is paid to discovering missing values using schema constraints and to reconcile inconsistent data using reliability levels assigning to the sources of data. The discussed approach has been implemented in SixP2P system (Semantic Integration of XML data in P2P environment).
- Abiteboul, S., Hull, R., Vianu, V.: Foundations of Databases, Addison-Wesley, Reading, Massachusetts, 1995. Google ScholarDigital Library
- Arenas, M.: Normalization theory for XML, SIGMOD Record, 35(4), 2006, 57--64. Google ScholarDigital Library
- Arenas, M., Libkin, L.: XML Data Exchange: Consistency and Query Answering, PODS Conference, 2005, 13--24. Google ScholarDigital Library
- Buneman, P., Davidson, S. B., Fan, W., Hara, C. S., Tan, W. C.: Reasoning about keys for XML, Information Systems, 28(8), 2003, 1037--1063. Google ScholarDigital Library
- Calvanese, D., Giacomo, G. D., Lenzerini, M., Rosati, R.: Logical Foundations of Peer-To-Peer Data Integration., Proc. of the 23rd ACM SIGMOD Symposium on Principles of Database Systems (PODS 2004), 2004, 241--251. Google ScholarDigital Library
- Chiticariu, L., Hernandez, M. A., Kolaitis, P. G., Popa, L.: Semi-Automatic Schema Integration in Clio, VLDB, 2007, 1326--1329. Google ScholarDigital Library
- Dong, X. L., Halevy, A. Y., Yu, C.: Data Integration with Uncertainty, VLDB, ACM, 2007, 687--698. Google ScholarDigital Library
- Fagin, R., Kolaitis, P. G., Popa, L., Tan, W. C.: Composing Schema Mappings: Second-Order Dependencies to the Rescue, PODS, 2004, 83--94. Google ScholarDigital Library
- Fuxman, A., Kolaitis, P. G., Miller, R. J., Tan, W. C.: Peer data exchange, ACM Trans. Database Syst., 31(4), 2006, 1454--1498. Google ScholarDigital Library
- Haas, L. M., Hernandez, M. A., Ho, H., Popa, L., Roth, M.: Clio grows up: from research prototype to industrial tool, SIGMOD Conference, 2005, 805--810. Google ScholarDigital Library
- Halevy, A. Y., Ives, Z. G., Suciu, D., Tatarinov, I.: Schema mediation for large-scale semantic data sharing, VLDB J., 14(1), 2005, 68--83. Google ScholarDigital Library
- Lenzerini, M.: Data Integration: A Theoretical Perspective., PODS (L. Popa, Ed.), ACM, 2002, 233--246. Google ScholarDigital Library
- Madhavan, J., Halevy, A. Y.: Composing Mappings Among Data Sources., VLDB, 2003, 572--583. Google ScholarDigital Library
- Pankowski, T.: Management of executable schema mappings for XML data exchange, Database Technologies for Handling XML Information on the Web, EDBT 2006 Workshops, Lecture Notes in Computer Science 4254, 2006, 264--277. Google ScholarDigital Library
- Pankowski, T.: Reconciling inconsistent data in probabilistic XML data integration, British National Conference on Databases (BNCOD) 2008, Lecture Notes in Computer Science 5071, 2008, 75--86. Google ScholarDigital Library
- Pankowski, T., Cybulka, J., Meissner, A.: Reasoning About XML Schema Mappings in the Presence of Key Constraints and Value Dependencies, Web Reasoning and Rule Systems (RR 2007), Lecture Notes in Computer Science 4524, 2007, 374--376. Google ScholarDigital Library
- Pankowski, T., Cybulka, J., Meissner, A.: XML Schema Mappings in the Presence of Key Constraints and Value Dependencies, ICDT 2007 Workshop EROW '07, CEUR Workshop Proceedings Vol. 229, CEUR-WS.org, Vol. 229, 2007, 1--15.Google ScholarCross Ref
- Staworko, S., Chomicki, J.: Validity-Sensitive Querying of XML Databases, Database Technologies for Handling XML Information on the Web, EDBT 2006 Workshops, Lecture Notes in Computer Science 4254, 2006, 164--177. Google ScholarDigital Library
- Taylor, N. E., Ives, Z. G.: Reconciling while tolerating disagreement in collaborative data sharing, SIGMOD Conference, ACM, 2006, 13--24. Google ScholarDigital Library
- XML Path Language (XPath) 2.0: 2006. www.w3.org/TR/xpath20Google Scholar
- Xu, W., Özsoyoglu, Z. M.: Rewriting XPath Queries Using Materialized Views, Int. Conference on Very Large Data Bases, 2005, 2005, 121--132. Google ScholarDigital Library
- Yu, C., Popa, L.: Constraint-Based XML Query Rewriting For Data Integration., SIGMOD Conference, 2004, 371--382. Google ScholarDigital Library
Index Terms
- XML data integration in SixP2P: a theoretical framework
Recommendations
XML Processing and Data Integration with XQuery
Most Web applications exchange data as XML, but they create and process this data with languages that don't have native support for XML. With appropriate middleware, XQuery can dramatically simplify this process, treating all data sources as though they ...
Optimising XML---RDF data integration: a formal approach to improve XSPARQL efficiency
ESWC'12: Proceedings of the 9th international conference on The Semantic Web: research and applicationsThe Semantic Web provides a wealth of open data in RDF format. XML remains a widespread format for data exchange. When combining data of these two formats several problems arise due to representational incompatibilities. The query language XSPARQL, ...
XML schema integration to facilitate E-commerce
Web-enabled systems integrationXML has become the de facto standard for Information Exchange protocol for e-commerce and many work group applications such as Enterprise Resource Planning (ERP). The availability of large amounts of heterogeneous distributed web data necessitates the ...
Comments