skip to main content
10.1145/1379350.1379353acmotherconferencesArticle/Chapter ViewAbstractPublication PagesdamapConference Proceedingsconference-collections
research-article

XML data integration in SixP2P: a theoretical framework

Authors Info & Claims
Published:25 March 2008Publication History

ABSTRACT

In the paper we discuss the problem of data integration in a P2P environment. In such setting each peer stores schema of its local data, mappings between the schema and schemas of some other peers (peer's partners), and schema constraints. The goal of the integration is to answer queries formulated against arbitrarily chosen peers. The answer consists of data stored in the queried peer as well as data of its direct and indirect partners. We focus on defining and using mappings, schema constraints, query propagation across the P2P system, and query reformulation in such scenario. A special attention is paid to discovering missing values using schema constraints and to reconcile inconsistent data using reliability levels assigning to the sources of data. The discussed approach has been implemented in SixP2P system (Semantic Integration of XML data in P2P environment).

References

  1. Abiteboul, S., Hull, R., Vianu, V.: Foundations of Databases, Addison-Wesley, Reading, Massachusetts, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Arenas, M.: Normalization theory for XML, SIGMOD Record, 35(4), 2006, 57--64. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Arenas, M., Libkin, L.: XML Data Exchange: Consistency and Query Answering, PODS Conference, 2005, 13--24. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Buneman, P., Davidson, S. B., Fan, W., Hara, C. S., Tan, W. C.: Reasoning about keys for XML, Information Systems, 28(8), 2003, 1037--1063. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Calvanese, D., Giacomo, G. D., Lenzerini, M., Rosati, R.: Logical Foundations of Peer-To-Peer Data Integration., Proc. of the 23rd ACM SIGMOD Symposium on Principles of Database Systems (PODS 2004), 2004, 241--251. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Chiticariu, L., Hernandez, M. A., Kolaitis, P. G., Popa, L.: Semi-Automatic Schema Integration in Clio, VLDB, 2007, 1326--1329. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Dong, X. L., Halevy, A. Y., Yu, C.: Data Integration with Uncertainty, VLDB, ACM, 2007, 687--698. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Fagin, R., Kolaitis, P. G., Popa, L., Tan, W. C.: Composing Schema Mappings: Second-Order Dependencies to the Rescue, PODS, 2004, 83--94. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Fuxman, A., Kolaitis, P. G., Miller, R. J., Tan, W. C.: Peer data exchange, ACM Trans. Database Syst., 31(4), 2006, 1454--1498. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Haas, L. M., Hernandez, M. A., Ho, H., Popa, L., Roth, M.: Clio grows up: from research prototype to industrial tool, SIGMOD Conference, 2005, 805--810. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Halevy, A. Y., Ives, Z. G., Suciu, D., Tatarinov, I.: Schema mediation for large-scale semantic data sharing, VLDB J., 14(1), 2005, 68--83. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Lenzerini, M.: Data Integration: A Theoretical Perspective., PODS (L. Popa, Ed.), ACM, 2002, 233--246. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Madhavan, J., Halevy, A. Y.: Composing Mappings Among Data Sources., VLDB, 2003, 572--583. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Pankowski, T.: Management of executable schema mappings for XML data exchange, Database Technologies for Handling XML Information on the Web, EDBT 2006 Workshops, Lecture Notes in Computer Science 4254, 2006, 264--277. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Pankowski, T.: Reconciling inconsistent data in probabilistic XML data integration, British National Conference on Databases (BNCOD) 2008, Lecture Notes in Computer Science 5071, 2008, 75--86. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Pankowski, T., Cybulka, J., Meissner, A.: Reasoning About XML Schema Mappings in the Presence of Key Constraints and Value Dependencies, Web Reasoning and Rule Systems (RR 2007), Lecture Notes in Computer Science 4524, 2007, 374--376. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Pankowski, T., Cybulka, J., Meissner, A.: XML Schema Mappings in the Presence of Key Constraints and Value Dependencies, ICDT 2007 Workshop EROW '07, CEUR Workshop Proceedings Vol. 229, CEUR-WS.org, Vol. 229, 2007, 1--15.Google ScholarGoogle ScholarCross RefCross Ref
  18. Staworko, S., Chomicki, J.: Validity-Sensitive Querying of XML Databases, Database Technologies for Handling XML Information on the Web, EDBT 2006 Workshops, Lecture Notes in Computer Science 4254, 2006, 164--177. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Taylor, N. E., Ives, Z. G.: Reconciling while tolerating disagreement in collaborative data sharing, SIGMOD Conference, ACM, 2006, 13--24. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. XML Path Language (XPath) 2.0: 2006. www.w3.org/TR/xpath20Google ScholarGoogle Scholar
  21. Xu, W., Özsoyoglu, Z. M.: Rewriting XPath Queries Using Materialized Views, Int. Conference on Very Large Data Bases, 2005, 2005, 121--132. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Yu, C., Popa, L.: Constraint-Based XML Query Rewriting For Data Integration., SIGMOD Conference, 2004, 371--382. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. XML data integration in SixP2P: a theoretical framework

                Recommendations

                Comments

                Login options

                Check if you have access through your login credentials or your institution to get full access on this article.

                Sign in
                • Published in

                  cover image ACM Other conferences
                  DaMaP '08: Proceedings of the 2008 international workshop on Data management in peer-to-peer systems
                  March 2008
                  85 pages
                  ISBN:9781595939678
                  DOI:10.1145/1379350

                  Copyright © 2008 ACM

                  Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

                  Publisher

                  Association for Computing Machinery

                  New York, NY, United States

                  Publication History

                  • Published: 25 March 2008

                  Permissions

                  Request permissions about this article.

                  Request Permissions

                  Check for updates

                  Qualifiers

                  • research-article

                PDF Format

                View or Download as a PDF file.

                PDF

                eReader

                View online with eReader.

                eReader