Abstract
This paper addresses the problem of data integration in a P2P environment, where each peer stores schema of its local data, mappings between the schemas, and some schema constraints. The goal of the integration is to answer queries formulated against a chosen peer. The answer consists of data stored in the queried peer as well as data of its direct and indirect partners. We focus on defining and using mappings, schema constraints, query propagation across the P2P system, and query reformulation in such scenario. The main focus is the exploitation of constraints for merging results from different peers to derive more complex information, and utilizing constraint knowledge to query propagation and the merging strategy. We show how the discussed method has been implemented in SixP2P system.
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Abiteboul, S., Benjelloun, O., Manolescu, I., Milo, T., Weber, R.: Active XML: Peer-to-Peer Data and Web Services Integration. In: VLDB, pp. 1087–1090. Morgan Kaufmann, San Francisco (2002)
Abiteboul, S., Hull, R., Vianu, V.: Foundations of Databases. Addison-Wesley, Reading (1995)
Arenas, M.: Normalization theory for XML. SIGMOD Record 35(4), 57–64 (2006)
Arenas, M., Libkin, L.: XML Data Exchange: Consistency and Query Answering. In: PODS Conference, pp. 13–24 (2005)
Brzykcy, G., Bartoszek, J., Pankowski, T.: Schema Mappings and Agents’ Actions in P2P Data Integration System. Journal of Universal Computer Science 14(7), 1048–1060 (2008)
Fagin, R., Kolaitis, P.G., Popa, L., Tan, W.C.: Composing Schema Mappings: Second-Order Dependencies to the Rescue. In: PODS, pp. 83–94 (2004)
Fuxman, A., Kolaitis, P.G., Miller, R.J., Tan, W.C.: Peer data exchange. ACM Trans. Database Syst. 31(4), 1454–1498 (2006)
Koloniari, G., Pitoura, E.: Peer-to-peer management of XML data: issues and research challenges. SIGMOD Record 34(2), 6–17 (2005)
Madhavan, J., Halevy, A.Y.: Composing Mappings Among Data Sources. In: VLDB, pp. 572–583 (2003)
Melnik, S., Bernstein, P.A., Halevy, A.Y., Rahm, E.: Supporting Executable Mappings in Model Management. In: SIGMOD Conference, pp. 167–178 (2005)
Milo, T., Abiteboul, S., Amann, B., Benjelloun, O., Ngoc, F.D.: Exchanging intensional XML data. ACM Trans. Database Syst. 30(1), 1–40 (2005)
Ooi, B.C., Shu, Y., Tan, K.-L.: Relational Data Sharing in Peer-based Data Management Systems. SIGMOD Record 32(3), 59–64 (2003)
Pankowski, T.: Reconciling inconsistent data in probabilistic XML data integration. In: Gray, A., Jeffery, K., Shao, J. (eds.) BNCOD 2008. LNCS, vol. 5071, pp. 75–86. Springer, Heidelberg (2008)
Pankowski, T.: XML data integration in SixP2P – a theoretical framework. In: EDBT Workshop Data Management in P2P Systems (DAMAP 2008), pp. 1–8. ACM Digital Library (2008)
Pankowski, T., Cybulka, J., Meissner, A.: XML Schema Mappings in the Presence of Key Constraints and Value Dependencies. In: ICDT 2007 Workshop EROW 2007, CEUR Workshop Proceedings. CEUR-WS.org, vol. 229, pp. 1–15 (2007)
Tatarinov, I., Halevy, A.Y.: Efficient Query Reformulation in Peer-Data Management Systems. In: SIGMOD Conference, pp. 539–550 (2004)
Tatarinov, I., Ives, Z.G., Madhavan, J., Halevy, A.Y., Suciu, D., Dalvi, N.N., Dong, X., Kadiyska, Y., Miklau, G., Mork, P.: The Piazza peer data management project. SIGMOD Record 32(3), 47–52 (2003)
XML Path Language (XPath) 2.0 (2006), http://www.w3.org/TR/xpath20
Yu, C., Popa, L.: Constraint-Based XML Query Rewriting For Data Integration. In: SIGMOD Conference, pp. 371–382 (2004)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Pankowski, T. (2008). Query Propagation in a P2P Data Integration System in the Presence of Schema Constraints. In: Hameurlain, A. (eds) Data Management in Grid and Peer-to-Peer Systems. Globe 2008. Lecture Notes in Computer Science, vol 5187. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-85176-9_5
Download citation
DOI: https://doi.org/10.1007/978-3-540-85176-9_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-85175-2
Online ISBN: 978-3-540-85176-9
eBook Packages: Computer ScienceComputer Science (R0)