Abstract
Peer-to-peer (P2P) networks have become an important infrastructure in recent years. Building distributed information systems on P2P networks shifts the focus from centrally organized systems to distributed ones in which every peer can both provide and access information.
In previous papers, we described Edutella, an RDF-based P2P infrastructure that is a specific example of a more advanced approach to P2P networks called schema-based peer-to-peer networks. Schema-based P2P networks have a number of advantages over simpler P2P networks such as Napster or Gnutella. Instead of prescribing one global schema to describe content, they support arbitrary metadata schemas and ontologies (crucial for the Semantic Web). They thereby allow complex and extensible descriptions of resources in place of the formerly fixed and limited descriptions, and they provide complex query facilities over this metadata instead of simple keyword-based search.
In this paper we elaborate topologies, indices and query routing strategies for efficient query distribution in such networks. Our work is based on the concept of super-peer networks, which provide better scalability than traditional P2P networks. As we show in this paper, adapting existing concepts of mediator-based information systems to super-peer-based networks enables them to support sophisticated routing, clustering and mediation strategies based on metadata schemas and attributes. The resulting routing indices can be built using local clustering policies and support local mediation and transformation rules between heterogeneous schemas; we also sketch some first ideas for implementing these advanced functionalities.
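To make the idea of a schema-based routing index more concrete, the following is a minimal sketch of how a super-peer might map schema elements to the peers that support them and forward a query only to matching peers. It is an illustration under stated assumptions, not the implementation described in the paper: the class `SuperPeerIndex`, its methods, the peer identifiers, and the example elements such as `dc:title` are all hypothetical.

```python
from collections import defaultdict


class SuperPeerIndex:
    """Hypothetical schema-level routing index held by one super-peer."""

    def __init__(self):
        # Maps a schema element (e.g. "dc:title") to the peers that registered it.
        self._peers_by_element = defaultdict(set)

    def register(self, peer_id, elements):
        """Record that peer_id can answer queries over the given schema elements."""
        for element in elements:
            self._peers_by_element[element].add(peer_id)

    def unregister(self, peer_id):
        """Remove a departing peer from every index entry."""
        for peers in self._peers_by_element.values():
            peers.discard(peer_id)

    def route(self, query_elements):
        """Return the peers that support every schema element the query uses."""
        candidate_sets = [
            self._peers_by_element.get(element, set()) for element in query_elements
        ]
        if not candidate_sets:
            return set()
        return set.intersection(*candidate_sets)


# Illustrative usage with made-up peers and schema elements.
index = SuperPeerIndex()
index.register("peer_a", ["dc:title", "dc:creator"])
index.register("peer_b", ["dc:title", "lom:context"])

print(sorted(index.route(["dc:title"])))                 # ['peer_a', 'peer_b']
print(sorted(index.route(["dc:title", "lom:context"])))  # ['peer_b']
print(sorted(index.route(["dc:subject"])))               # []
```

The sketch routes on exact element matches only. Mediation between heterogeneous schemas, as mentioned in the abstract, would presumably require an additional translation step before the lookup, which is not modeled here.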
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Löser, A., Siberski, W., Wolpers, M., Nejdl, W. (2003). Information Integration in Schema-Based Peer-To-Peer Networks. In: Eder, J., Missikoff, M. (eds) Advanced Information Systems Engineering. CAiSE 2003. Lecture Notes in Computer Science, vol 2681. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45017-3_19
DOI: https://doi.org/10.1007/3-540-45017-3_19
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40442-2
Online ISBN: 978-3-540-45017-7
eBook Packages: Springer Book Archive