ABSTRACT
RDF-based P2P networks have a number of advantages compared with simpler P2P networks such as Napster, Gnutella or with approaches based on distributed indices such as CAN and CHORD. RDF-based P2P networks allow complex and extendable descriptions of resources instead of fixed and limited ones, and they provide complex query facilities against these metadata instead of simple keyword-based searches.In previous papers, we have described the Edutella infrastructure and different kinds of Edutella peers implementing such an RDF-based P2P network. In this paper we will discuss these RDF-based P2P networks as a specific example of a new type of P2P networks, schema-based P2P networks, and describe the use of super-peer based topologies for these networks. Super-peer based networks can provide better scalability than broadcast based networks, and do provide perfect support for inhomogeneous schema-based networks, which support different metadata schemas and ontologies (crucial for the Semantic Web). Furthermore, as we will show in this paper, they are able to support sophisticated routing and clustering strategies based on the metadata schemas, attributes and ontologies used. Especially helpful in this context is the RDF functionality to uniquely identify schemas, attributes and ontologies. The resulting routing indices can be built using dynamic frequency counting algorithms and support local mediation and transformation rules, and we will sketch some first ideas for implementing these advanced functionalities as well.
- K. Aberer and M. Hauswirth. Semantic gossiping. In Database and Information Systems Research for Semantic Web and Enterprises, Invitational Workshop, University of Georgia, Amicalola Falls and State Park, Georgia, April 2002.Google Scholar
- L. A. Adamic, R. M. Lukose, A. R. Puniyani, and B. A. Huberman. Search in Power-law Networks. In Physical Review E, 64 46135, 2001.Google ScholarCross Ref
- P. A. Bernstein, F. Giunchiglia, A. Kementsietsidis, J. Mylopoulos, L. Serafini, and I. Zaihrayeu. Data management for peer-to-peer computing: A vision. In Proceedings of the Fifth International Workshop on the Web and Databases, Madison, Wisconsin, June 2002.Google Scholar
- S. Busse. Model Correspondences in Continuous Engineering of MBIS - doctoral thesis. Logos Verlag, September 2002.Google Scholar
- S. Chawathe, H. Garcia-Molina, J. Hammer, K. Ireland, Y. Papakonstantinou, J. Ullman, and J. Widom. The TSIMMIS project: Integration of heterogeneous information sources. In Proceedings of IPSJ Conference, Tokyo, Japan, October 1994.Google Scholar
- A. Crespo and H. Garcia-Molina. Routing indices for peer-to-peer systems. In Proceedings International Conference on Distributed Computing Systems, July 2002. Google ScholarDigital Library
- A. Crespo and H. Garcia-Molina. Semantic overlay networks, November 2002. Submitted for publication.Google Scholar
- M. Harren, J. M. Hellerstein, R. Huebsch, B. T. Loo, S. Shenker, and I. Stoica. Complex queries in DHT-based peer-to-peer networks. In F. Kaashoek and A. Rowstron, editors, Proceedings for the 1st International Workshop on Peer-to-Peer Systems (IPTPS '02), March 2002. Google ScholarDigital Library
- R. Korfhage. Information Storage and Retrieval. John Wiley, New York, 1997. Google ScholarDigital Library
- U. Leser. Query Planning in Mediator Based Information Systems - doctoral thesis. TU Berlin, June 2000.Google Scholar
- G. S. Manku and R. Motwani. Approximate frequency counts over data streams. In Proceedings of the 28th International Conference on Very Large Data Bases, Hong Kong, China, August 2002. Google ScholarDigital Library
- W. Nejdl, B. Wolf, C. Qu, S. Decker, M. Sintek, A. Naeve, M. Nilsson, M. Palmér, and T. Risch. EDUTELLA: a P2P Networking Infrastructure based on RDF. In WWW 11 Conference Proceedings, Hawaii, USA, May 2002. Google ScholarDigital Library
- S. Ratnasamy, P. Francis, M. Handley, R. Karp, and S. Shenker. A scalable content addressable network. In Proceedings of the 2001 Conference on Applications, Technologies, Architectures, and Protocols for Computer Communications. ACM Press New York, NY, USA, 2001. Google ScholarDigital Library
- G. Salton. Automatic Text Processing: The Transformation, Retrieval and Analysis of Information by Computer. Addison Wesley, Reading, MA, 1989. Google ScholarDigital Library
- S. Saroiu, P. K. Gummadi, and S. D. Gribble. A measurement study of peer-to-peer file sharing systems. In Proceedings of Multimedia Computing and Networking (MMCN), January 2002.Google Scholar
- M. Schlosser, M. Sintek, S. Decker, and W. Nejdl. HyperCuP--Hypercubes, Ontologies and Efficient Search on P2P Networks. In International Workshop on Agents and Peer-to-Peer Computing, Bologna, Italy, July 2002.Google ScholarDigital Library
- I. Stoica, R. Morris, D. Karger, M. F. Kaashoek, and H. Balakrishnan. Chord: A scalable peer-to-peer lookup service for internet applications. In Proceedings of the 2001 Conference on Applications, Technologies, Architectures, and Protocols for Computer Communications. ACM Press New York, NY, USA, 2001. Google ScholarDigital Library
- G. Wiederhold. Mediators in the architecture of future information systems. IEEE Computer, 25(3):38--49, 1992. Google ScholarDigital Library
- B. Yang and H. Garcia-Molina. Designing a super-peer network. http://dbpubs.stanford.edu:8090/pub/2002-13, 2002.Google Scholar
Index Terms
- Super-peer-based routing and clustering strategies for RDF-based peer-to-peer networks
Recommendations
A subscribable peer-to-peer RDF repository for distributed metadata management
In this paper, we present a scalable peer-to-peer RDF repository, named RDFPeers, which stores each triple in a multi-attribute addressable network by applying globally known hash functions. Queries can be efficiently routed to the nodes that store ...
Understanding churn in peer-to-peer networks
IMC '06: Proceedings of the 6th ACM SIGCOMM conference on Internet measurementThe dynamics of peer participation, or churn, are an inherent property of Peer-to-Peer (P2P) systems and critical for design and evaluation. Accurately characterizing churn requires precise and unbiased information about the arrival and departure of ...
Peer-to-peer multimedia applications
MM '06: Proceedings of the 14th ACM international conference on MultimediaIn both academia and industry, peer-to-peer (P2P) applications have attracted great attention. Peer-to-peer file sharing applications, such as Napster, Gnutella, Kazaa, BitTorrent, Skype and PPLive, have witnessed tremendous success among end users. And ...
Comments