Abstract
While Peer-to-Peer (P2P) model gains significant attention in distributed computing, it is also expected to be a powerful model for information sharing. P2P systems are expected to provide exhaustive reliable computational resources and scalable accessibility. The data management and distribution in such systems requires storage, replication, data modeling, indexing, querying, retrieval, streaming, and topology management. While a lot of data management strategies have been proposed through the last years, these strategies have not been investigated with respect to a common model for P2P systems. However, since the services provided by the P2P systems are so diverse, it is very challenging to come up with a common layer-based model for all P2P systems. In this paper, we firstly propose a conceptual model for P2P systems, and then provide a classification and summary of data management and distribution strategies by referring to this model. The horizontal layers of the model correspond to modules of a P2P system whereas the columns are related to the services provided. The modules include base P2P service, storage, indexing, logical, service, and application modules. The services include security, querying, publish, join/leave, collaboration, and streaming. The paper concludes by providing a comprehensive list of data management and distribution strategies used in the existing P2P systems.













Similar content being viewed by others
References
Abad C, Yurcik W, Campbell RH (2004) A survey and comparison of end-system overlay multicast solutions suitable for network-centric warfare. SPIE defense and security symposium / battlespace digitization and network-centric systems IV
Adar E, Huberman BA (2000) Free riding on gnutella. Technical report, Xerox PARC, August, 2000
Adya A, Bolosky WJ, Castro M, Cermak G, Chaiken R, Douceur JR, Howell J, Lorch JR, Theimer M, Wattenhofer RP (2002) FARSITE: Federated, Available, and Reliable Storage for an Incompletely Trusted Environment. 5th symposium on operating systems design and implementation
Agarwal S, Dube S (2006) Gossip based streaming with incentives for peer collaboration. IEEE 8th International Symposium on Multimedia, San Diego, CA, December 2006
Andtroutellis-Theotokis S, Spinellis D (2004) A survey of peer-to-peer content distribution technologies. ACM Comput Surv 34(4):335–371
Arenas M, Kantere V, Kementsietsidis A, Kiringa I, Miller RJ, Mylopoulos J (2003) The hyperion project: from data integration to data coordination. In SIGMOD Record, Special Issue on Peer-to-Peer Data Management 32(3): 53–58
Bawa M, Cooper BF, Crespo A, Daswani N, Ganesan P, Garcia-Molina H, Kamvar S, Marti S, Schlosser M, Sun Q, Vinograd P, Yang B (2003) Peer-to-peer research at Stanford. SIGMOD Record, September 2003
Bellini P, Bruno I, Cenni D, Nesi P, Rogai D (2007) P2P architecture for automated B2B cross media content distribution. In 3rd International Conference on Automated Production of Cross Media Content for Multi-Channel Distribution, AXMEDIS 2007, pp 105–112
BitTorrent http://www.bittorrent.com
Boncz PA, Treijtel C (2003) AmbientDB: relational query processing in a P2P network. International Workshop on Databases, Information Systems, and P2P Computing (DBISP2P) (co-located with VLDB 2003), Volume 2788 of Lecture Notes in Computer Science/Lecture Notes in Artificial Intelligence (LNCS/LNAI), © Springer-Verlag, Berlin, Germany
Bonifati A, Chrysanthis PK, Ouksel AM, Sattler K (2008) Distributed databases and peer-to-peer databases: past and present. SIGMOD Rec 37(1):5–11
Bulterman DCA (2003) Using SMIL to encode interactive, peer-level multimedia annotations. Proceedings of the 2003 ACM symposium on Document Engineering, Grenoble, France, pp 32–41
Castro M, Liskov B (1999) Practical byzantine fault tolerance. In: Proceedings of the Third Symposium on Operating Systems Design and Implementation (OSDI’99), New Orleans, Louisiana
Castro M, Druschel P, Kermarrec A-M, Rowstron A (2002) SCRIBE: a large-scale and decentralized application-level multicast infrastructure. IEEE Jrnl. on Selected Areas in Communications (J-SAC), Sp. Issue on Network Support for Group Communication 20, Oct. 2002
Castro M, Druschel P, Kermarrec A-M, Nandi A, Rowstron A, Singh A (2003) Splitstream: Highbandwidth multicast in cooperative environments. In Proc. of the 20th ACM Symp. on Operating Sys. Principles (SOSP 2003), Oct. 2003
Christensen BG (2006) Experiences Developing Mobile P2P Applications with LightPeers. Proceedings of the Sixth IEEE International Conference on Peer-to-Peer Computing (P2P’06)
Chu Y-H, Rao SG, Seshan S, Zhang H (2003) A case for end system multicast. In IEEE Jrnl. on Selected Areas in Communications (J-SAC), Sp. Issue on Network Support for Group Communication
Clark I, Sandberg O, Wiley B, Hong T (2000) Freenet: a distributed anonymous information storage and retrieval system. Proc. of the Workshop on Design Issues in Anonymity and Unobservabilit. Berkeley, CA, pp 311–320, July 2000
Cohen E, Shenker S (2002) Replication strategies in unstructured P2P networks. SIGCOMM 2002, August 2002
Collection of Computer Science Bibliographies. http://liinwww.ira.uka.de/csbib. Accessed in December 2006
Conti M, Crowcroft J, Delmastro F, Passarella A (2003) P2P support for Group-Communication Applications: a Cross-Layer approach for MANET Environments. IEEE INFOCOM 2006, Barcelona, Spain
Cordasco G, Scarano V, Vitolo C (2004) A P2P Distributed Adaptive Directory. Proc. of the 13th Int. Conf. on World Wide Web, WWW 2004, New York, NY
Dabek F, Kaashoek MF, Karger D, Morris R, Stoica I (2001) Wide-area cooperative storage with CFS. ACM SOSP 2001, Banff, October 2001
Dan W, Rongjuan Z (2005) A layered resource management model in P2P system. Proc. of the 6th International Conference on Parallel and Distributed Computing, Applications, and Technologies (PDCAT’05)
Danezis G, Lesniewski-Laas C, Kaashoek MF, Anderson R (2005) Sybil-resistant DHT routing. In: Proc. 10th European Symposium on Research in Comp. Sec., Milan, Italy, September 2005, pp 305–318
Douceur JR (2002) The Sybil attack. In: Proceedings for the 1st International Workshop on Peer-to-Peer Systems (IPTPS ’02), Cambridge, Massachusetts
Ding H, Sølvberg I, Lin Y (2004) A Vision on Semantic Retrieval in P2P Network. International Conference on Advanced Information Networking and Applications (AINA 2004), March 2004, Fukuoka, Japan
Eugster P, Handurukande S, Guerraoui R, Kermarrec A-M, Kouznetsov P (2001) Lightweight probabilistic broadcast. In: Proc. of the Intl. Conf. on Dependable Sys. and Networks (DSN 2001), July 2001
Fahrenholtz D, Turau V (2004) A Tree-based DHT Approach to Scalable Weakly Consistent Peer-to-Peer Data Management. PDMST ‘04 1st International Workshop on Peer2Peer Data Management, Security and Trust
Francis P, Pryadkin Y, Radoslavov P, Govindan R, Lindell B (2006) YOID: Your Own Internet Distribution. http://www.isi.edu/div7/yoid/, Accessed in December 2006
Free Haven http://freehaven.net.
Goldberg AV, Yianilos PN (1998) Towards an Archival Intermemory. Proceedings of IEEE Advances in Digital Libraries (ADL 98)
Habib A, Chuang J (2004) Incentive Mechanism for Peer-to-Peer Media Streaming. International Workshop on Quality of Service (IWQoS) pp. 171–180, June, 2004
Harwood A, Nutanong S, Tanin E, Truong MT (2004) Complex Applications over Peer-to-Peer Networks. ACM/IFIP/USENIX 5th International Middleware Conference, Toronto, Canada
Halevy AY, Ives ZG, Madhavan J, Mork P, Suciu D, Tatarinov I (2004) The piazza peer data management system. IEEE Trans Knowl Data Eng 16(7):787–798
Hamlen KW, Thuraisingham B (2007) Secure peer-to-peer networks for trusted collaboration. Collaborative Computing: Networking, Applications and Worksharing, 2007. CollaborateCom 2007. International Conference on, vol., no., pp.58–63, 12–15 Nov. 2007
Hasan R, Anwar Z, Yurcik W, Brumbaugh L, Campbell R (2005) A survey of peer-to-peer storage techniques for distributed file systems. In International Conference on Information Technology: Coding and Computing, ITCC, vol 2. pp 205–213
Hefeeda M, Habib A, Botev B, Xu D, Bhargava B (2003) PROMISE: Peer-to-Peer Media Streaming Using CollectCast. ACM Multimedia, 2003, Berkeley, California, USA
Helder DA, Jamin S (2002) End-host multicast communication using switch-tree protocols. In: Proc. of the 2nd Workshop on Global and Peer-to-Peer Computing on Large Scale Distributed Sys. (GP2PC 2002), May 2002
Hellerstein JM (2003) Toward Network Data Independence. ACM SIGMOD Record, SPECIAL ISSUE: Special topic section on peer to peer data management, Volume 32, Issue 3, pp. 34–40, September, 2003
Hyojin P, Jinhong Y, Juyoung P, Shin GK, Jun KC (2008) A survey on peer-to-peer overlay network schemes. In: Advanced Communication Technology, 2008. ICACT 2008. 10th International Conference on, vol 2, no., pp 986–988, 17–20 Feb. 2008
Jannotti J, Gifford DK, Johnson KL, Kaashoek MF, O’Toole JW, Jr. (2000) Overcast: Reliable multicasting with an overlay network. In: Proc. of the 4th Usenix Symp. on Operating Sys. Design and Implementation (OSDI 2000), Oct. 2000
Jiang X, Dong Y, Xu D, Bhargava B (2003) GNUSTREAM: a P2P Media Streaming System Prototype. IEEE International Conference on Multimedia & Expo (ICME)
JXTA (2006) http://www.jxta.org. Accessed in December 2006
Kalapriya K, Nandy SK, Babu VK (2004) Can Streaming Of Stored Playback Video Be Supported On Peer to Peer Infrastructure? In: Proceedings of 18th International Conference on Advanced Information Networking and Application (AINA’04), vol 2. Fukuoka City, Japan, pp 200–203
Kalogeraki V, Chen F (2004) Managing distributed objects in peer-to-peer networks. IEEE Networks Magazine, special issue on Middleware Technologies for future Communication Networks 18(1):22–29
Karnstedt M, Hose K, Sattler K (2004) Query Routing and Processing in Schema-Based P2P Systems. Proceedings of the 15th International Workshop on Database and Expert Systems Applications (DEXA’04)
Kazaa http://www.kazaa.com
Kementsietsidis A, Arenas M, Miller RJ (2003) Mapping Data in Peer-to-Peer Systems Semantics and Algorithmic Issues. SIGMOD 2003, pp 325–336, June, 2003, San Diego, CA
Kokkinidis G, Christophides V (2004) Semantic Query Routing and Processing in P2P Database Systems: The ICSFORTH SQPeer Middleware. HDMS ‘04, Athens, Greece
Kostic D, Rodriguez A, Albrecht J, Vahdat A (2003) Bullet: High bandwidth data dissemination using an overlay mesh. In: Proc. of the 20th ACM Symp. on Operating Sys. Principles (SOSP 2003), Oct. 2003
Kwon G, Ryu KD (2004) BYPASS: Topology-Aware Lookup Overlay for DHT-based P2P File Locating. In the 10th IEEE International Conference on Parallel and Distributed Systems (ICPADS), Newport Beach CA, July 2004
Li Z, Mohapatra P (2003) Hostcast: A new overlay multicasting protocol. In: Proc. IEEE 2003 Intl. Conf. on Communications (ICC 2003), May 2003
Lin WK, Chiu DM, Lee YB (2004) Erasure Code Replication Revisited. Proceedings of the 4th IEEE International Conference on Peer-to-Peer Computing (P2P’04), August, 2004
Liu J, Rao SG, Li B, Zhang H (2006) Opportunities and Challenges of Peer-to-Peer Internet Video Broadcast, invited by Proceedings of the IEEE
Liu Y, Guo Y, Liang C (2008) A survey on peer-to-peer video streaming systems. 1(1):18–28, March, 2008
Lua EK, Crowcroft J, Pias M, Sharma R, Lim S (2005) A survey and comparison of peer-to-peer overlay network schemes. Communications Surveys & Tutorials, IEEE 7(2):72–93
Lv Q, Cao P, Cohen E, Li K, Shenker S (2002) Search and Replication in Unstructured Peer-to-Peer Networks. In: Proceedings of 16th ACM International Conference on Supercomputing, June, 2002
Magharei N, Rejaie R (2007) Prime: peer-to-peer receiver driven mesh-based streaming. In: Proceedings of IEEE INFOCOM
Maniatis P, Roussopoulos M, Giuli TJ, Rosenthal DSH, Baker M (2005) The LOCKSS peer-to-peer digital preservation system. ACM Trans Comput Syst 23(1):2–50
Maxim R, Hui SC (2003) Intelligent Content-Based Retrieval for P2P Network. International Conference on Cyberworlds, p 318, December 2003
Mielke M, Aygun RS, Song Y, Zhang A (2002) PLUS: probe-loss utilization streaming mechanism for distributed multimedia presentation systems. IEEE Trans Multimedia 4(4):561–577
Milojicic DS, Kalogeraki V, Lukose R, Nagaraja K, Pruyne J, Richard B, Rollins S, Xu Z (2002) “Peer-to-peer computing,” Technical Report HPL-2002-57, HP Lab
Muthitacharoen A, Morris R, Gil TM, Chen B (2002) Ivy: a read/write peertopeer file system. In the Proceedings of the 5th USENIX Symposium on Operating Systems Design and Implementation (OSDI '02), Boston, Massachusetts, December 2002
Napster, http://www.napster.com
Nejdl W, Siberski W, Sintek M (2003) Design issues and challenges for RDF- and schema-based peer-to-peer systems. ACM SIGMOD Record, SPECIAL ISSUE: Special topic section on peer to peer data management table of contents 32(3):41–46
Nejdl W, Wolf B, Qu C, Decker S, Sintek M, Naeve A, Nilsson M, Palmr M, Risch T (2002) EDUTELLA: a P2P Networking Infrastructure based on RDF. 11th International World Wide Web Conference Hawaii, USA, May 2002
Ng WS, Ooi BC, Tan KL, Zhou AY (2003) PeerDB: A P2P-based system for distributed data sharing. In: Proceedings of the 19th International Conference on Data Engineering, Bangalore, India, March 2003
OceanStore http://oceanstore.cs.berkeley.edu
Ooi BC, Shu Y, Tan KL (2003) Relational Data Sharing in Peer-based Data Management Systems. ACM Sigmod Record, Special issue on P2P data management 32(3):59–64
Osais Y, Abdala S, Matrawy A (2006) Multilayer peer-to-peer framework for distributed synchronous collaboration. IEEE Internet Comput 10(6):33–41
Ouksel AM (2003) In-context peer-to-peer information filtering on the Web. ACM SIGMOD Record, SPECIAL ISSUE: Special topic section on peer to peer data management 32(3):65–70
Qu C, Nejdl W (2003) Searching SCORM Metadata in a RDF-based E-Learning P2P Network Using XQuery and Query by Example. In: Proc. of the 3rd IEEE International Conference on Advanced Learning Technologies (IEEE ICALT 2003), July 2003, Athens, Greece
Patro S, Hu YC (2003) Transparent Query Caching in Peer-to-Peer Overlay Networks. In: Proceedings of the 17th International Parallel and Distributed Processing Symposium (IPDPS), Nice, France, April 2003
Pendarakis D, Shi S, Verma D, Waldvogel M (2001) ALMI: an application level multicast infrastructure. In: Proc. of the 3rd Usenix Symp. on Internet Technologies and Sys. (USITS 2001), Mar. 2001
Pitoura E, Abiteboul S, Pfoser D, Samaras G, Vazirgiannis M (2003) DBGlobe: a service-oriented P2P system for global computing. ACM SIGMOD Rec 32(3):77–82
Peer Name Resolution Protocol http://www.microsoft.com/technet/network/p2p/pnrp.mspx. published in September 2006
Portmann M, Ardon S, Senac P, Seneviratne A (2004) PROST: a programmable structured peer-to-peer overlay network. Proc. of the 4th Int. Conf. Peer-to-Peer computing, Zurich, Switzerland
Ratnasamy S, Francis P, Handley M, Karp R, Shenker S (2001a) A scalable content-addressable network. In: proceedings of the 2001 ACM SIGCOMM, pp 161–172
Ratnasamy S, Handley M, Karp R, Shenker S (2001b) Application-level multicast using content-addressable networks. In: Proc. of 3rd Intl. Workshop on Networked Group Communication (NGC 2002), Nov. 2001
Resource Description Framework (RDF) http://www.w3.org/RDF/
Risson J, Moors T (2006) Survey of research towards robust peer-to-peer networks: search methods. Comput Netw 50(17):3485–3521
Rowstron A, Druschel P (2001a) Storage management and caching in PAST, a large-scale, persistent peer-to-peer storage utility. 18th ACM SOSP’01, Lake Louise, Alberta, Canada
Rowstron A, Druschel P (2001b) Pastry: Scalable, distributed object location and routing for large-scale peer-to-peer systems. In: Proceedingsof IFIP/ACMMiddleware. Heidelberg, Germany
Ryu S, Butler K, Traynor P, McDaniel P (2007) Leveraging Identity-Based Cryptography for Node ID Assignment in Structured P2P Systems. Advanced Information Networking and Applications Workshops, 2007, AINAW ‘07. 21st International Conference on, vol.1, no., pp519–524, 21–23 May 2007
Sahin OD, Gupta A, Agrawal D, El Abbadi A (2004) A Peer-to-peer Framework for Caching Range Queries. In 20th International Conference on Data Engineering.Boston, Massachusetts, pp 165–176
Serafini L, Giunchiglia F, Mylopoulos J, Bernstein P (2001) The Local Relational Model: Model and Proof Theory. IRST Technical Report 0112-23, Instituto Trentino di Cultura, December 2001
Shirky C (2001) Listening to Napster. Book chapter in Peer-to-Peer: harnessing the power of disruptive technologies, edited by Oram, Andy, 2001, O’Reilly
Stein CA, Tucker MJ, Seltzer MI (2002) Building a Reliable Mutable File System on Peer-to-Peer Storage. 21st IEEE Symposium on Reliable Distributed Systems (SRDS’02), p 324
Stoica I, Morris R, Karger DR, Kaashock MF, Balakrishman H (2001) Chord: A scalable peer-to-peer lookup protocol for internet applications. In: Proceedings of the ACM SIGCOMM, pp. 149–160, San Diego, California, August 2001
Tran DA, Hua K, Do T (2004) A peer-to-peer architecture for media streaming. IEEE Journal on Selected Areas in Communications, Special Issue on Advances in Service Overlay Networks 22:121–133, Jan 2004
Tsoumakos D, Roussopoulos N (2003) Adaptive Probabilistic Search for Peer-to-Peer Networks. In: 3rd IEEE Intl Conference on P2P Computing
Tsoumakos D, Roussopoulos ND (2006) APRE: A Replication Method for Unstructured P2P Networks. Technical Reports of the Computer Science Department of University of Maryland, February, 2006
Tsybulnik N, Hamlen KW, Thuraisingham B (2007) Centralized Security Labels in Decentralized P2P Networks. Computer Security Applications Conference, 2007. ACSAC 2007. Twenty-Third Annual, vol., no., pp 315–324, 10–14 Dec. 2007
Turcan E, Shahmehri N, Graham RL (2002) Intelligent Software Delivery Using P2P. Proceedings of the Second International Conference on Peer-to-Peer Computing (P2P’02)
Villamil M-D-P, Roncancio C, Labbe C (2004) Pins: Peer to Peer Interrogation and Indexing System. Database Engineering and Applications Symposium, 2004. IDEAS ‘04, pp 236–245
Vlavianos A, Iliofotou M, Faloutsos M (2006) Bitos: enhancing bittorrent for supporting streaming applicati ons. In 9th IEEE global internet symposium 2006, April 2006
Wallach DS (2002) A survey of peer-to-peer security issues. International Symposium on Software Security, Tokyo, Japan
Wang C, Xiao L, Liu Y, Zheng P (2006) DiCAS: an efficient distributed caching mechanism for P2P systems. IEEE Trans Parallel Distrib Syst 17(10):1097–1109
Windows Peer-to-Peer Networking. http://www.microsoft.com/technet/prodtechnol/winxppro/deploy/p2pintro.mspx. Accessed in December 2006
Xiang Z, Zhang Q, Zhu W, Zhang Z, Zhang Y-Q (2004) Peer-to-peer based multimedia distribution service. IEEE Trans Multimedia 6(2):343–356
XML Base http://www.w3.org/TR/xmlbase/
Xu D, Hefeeda M, Hambrusch S, Bhargava B (2002) On Peer-to-Peer Media Streaming. Proceedings of the 22nd International Conference on Distributed Computing Systems (ICDCS’02), pp 363–371
Xu Z, Min R, Hu Y (2003) HIERAS: A DHT-Based Hierarchical Peer-to-Peer Routing Algorithm. Proceedings of the 2003 International Conference on Parallel Processing (ICPP’03). Kaohsiung, Taiwan, pp 187–194, October, 2003
Yang C (2003) Peer to Peer Architecture for Content Based Music Retrieval on Acoustic Data. WWW 2003, Budapest, Hungary, pp 376–383, May 20–24, 2003
Zhang X, Liu J, Li B, Yum T-SP (2005) CoolStreaming/DONet: A Data-driven Overlay Network for Live Media Streaming, IEEE INFOCOM’05, Miami, FL, USA, March 2005
Zhang M, Tan K-L (2003) Supporting Rich Queries in DHT-Based Peer-to-Peer Systems. Twelfth International Workshop on Enabling Technologies: Infrastructure for Collaborative Enterprises, pp. 95, 2003 Linz, Austria, Y. Zhu, H. Wang, and Y. Hu. Integrating Semantics-Based Access Mechanisms with P2P File Systems. Third International Conference on Peer-to-Peer Computing (P2P’03), pp 118
Zhao BY, Huang L, Stribling J, Rhea SC, Joseph AD, Kubiatowicz JD (2004) Tapestry: a resilient global-scale overlay for service deployment. IEEE J Sel Areas Commun 22(1):41–53
Zhou J, Bhuyan LN, Banerjee A (2008) An effective pointer replication algorithm in P2P networks. Parallel and Distributed Processing, 2008. IPDPS 2008. IEEE International Symposium on, vol., no., pp1–11, 14–18 April 2008
Zhuang SQ, Zhao BY, Joseph AD, Katz R, Kubiatowicz J (2001) Bayeux: An architecture for scalable and fault-tolerant wide-area data dissemination. In: Proc. of the 11th Intl. Workshop on Network and Operating Sys. Support for Digital Audio and Video (NOSSDAV 2001), June 2001
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Aygün, R.S., Ma, Y., Akkaya, K. et al. A conceptual model for data management and distribution in peer-to-peer systems. Peer-to-Peer Netw. Appl. 3, 294–322 (2010). https://doi.org/10.1007/s12083-009-0061-7
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12083-009-0061-7