Skip to main content
Log in

File size models for shared content over the BitTorrent Peer-to-Peer network

  • Published:
Peer-to-Peer Networking and Applications Aims and scope Submit manuscript

Abstract

Peer-to-Peer (P2P) traffic has increased rapidly over the past few years, with file sharing providing the main drive behind such traffic. In this work we perform a measurement study of the content shared over the popular BitTorrent P2P file sharing network. We mathematically model the file size distributions of shared files after categorizing them into Audio, Video, Archive and CD image classes. For each of these categories we look into the most popular shared file formats and investigate their file size statistics. This provides an important milestone to building a realistic simulation framework for P2P systems, and for future analytical modeling of P2P networks.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

Notes

  1. Files in the BitTorrent network are identified by hash values unique to each file.

References

  1. Zhang Y, Chen C, Wang X (2006) Recent advances in research on P2P networks. In: Proceedings of the international conference on parallel and distributed computing, applications and technologies, pp 278–284

  2. Yang J, Ma H, Song W, Cui J, Zhou C (2006) Crawling the eDonkey network. In: Proceedings of the fifth international conference on grid and cooperative computing workshops, pp 133–136

  3. Ipoque (2007) Internet study 2007. Online report. http://www.ipoque.com/sites/default/files/mediafiles/documents/internet-study-2007.pdf

  4. Schulze H, Mochalski K (2009) Internet study 2008/2009. Ipoque. Online report. http://www.ipoque.com/sites/default/files/mediafiles/documents/internet-study-2008-2009.pdf

  5. Saroiu S, Gummadi PK, Gribble SD (2002) A measurement study of peer-to-peer file sharing systems. In: Proceedings of multimedia computing and networking

  6. Sen S, Wang J (2002) Analyzing peer-to-peer traffic across large networks. In: Proceedings of the ACM SIGCOMM workshop on internet measurement, pp 137–150

  7. Tutschku K (2004) A measurement-based traffic profile of the eDonkey filesharing service. In: Proceedings of the annual passive and active measurement workshop, pp 12–21

  8. Handurukande SB, Kermarrec A-M, Le Fessant F, Massoulié L, Patarin S (2006) Peer sharing behaviour in the eDonkey network, and implications for the design of server-less file sharing systems. ACM SIGOPS Oper Syst Rev 40(4):359–371

    Article  Google Scholar 

  9. Caviglionea L, Davolib F (2008) Traffic volume analysis of a nation-wide eMule community. Comput Commun 31(10):2485–2495

    Article  Google Scholar 

  10. Izal M, Urvoy-Keller G, Biersack EW, Felber PA, Al Hambra A, Garcés-Erice L (2004) Dissecting BitTorrent: five months in a torrent’s lifetime. In: Proceedings of the passive and active measurement workshop

  11. Meulpolder M, D’Acunto L, Capota M, Wojciechowski M, Pouwelse JA, Epema DHJ, Sips HJ (2010) Public and private BitTorrent communities: a measurement study. In: Proceedings of the 9th international workshop on peer-to-peer systems

  12. Pouwelse J, Garbacki P, Epema D, Sips H (2005) The BitTorrent P2P file-sharing system: measurements and analysis. In: Proceedings of the international workshop on peer-to-peer systems

  13. Guo L, Chen S, Xiao Z, Tan E, Ding X, Zhang X (2005) Measurements, analysis, and modeling of BitTorrent-like systems. In: Proceedings of the 5th ACM SIGCOMM conference on internet measurement, p 4

  14. Pearson K (1895) Contributions to the mathematical theory of evolution, II: skew variation in homogeneous material. Philos Trans R Soc Lond A 186:343–414

    Article  Google Scholar 

  15. Pearson K (1901) Mathematical contributions to the theory of evolution, X: supplement to a memoir on skew variation. Philos Trans R Soc Lond A 197:443–459

    Article  MATH  Google Scholar 

  16. Draper N, Smith H (1998) Applied regression analysis, 3rd edn. Wiley, New York

    MATH  Google Scholar 

  17. Ernesto (2011) Top 10 most popular torrent sites of 2011. http://torrentfreak.com/top-10-most-popular-torrent-sites-of-2011-110105/

  18. Schlosser D, Hoßfeld T (2009) Mastering selfishness and heterogeneity in mobile P2P content distribution networks with multiple source download in cellular networks. In: Peer-to-peer networking and applications, pp 252–266

  19. Xu J, Wang X, Zhao J, Lim AO (2011) I-swifter: improving chunked network coding for peer-to-peer content distribution. In: Peer-to-peer networking and applications

  20. Ozkasap O, Çaglar M, Alagoz A (2009) Principles and performance analysis of second: a system for epidemic peer-to-peer content distribution. J Netw Comput Appl 32(3):666–683

    Article  Google Scholar 

  21. Wei G, Gu Y, Ge Y (2009) Cluster: an effective solution to the problem of heavy-tailed distribution in P2P networks. In: Proceedings of the international conference on new trends in information and service science, pp 1397–1402

  22. Carra D, Neglia G, Michiardi P, Albanese F (2011) On the robustness of BitTorrent swarms to greedy peers. IEEE Trans Parallel Distrib Syst 22(12):2071–2078

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mohammed Hawa.

Additional information

This work was supported in part by the Deanship of Academic Research at The University of Jordan.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Hawa, M., Rahhal, J.S. & Abu-Al-Nadi, D.I. File size models for shared content over the BitTorrent Peer-to-Peer network. Peer-to-Peer Netw. Appl. 5, 279–291 (2012). https://doi.org/10.1007/s12083-011-0122-6

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s12083-011-0122-6

Keywords

Navigation