skip to main content
10.1145/1866480.1866506acmconferencesArticle/Chapter ViewAbstractPublication PagesideasConference Proceedingsconference-collections
research-article

Using Monte Carlo simulation for improving data availability in P2P network

Published: 16 August 2010 Publication History

Abstract

In this paper we present a replication strategy to improve data availability in P2P Networks. The focus of the paper is to replicate data to nodes which are highly available and complement one another in terms of uptimes. This would decrease the replication management overhead until the number of replicas falls to a certain threshold. Replication to reliable node would improve the cost of replication by avoiding irregular nodes of the network. We run Monte Carlo simulation based on past traces of Kad, OverNet, Bittorrent and PlanetLab, to present how our replication keeps data sustainable in the network. In our evaluation we show that a life pattern along with the availability of nodes improves overall data availability. We perform our evaluation on a Kademlia network, and show that our approach reduces over head compared to existing approaches to data availability.

References

[1]
}}Bhagwan, R., Savage, S., and Voelker, M. G. Replication strategies for highly available peer-to-peer storage systems. In Proc. of FuDiCo: Future directions in Distributed Computing (2002).
[2]
}}Bhagwan, R., Savage, S., and Voelker, M. G. Understanding availability. In Proc. of the 2nd International Workshop on Peer-to-peer systems, (IPTPS'02) (2002).
[3]
}}Blake, C., and Rodrigues, R. High availability, scalable storage, dynamic peer networks: Pick two. In Proc. of the 9th Workshop on Hot Topics in Operating Systems, (HotOS'03) (2003).
[4]
}}Dandoush, A., Alouf, S., and Nain, P. P2p storage systems modeling, analysis and evaluation. Tech. Rep. RR-6392, Inria, December 2007.
[5]
}}Godfrey, P. B., Shenker, S., and Stoica, I. Minimizing churn in distributed systems. In Proc. of ACM SIGCOMM Š06 (2006).
[6]
}}Gray, J., Helland, P., and Shasha, D. The dangers of replication and a solution. In Proc. of ACM SIGMOD International Conference on Management of Data (1996), pp. 173--182.
[7]
}}Izal, M., Keller, U. G., Biersack, E. W., Felber, P., Hamra, A. A., and Garcés-Erice, L. Dissecting bittorrent: Five months in a torrent's lifetime. In Proc. of In Passive and Active Measurements, (PAM'04) (2004).
[8]
}}Jelasity, M., Montresor, A., Jesi, G. P., and Voulgaris, S. The Peersim simulator. http://peersim.sf.net.
[9]
}}Kingsley, G. Z. Relative frequency as a determinant of phonetic change. Harvard Studies in Classical Philology 40 (1929), 1--95.
[10]
}}Leslie, M., Davies, J., and Huffman, T. A comparison of replication strategies for reliable decentralised storage. JOURNAL OF NETWORKS 1, 6 (2006).
[11]
}}Long, D., Muir, A., and Golding, R. A longitudinal survey of internet host reliability. In Proc. of the 14th Symposium on Reliable Distributed Systems (1995), p. 2--9.
[12]
}}Nurmi, D., and Brevik, J. Modelling machine availability in enterprise and wide-area distributed computing environments. In Proc. of Euro-Par 2005 (2005), pp. 432--441.
[13]
}}Picconi, F., Baynat, B., and Sens, P. Predicting durability in dhts using markov chains. In Proc. of the 2nd International Conference on Digital Information Management, (ICDIM'07) (2007).
[14]
}}Ramabhadran, S., and Pasquale, J. Analysis of long-running replicated systems. In Proc. of the 25th IEEE Annual Conference on Computer Communications, (INFOCOM'06) (2006).
[15]
}}Ranganathan, K., Iamnitchi, A., and Foster, I. Improving data availability through dynamic model-driven replication in large peer-to-peer communities. In Proc. of the 2nd IEEE/ACM International Symposium Cluster Computing and the Grid, (CCGrid'02) (2002), pp. 376--376.
[16]
}}Schwarz, T., Xin, Q., and Miller, E. L. Availability in global peer-to-peer storage systems. In Proc. of Workshop on Distributed Data and Structures, (WDAS'04) (2004).
[17]
}}Shi, S., Yang, G., Yu, J., Wu, Y., and Wang, D. Improving availability of p2p storage systems. In Proc. of International Workshop on Advanced Parallel Processing Technologies (2003), pp. 446--456.
[18]
}}Sripanidkulchai, K. The popularity of gnutella queries and its implications on scalability.
[19]
}}Steiner, M., Biersack, E. W., and Ennajjary, T. Actively monitoring peers in kad. In Proc. of 6th International Workshop on Peer-to-Peer Systems, (IPTPS'07) (2007).
[20]
}}Utard, G., and Vernois, A. Data durability in peer to peer storage systems. In Proc. of the 4th IEEE International Symposium on Cluster Computing and the Grid, (CCGrid'04) (2004), pp. 90--97.
[21]
}}Wang, J.-L. Markov-chain based reliability analysis for distributed systems. Computers and Electrical Engineering 30, 3 (2004), 183--205.
[22]
}}Williams, C., Huibonhoa, P., Holliday, J., Hospodor, A., and Schwarz, T. Redundancy management for p2p storage. In Proceedings of the Seventh IEEE International Symposium on Cluster Computing and the Grid, (CCGRID '07) (2007), pp. 15--22.
[23]
}}Yalag, P., Nath, S., Yu, H., Gibbons, P. B., and Seshan, S. Beyond availability towards a deeper understanding of machine failure characteristics in large distributed systems.

Cited By

View all
  • (2011)Exploring peer heterogeneity: Towards understanding and application2011 IEEE International Conference on Peer-to-Peer Computing10.1109/P2P.2011.6038657(20-29)Online publication date: Aug-2011

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
IDEAS '10: Proceedings of the Fourteenth International Database Engineering & Applications Symposium
August 2010
282 pages
ISBN:9781605589008
DOI:10.1145/1866480
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 16 August 2010

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Monte Carlo simulation
  2. P2P network
  3. availability
  4. fault tolerance
  5. replication

Qualifiers

  • Research-article

Funding Sources

Conference

IDEAS '10
Sponsor:
  • ACM
  • Concordia University

Acceptance Rates

Overall Acceptance Rate 74 of 210 submissions, 35%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)1
  • Downloads (Last 6 weeks)0
Reflects downloads up to 02 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2011)Exploring peer heterogeneity: Towards understanding and application2011 IEEE International Conference on Peer-to-Peer Computing10.1109/P2P.2011.6038657(20-29)Online publication date: Aug-2011

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media