Skip to main content
Log in

AutoProc: An automatic proactive replication scheme for P2P storage

  • Research Papers
  • Special Focus
  • Published:
Science China Information Sciences Aims and scope Submit manuscript

Abstract

Due to the prevalence of peer departures, P2P storage systems need to constantly generate replicas for high durability. However, few existing systems are able to select the right replication rate that balances replication cost and durability, e.g., minimizing replication cost without threatening the durability target. In this paper, we propose a method called AutoProc to automate the replication process in P2P storage systems. AutoProc employs two key techniques to enable cost-effective replication: correctly adapting replication rate with a reliability model, and replicating objects at the right order with a reliability indicator. Extensive simulations based on the real system failure trace show that AutoProc achieves the near-optimal performance.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Yang Z, Zhao B Y, Xing Y J, et al. AmazingStore: Available, low-cost online storage service using cloudlets. In: Proceedings of the 9th International Workshop on Peer-to-Peer Systems, San Jose, USA, 2010. 1–5

  2. Gharaibeh A, Ripeanu M. Exploring data reliability tradeoffs in replicated storage systems. In: Proceedings of the 18th ACM International Symposium on High Performance Distributed Computing, Munich, Germany, 2009. 217–226

  3. Blake C, Rodrigues R. High availability, scalable storage, dynamic peer networks: Pick two. In: Proceedings of the 9 th Workshop on Hot Topics in Operating Systems, Hawaii, USA, 2003. 1–6

  4. Tati K, Voelker G. On object maintenance in peer-to-peer systems. In: Proceedings of the 6th International Workshop on Peer-to-Peer Systems, Santa Barbara, USA, 2006

  5. Chun B G, Dabek F, Haeberlen A, et al. Efficient replica maintenance for distributed storage systems. In: Proceedings of the 3rd USENIX symposium on Networked Systems Design Implementation, San Jose, USA, 2006. 45–58

  6. Bhagwan R, Tati K, Cheng Y, et al. Total recall: System support for automated availability management. In: Proceedings of the 1st Symposium on Networked Systems Design and Implementation, San Francisco, USA, 2004

  7. Weatherspoon H, Chun B G, So C, et al. Long-term data maintenance in wide-area storage systems: A quantitative approach. Technical Report UCB/CSD-05-1404. 2005

  8. Mislove A, Post A, Haeberlen A, et al. Experiences in building and operating a reliable peer-to-peer application. In: Proceedings of the 1st EuroSys Conference, Leuven, Belgium, 2006

  9. Stribling J, Councill I G, Li J, et al. OverCite: A cooperative digital research library. In: Proceedings of the 4th International Workshop on Peer-to-Peer Systems, Ithaca, USA, 2005

  10. Yang Z, Dai Y F, Xiao Z. Exploring the cost-availability tradeoff in P2P storage systems. In: Proceedings of the 38th International Conference on Parallel Processing, Vienna, Austria, 2009

  11. Chun B G, Dabek F, Haeberlen A, et al. Proactive replication for data durability. In: Proceedings of the 6th International Workshop on Peer-to-Peer Systems, Santa Barbara, USA, 2006

  12. Yang M, Zhao B Y, Dai Y, et al. Deployment of a large scale peer-to-peer social network. In: Proceedings of WORLDS, San Francisco, USA, 2004

  13. Ramabhadran S, Pasquale J. Analysis of durability in replicated distributed storage systems. In: Proceedings of International Parallel & Distributed Processing Symposium (IPDPS), Atlanta, USA, 2010

  14. Duminuco A, Biersack E, En-Najjary T. Proactive replication in distributed storage systems using machine availability estimation. In: Proceedings of the 3rd International Conference on Emerging Networking Experiments and Technologies (CoNEXT), New York, USA, 2007

  15. Datta A, Aberer K. Internet-scale storage systems under churn-a study of the steady state using Markov models. In: Proceedings of IEEE International Conference on Peer-to-Peer Computing (P2P), Cambridge, UK, 2006

  16. Ford D, Labelle F, Popovici F I, et al. Availability in globally distributed storage systems. In: Proceedings of the 9th USENIX Symposium on Operating Systems Design and Implementation, Vancouver, Canada, 2010

  17. Ghemawat S, Gobioff H, Leung S. The Google file system. In: Proceedings of the 19th ACM Symposium on Operating Systems Principles, Bolton Landing, USA, 2003. 29–43

  18. Aldous D, Brown M. Inequalities for rare events in time-reversible Markov chains II. Stochastic Process Appl, 1993, 44: 15–25

    Article  MathSciNet  MATH  Google Scholar 

  19. Trivedi K S. Probability and Statistics with Reliability, Queuing, and Computer Science Applications. 2nd ed. New York: John Wiley Sons, 2001

    Google Scholar 

  20. Ross S. Stochastic Processes. 2nd ed. New York: Wiley, 1996

    MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to YaFei Dai.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Yang, Z., Dai, Y. AutoProc: An automatic proactive replication scheme for P2P storage. Sci. China Inf. Sci. 54, 1151–1160 (2011). https://doi.org/10.1007/s11432-011-4260-5

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11432-011-4260-5

Keywords

Navigation