Abstract
Despite the leaps and bounds made by the P2P research field in the last few years, the benefit of this innovation has been constrained to a few areas; search and file-sharing and storage to name a few. In particular, this innovation has had little significant impact in the field of distributed computing.
There are several obstacles to be overcome in the development of any distributed computer, most notably: scalability, fault tolerance, security and load balancing. The difficulty of these is compounded in the dynamic, decentralized environment which characterizes the P2P arena. This paper presents a method of recovering from faults which exploits the distributed hash table functionality provided by modern overlay networks. Its effectiveness is evaluated experimentally using a proof of concept P2P distributed computer.
It is hoped that by providing a solution to one of the obstacles, global, decentralized, dependable distributed computers will be one step closer to reality.
This work is supported by Science Foundation Ireland.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
The SETI@Home home page (2002), http://setiathome.ssl.berkerly.edu
The JXTA Project (2003), http://www.jxta.org/
Al-Dmour, N., Teahan, W.J.: ParCop: A Decentralized Peer-to-Peer Computing System. In: Third International Symposium on Parallel and Distributed Computing, pp. 162–168 (July 2004)
Butt, A.R., Fang, X., Midkiff, S.: Java, Peer-to-Peer, and Accountability: Building Blocks for Distributed Cycle Sharing. In: Proceedings of the 3rd USENIX Virtual Machines Research and Technology Syposium (May 2004)
Bubak, M., Paszczak, P.: HYDRA - Decentralised and Adaptative Approach to Distributed Computing. In: Sørevik, T., Manne, F., Moe, R., Gebremedhin, A.H. (eds.) PARA 2000. LNCS, vol. 1947, pp. 242–249. Springer, Heidelberg (2001)
Foster, I., Iamnitchi, A.: On Death, Taxes, and the Convergence of Peer-to-Peer and Grid Computing. In: Kaashoek, M.F., Stoica, I. (eds.) IPTPS 2003. LNCS, vol. 2735, Springer, Heidelberg (2003)
Labonte, K., Chu, J., Levine, B.: Availability and Locality Measurements of Peer-to-Peer File Systems. Scalability and Traffic Control in IP Networks, ITCom (July 2002)
Jelasity, M., Preuß, M., Paechter, B.: Maintaining Connectivity in a Scaleable and Robust Distributed Environment. In: Proceedings of the IEEE International Symposium on Cluster Computing and the Grid, Berlin, Germany, pp. 389–394 (May 2002)
Maymounkov, P., Mazieres, D.: Kademlia: A peer-to-peer information system based on the xor metric. In: Proceedings of IPTPS 2002, Cambridge, USA (2002)
Morrison, J.P., Power, K.: Compeer: Peer-to-Peer Applications on a Peer-to-Peer DCOM Architecture. In: Parallel and Distributed Computing and Systems, Anaheim, California, USA (August 2001)
Morrisson, J.P.: Condensed Graphs: Unifying Availability-Driven, Coercion-Driven and Control-Driven Computing. PhD thesis, Technische Universiteit Eindhoven (October 1996)
Power, K.: Compeer: A Scalable, Self-Organizing, Peer-to-Peer Metacomputer. PhD thesis, National University Ireland Cork (December 2003)
Ratnasamy, S., Francis, P., Handley, M., Karp, R., Shenker, S.: A Scalable Content Addressable Network. In: Proceedings of ACM SIGCOMM 2001 (2001)
Rowstron, A., Druschel, P.: Pastry: Scalable, Decentralized Object Location, and Routing for Large-Scale Peer-to-Peer Systems. In: Guerraoui, R. (ed.) Middleware 2001. LNCS, vol. 2218, p. 329. Springer, Heidelberg (2001)
Stoica, I., Morris, R., Karger, D., Kaashoek, M.F., Balakrishnan, H.: Chord: A scalable peer-to-peer lookup service for internet applications. In: Proceedings of the 2001 conference on applications, technologies, architectures, and protocols for computer communications, San Diego, California, United States, pp. 149–160. ACM Press, New York (2001)
Tanenbaum, A.S., van Steen, M.: Distributed Systems: Principles and Paradigms. Prentice Hall, New York (2002)
Verbeke, J., Nadgir, N., Ruetsch, G., Sharapov, I.: Framework for Peer-to-Peer Distributed Computing in a Heterogeneous, Decentralized Environment. In: Parashar, M. (ed.) GRID 2002. LNCS, vol. 2536, pp. 1–12. Springer, Heidelberg (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Power, K., Morrison, J.P. (2005). A Fault Recovery Scheme for P2P Metacomputers. In: Bader, D.A., Parashar, M., Sridhar, V., Prasanna, V.K. (eds) High Performance Computing – HiPC 2005. HiPC 2005. Lecture Notes in Computer Science, vol 3769. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11602569_10
Download citation
DOI: https://doi.org/10.1007/11602569_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-30936-9
Online ISBN: 978-3-540-32427-0
eBook Packages: Computer ScienceComputer Science (R0)