Abstract
In this paper we examine the performance and fault tolerance of different multicast based methods for maintaining the consistence of distributed data, depending on the network topology and packet loss probability. We present a system model and analytical results. This work is motivated by experiences with our process migration system PBeam. There we use a global virtual name space to achieve location transparency for process migration and checkpointing / rollback for distributed applications on clusters of Unix workstations. First measurements have shown that the maintenance of this global name space is critical for the performance of the entire system.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
D. Bailey, E. Barszcz, J. Barton, D. Browning, R. Carter, L. Dagum, R. Fatoohi, S. Fineberg, P. Frederickson, T. Lasinski, R. Schreiber, H. Simon, V. Venkatakrishnan, and S. Weeratunga. The NAS Parallel Benchmarks. Report RNR-94- 007, Department of Mathematics and Computer Science, Emory University, March 1994.
Amnon Barak, Shai Guday, and Richard G. Wheeler. The MOSIX Distributed Operating System, volume 672 of LNCS. Springer Verlag, 1993.
Matthias Becker. Leistungsbewertung von Multicast Synchronisationsverfahren. Master’s thesis, TU Braunschweig, 1996. In preparation.
Clemens H. Cap. Workstation Cluster Computing aus der Sicht des Anwenders. PIK - Praxis der Informationsverarbeitung und Kommunikation, 17(4):230–237, October 1994.
Jeremy Casas, Dan L. Clark, Ravi Konuru, Steve W. Otto, Robert M. Prouty, and Jonathan Walpole. MPVM: A migration transparent version of PVM. Computing Systems, 8(2):171–216, 1995.
K. Mani Chandy. A mutual exclusion algorithm for distributed systems. Technical report, University of Texas, 1982.
S. Deering. Host Extensions for IP Multicasting. RFC 1112, Stanford University, August 1989.
Fred Douglis and John Ousterhout. Transparent Process Migration: Design Alternatives and the Sprite Implementation. Software - Practice and Experience, 21(8):757–785, August 1991.
H. Langendörfer and B. Schnor. Verteilte Systeme. Hanser Verlag, München, 1994.
Michael J. Litzkow, Miron Livny, and Matt W. Mutka. Condor - A Hunter of Idle Workstations. In Proceedings of the 8th International Conference on Distributed Computer Systems, pages 104–111. IEEE, June 1988.
Michael J. Litzkow and Marvin Solomon. Supporting Checkpointing and Process Migration Outside the UNIX Kernel. In Usenix Conference Proceedings, pages 283–290, San Francisco, CA, January 1992.
Thomas Ludwig. Automatische Lastverteilung für Parallelrechner. Reihe Informa- tik. BI-Wissenschaftsverlag, 1993.
Dejan S. Milojičić. Load Distribution - Implementation for the Mach Microkernel Vieweg Verlag, Braunschweig, 1994.
L.E. Moser, P.M. MeUiar-Smith, D.A. Agarwal, R.K. Budhia, and C.A. Lingley- -Papadopoulos. Totem: A fault-tolerant multicast group communication system. Communications of the ACM, 39(4):54–63, April 1996.
C. Perkins. IP Encapsulation within IP. Internet draft (work in progress), Mobile IP Working Group of the Internet Engineering Task Force (IETF)/IBM, May 1996. draft-ietf-mobileip-ip4inip4–03.txt.
S. Petri and H. Langendörfer. Load Balancing and Fault Tolerance in Workstation Clusters - Migrating Groups of Communicating Processes. Operating Systems Review, 29(4):25–36, October 1995.
S. Petri, B. Schnor, H. Langendörfer, and J. Steinborn. Consistent Global Checkpoints for Distributed Applications on Clusters of Unix Workstations. In H.G. Matthies and J. Schüle, editors, Paralleles und verteiltes Rechnen - Beiträge zum 4. Workshop üher wissenschaftüches Rechnen, pages 77–86, Aachen, October 1996. TU Braunschweig, Shaker.
James S. Plank, Micah Beck, Gerry Kingsley, and Kai Li. Libckpt: Transparent Checkpointing under Unix. In Usenix Conference Proceedings. USENIX, January 1995.
B. Schnor, S. Petri, R. Oleyniczak, and H. Langendörfer. Scheduling of Parallel Applications on Heterogeneous Workstation Clusters. Will appear in proceedings of PDCS’96 - Ninth International Conference on Parallel and Distributed Computing Systems, September 1996.
Georg Stellner. Resource Management and Checkpointing for PVM. In Proceedings of the Second European PVM User Group Meeting, Lyon, 1995.
R.H. Thomas. A Majority Consensus Approach to Concurrency Control for Multiple Copy Databases. ACM Transactions on Database Systems, 4(2):180–209, 1979.
Bernd Hinrichs und Titus Tscharntke. Aufbau eines echt verteilten Namensraum- services für die transparente Prozeßimigration zwischen Workstations. Studienar- beit, August 1996.
S. White, A. Ålund, and V.S. Sunderam. Performance of the NAS Parallel Benchmarks on PVM Based Networks. Report RNR-94–008, Department of Mathematics and Computer Science, Emory University, May 1994.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1997 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Petri, S., Schnor, B., Becker, M., Hinrichs, B., Tscharntke, T., Langendörfer, H. (1997). Evaluation of Multicast Methods to Maintain a Global Name Space for Transparent Process Migration in Workstation Clusters. In: Zitterbart, M. (eds) Kommunikation in Verteilten Systemen. Informatik aktuell. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-60729-5_16
Download citation
DOI: https://doi.org/10.1007/978-3-642-60729-5_16
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-62565-0
Online ISBN: 978-3-642-60729-5
eBook Packages: Springer Book Archive