Abstract
The Bulk-Synchronous Parallel (BSP) model is a universal abstraction of parallel computation that can be used to design portable parallel software. Advances in processor architecture and network communication enable clusters of workstations to be used as parallel computers. This paper focuses on using the idle computing power of a network of workstations to run parallel programs. The transient nature of the processors causes straightforward execution of synchronous BSP programs to perform poorly in such an environment. In this paper, we propose a scheme, based on the eager replication of state data and lazy replication of processes, that allows BSP programs to run efficiently on transient processors. The scheme is integrated into the Oxford BSP library.
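The abstract does not spell out the protocol, so the following is only a toy illustration of one possible reading of "eager replication of state data and lazy replication of processes", not the authors' implementation and not the Oxford BSP library API. All names here (run_bsp, local_compute, reclaimed, spare_hosts) are hypothetical. The sketch assumes state is copied to a peer at every superstep barrier (eager), and that a replacement process is created from that copy only when a workstation is actually reclaimed by its owner (lazy).

```python
def local_compute(pid, state):
    """Stand-in for one superstep of local work (hypothetical)."""
    return state + pid + 1


def run_bsp(num_procs, num_steps, reclaimed, spare_hosts):
    """Simulate num_steps supersteps on transient workstations.

    reclaimed maps a superstep index to the process ids whose
    workstation is taken back by its owner during that superstep.
    """
    hosts = {pid: f"ws{pid}" for pid in range(num_procs)}
    state = {pid: 0 for pid in range(num_procs)}
    replica = dict(state)      # copies of state held on peer machines
    spares = list(spare_hosts)
    restarts = 0

    for step in range(num_steps):
        lost = set(reclaimed.get(step, ()))
        for pid in range(num_procs):
            if pid in lost:
                # Lazy process replication: a replacement process is
                # created only now, seeded from the last replicated state.
                if not spares:
                    raise RuntimeError("no idle workstation available")
                hosts[pid] = spares.pop(0)
                state[pid] = replica[pid]
                restarts += 1
            state[pid] = local_compute(pid, state[pid])
        # Barrier: eager state replication at the end of every superstep.
        replica = dict(state)

    return state, hosts, restarts
```

For instance, run_bsp(3, 4, {1: {0}, 2: {2}}, ["spare0", "spare1"]) finishes with the same final state as a failure-free run; the only cost in this toy model is that two processes are restarted on spare machines.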
This work was partially supported by NSF Grants CCR-9216053 and CCR-9527151. The content does not necessarily reflect the position or policy of the U.S. Government.
Copyright information
© 1996 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Nibhanupudi, M.V., Szymanski, B.K. (1996). Adaptive parallelism in the Bulk-synchronous Parallel model. In: Bougé, L., Fraigniaud, P., Mignotte, A., Robert, Y. (eds) Euro-Par'96 Parallel Processing. Euro-Par 1996. Lecture Notes in Computer Science, vol 1124. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0024718
DOI: https://doi.org/10.1007/BFb0024718
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-61627-6
Online ISBN: 978-3-540-70636-6
eBook Packages: Springer Book Archive