Abstract
Advances in networking infrastructure have made it possible to build very large-scale applications whose execution spans multiple supercomputers. In such very large-scale or ultra-scale applications, a central requirement is the ability to simultaneously co-allocate large collections of resources, to initiate a computation on those resources, and to initialize the distributed collection of components to construct a single, integrated computation. In a previous paper [3], we defined a general resource management architecture for high-performance distributed systems in which resource co-allocation was an integral component. In this extended abstract, we examine co-allocation in more detail and describe the implementation of a specific resource co-allocator called the Dynamically Updated Request Online Co-allocator, or DUROC. DUROC has been implemented as part of the Globus grid toolkit. We briefly describe the design of DUROC and discuss how it has been used to support a range of large-scale grid applications.
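The co-allocation requirement described above can be illustrated with a minimal sketch. It is not DUROC's interface: the Resource class, the co_allocate function, and the site names are hypothetical and introduced only for illustration. It covers just the all-or-nothing reservation step, assuming each site can grant or refuse a node request immediately, and it leaves out starting the computation and initializing its components.

```python
from dataclasses import dataclass


@dataclass
class Resource:
    """Hypothetical stand-in for one site's local resource manager."""
    name: str
    free_nodes: int

    def reserve(self, nodes: int) -> bool:
        # Grant the request only if enough nodes are currently free.
        if nodes <= self.free_nodes:
            self.free_nodes -= nodes
            return True
        return False

    def release(self, nodes: int) -> None:
        # Return previously reserved nodes to the free pool.
        self.free_nodes += nodes


def co_allocate(requests):
    """Reserve every (resource, nodes) pair, or none of them."""
    granted = []
    for resource, nodes in requests:
        if resource.reserve(nodes):
            granted.append((resource, nodes))
        else:
            # One site refused: undo the reservations made so far.
            for held, held_nodes in granted:
                held.release(held_nodes)
            return False
    return True


site_a = Resource("site-a", free_nodes=64)
site_b = Resource("site-b", free_nodes=16)

# Both sites can satisfy this request, so it is granted as a whole.
ok = co_allocate([(site_a, 32), (site_b, 8)])

# site-b cannot supply 32 nodes, so the reservation on site-a is rolled back.
failed = co_allocate([(site_a, 32), (site_b, 32)])
```

The rollback on refusal is what makes the allocation behave as a single unit across sites: a request either holds all of its resources or none of them.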
References
[1] S. Brunett, K. Czajkowski, I. Foster, S. Fitzgerald, A. Johnson, C. Kesselman, J. Leigh, and S. Tuecke. Application experiences with the Globus toolkit. IEEE Computer Society Press, 1998.
[2] C. Catlett and L. Smarr. Metacomputing. Communications of the ACM, 35(6):44–52, 1992.
[3] K. Czajkowski, I. Foster, N. Karonis, C. Kesselman, S. Martin, W. Smith, and S. Tuecke. A resource management architecture for metacomputing systems. In The 4th Workshop on Job Scheduling Strategies for Parallel Processing, 1998.
[4] I. Foster, J. Geisler, W. Gropp, N. Karonis, E. Lusk, G. Thiruvathukal, and S. Tuecke. A wide-area implementation of the Message Passing Interface. Parallel Computing, 1998. To appear.
[5] I. Foster and C. Kesselman. The Globus project: A status report. In Proceedings of the Heterogeneous Computing Workshop, pages 4–18. IEEE Computer Society Press, 1998.
[6] I. Foster and C. Kesselman, editors. The Grid: Blueprint for a Future Computing Infrastructure. Morgan Kaufmann Publishers, 1998.
[7] P. Messina. Distributed supercomputing applications, 1998.
Copyright information
© 1998 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Czajkowski, K., Foster, I., Kesselman, C. (1998). Resource management for ultra-scale computational grid applications. In: Kågström, B., Dongarra, J., Elmroth, E., Waśniewski, J. (eds) Applied Parallel Computing Large Scale Scientific and Industrial Problems. PARA 1998. Lecture Notes in Computer Science, vol 1541. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0095324
DOI: https://doi.org/10.1007/BFb0095324
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-65414-8
Online ISBN: 978-3-540-49261-0
eBook Packages: Springer Book Archive