Abstract
A framework for cluster management is proposed that enables a cluster to be utilized more efficiently within a research environment. It does so by moving cluster management onto a dedicated management node, leaving the compute nodes as essentially bare machinery. Users schedule access to one or more of the compute nodes via the management node. At the scheduled time, a previously saved image of their research environment is loaded and the session begins. At the end of the session the user may save a new image of the environment on the management node, to be reloaded at another time. Thus the user may work with a customized environment, which may even be a fledgling operating system, without fear of interfering with other researchers. This allows the capital investment in a systems research cluster to be amortized over a greater number of researchers.
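The schedule, load, and save cycle described in the abstract can be illustrated with a small model. This is only a minimal sketch under stated assumptions, not the paper's implementation: the names `ManagementNode`, `Reservation`, `schedule`, `begin_session`, and `end_session` are hypothetical, time is represented as plain integers, and an "image" is just a byte string held in memory rather than a disk image transferred over the network.

```python
from dataclasses import dataclass, field


@dataclass
class Reservation:
    user: str
    nodes: frozenset
    start: int   # session start, in arbitrary time units (assumption)
    end: int     # session end, exclusive
    image: str   # name of the saved environment image to load


@dataclass
class ManagementNode:
    """Hypothetical model: all state lives here, compute nodes hold none."""
    compute_nodes: frozenset
    images: dict = field(default_factory=dict)        # (user, name) -> bytes
    reservations: list = field(default_factory=list)

    def schedule(self, user, nodes, start, end, image):
        """Reserve nodes for [start, end) if no other session overlaps."""
        nodes = frozenset(nodes)
        if not nodes <= self.compute_nodes:
            raise ValueError("unknown compute node requested")
        if (user, image) not in self.images:
            raise KeyError("no saved image with that name for this user")
        for r in self.reservations:
            overlaps = start < r.end and r.start < end
            if overlaps and nodes & r.nodes:
                raise ValueError("node already reserved for that time")
        res = Reservation(user, nodes, start, end, image)
        self.reservations.append(res)
        return res

    def begin_session(self, res):
        """Load the user's saved image onto every reserved node."""
        data = self.images[(res.user, res.image)]
        return {node: data for node in res.nodes}

    def end_session(self, res, new_name, new_image):
        """Save a new image on the management node for a later session."""
        self.images[(res.user, new_name)] = new_image
```

In this model, two researchers can hold reservations on the same node as long as their time slots do not overlap, and each sees only the image they saved, which is the isolation property the abstract relies on.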
Copyright information
© 2000 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Cunniffe, R., Coghlan, B.A. (2000). Encouraging the Unexpected: Cluster Management for OS and Systems Research. In: Bode, A., Ludwig, T., Karl, W., Wismüller, R. (eds) Euro-Par 2000 Parallel Processing. Euro-Par 2000. Lecture Notes in Computer Science, vol 1900. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44520-X_161
DOI: https://doi.org/10.1007/3-540-44520-X_161
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-67956-1
Online ISBN: 978-3-540-44520-3
eBook Packages: Springer Book Archive