Abstract
Nondedicated clusters are currently at the forefront of the development of high performance computing systems. These clusters are relatively intolerant of hardware failures and cannot manage dynamic cluster membership efficiently. This report presents the logical design of an innovative self discovery service that provides for automated cluster management and resource discovery. The proposed service has an ability to share or recover unused computing resources, and to adapt to transient conditions autonomically, as well as the capability of providing dynamically scalable virtual computers on demand.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Goscinski, A., Zhou, W.: Client Server Systems. In: Webster, J.G. (ed.) Wiley Encyclopedia of Electrical and Electronics Engineering, vol. 3, pp. 431–451. John Wiley & Sons, Chichester (1999)
Goscinski, A.: Finding Expressing and Managing Parallelism in programs executing on clusters of workstations. Computer Communications 22, 998–1016 (1999)
Goscinski, A.: Towards and operating system managing parallelism of computers on clusters. Future Generation Computer Systems 17, 293–314 (2000)
Goscinski, A., Fikkers, P., Zhou, B.: A Global Scheduling Facility for Clusters Executing Communication Bound Parallel Applications. School of Computing and Mathematics. Deakin University (2002)
Geist, A., Beguelin, A., Dongarra, J., Jiang, W., Manchek, R., Sunderam, V.: PVM: A Users’ Guide and Tutorial for Networked Parallel Computing. MIT Press, Cambridge (1994)
Merkey, P.: Beowulf History (2003), http://www.beowulf.org/beowulf/history.html
Sterling, T., Savarese, D.: A Coming of Age for Beowulf-class Computing. Center for Advanced Computing Research. California Institute of Technology (June 1999)
Zaki, M., Parthasathy, S.: Customised dynamic load balancing for a network of work stations. Technical Report, The University of Rochester, New York (1995)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Dines, E., Goscinski, A. (2005). Toward Self Discovery for an Autonomic Cluster. In: Hobbs, M., Goscinski, A.M., Zhou, W. (eds) Distributed and Parallel Computing. ICA3PP 2005. Lecture Notes in Computer Science, vol 3719. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11564621_14
Download citation
DOI: https://doi.org/10.1007/11564621_14
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29235-7
Online ISBN: 978-3-540-32071-5
eBook Packages: Computer ScienceComputer Science (R0)