Abstract
PC clusters are still more popular platform for high performance computing. But there is still lack of freely available tools for resource monitoring and management usable for efficient workload distribution. In this paper, a monitoring system for PC clusters called Cluster Information Service (CIS) is described. Its purpose is to provide clients (resource management system or application scheduler) with information about availability of resources in PC cluster. This information can help the scheduler to improve performance of parallel application. CIS is designed to have as low intrusiveness as possible while keeping a high detail of monitoring data.
The possibility of improving the performance of PVM/MPI applications is also discussed.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
K. Baumgartner and B. W. Wah, Computer Scheduling Algorithms: Past, Present, and Future, Information Sciences, vol. 57 & 58, pp. 319–345, Elsevier Science Pub. Co., Inc., New York,NY, Sept.-Dec. 1991.
R. Buyya, K. Mohan, B. Gopal, PARMON: A Comprehensive Cluster Monitoring System, Proceedings of the Australian Users Group for UNIX and Open Systems Conference and Exhibition, AUUG98-Open Systems: The Common Thread, Sydney, Australia, 1998.
I. Foster, C. Kesselman, Globus: A metacomputing infrastructure toolkit, Internatioal Journal of Supercomputer Applications and High Performance Computing 11(2) (1997) 115–128.
L. Hluchy, M. Dobrucky, J. Astalos: Hybrid Approach to Task Allocation in Distributed Systems. Computer and Artificial Intelligence, Vol. 17, No. 5, 1998, pp. 469–480, ISSN 0232-0274.
P. Kacsuk, G. Dozsa, and T. Fadgyas: Designing parallel programs by the graphical language GRAPNEL, Microprocessing and Microprogramming 41 (1996) 625–643.
Z. Liang, Y. Sun, and C.-L. Wang, ClusterProbe: An Open, Flexible and Scalable Cluster Monitoring Tool, 1st IEEE Computer Society International Workshop on Cluster Computing, Melbourne, Australia, December 1999.
M. Mansouri-Samani, M. Sloman, Monitoring Distributed Systems (A Survey) Imperial College Research Report No. DOC92/23, Imperial College of Science Technology and Medicine, London, 1992.
M. Nuttall and M. Sloman, Workload characteristics for Process Migration and Load Balancing, Proc. of the 17th Int. Conf. on Distributed Computing Systems, pp. 133–140, May 1997.
C. Roder, T. Ludwig, and A. Bode, NSR-A Tool for Load Measurement in Heterogeneous Environments. In A. Bode, A. Ganz, C. Gold, S. Petri, N. Reimer, B. Schiemann, and T. Schneckenburger, editors, Anwendungsbezogene Lastverteilung-ALV’98, number TUM-I9806, SFB-Bericht Nr. 342/01/98 A, pages 133–144. Technische Universitat Munchen, February 1998.
P. Uthayopas, S. Phaisithbenchapol, K. Chongbarirux, Building a Resources Monitoring System for SMILE Beowulf Cluster, Proceeding of the High Performance Computing Conference ASIA, Singapore, September 1998.
R. Wolski and N. Spring and C. Peterson, Implementing a Performance Forecasting System for Metacomputing: The Network Weather Service, Proceedings of the 1997 ACM/IEEE SC97 Conference in San Jose California, November, 1997.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2000 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Astaloš, J., Hluchý, L. (2000). CIS - A Monitoring System for PC Clusters. In: Dongarra, J., Kacsuk, P., Podhorszki, N. (eds) Recent Advances in Parallel Virtual Machine and Message Passing Interface. EuroPVM/MPI 2000. Lecture Notes in Computer Science, vol 1908. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45255-9_32
Download citation
DOI: https://doi.org/10.1007/3-540-45255-9_32
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-41010-2
Online ISBN: 978-3-540-45255-3
eBook Packages: Springer Book Archive