Abstract
The availability of multiprocessors and high performance networks offer the opportunity to build CLUMPs (Cluster of Multi- processors) and use them as parallel computing platforms. The main distinctive feature of the CLUMP architecture over the usual parallel computers is its hybrid memory model (message passing between the nodes and shared memory inside the nodes). To be largely used, the CLUMPs must be able to execute the existing programs with few mod- ifications. We investigate the performance of a programming approach based on the MPI for inter-multiprocessor communications and OpenMP standards for intra-multiprocessor exchanges. The approach consists in the intra-node parallelization of the MPI programs with an OpenMP directive based parallel compiler. The paper details the approach in the context of the biprocessor PC CLUMPs and presents a performance eval- uation for the NAS parallel benchmarks.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
E. L. Lusk W. W. Gropp. A taxonomy of programming models for symmetric multiprocessors and smp clusters. In in Proceedings of Programming Models for Massively Parallel Computers, pages 2–7, 1995.
Hakono. Bugge and Per0. Husoy. Efficient sar processing on the scali system. Report IPPS97, Scali Computer AS, 1997.
M. Bernaschi. Efficient message passing on shared memory multiprocessors. Lecture Notes in Computer Science, 1156:221, 1996.
Steven S. Lumetta, Alan Mainwaring, and David E. Culler. Multi-protocol active messages on a cluster of SMPs. In ACM, editor, SC’97: High Performance Networking and Computing: Proceedings of the 1997 ACM/IEEE SC97 Conference:340 November 15-21, 1997, San Jose, California, USA. pages ??-??, New York, NY 10036, USA and 1109 Spring Street, Suite 300, Silver Spring, MD 20910, USA, 1997. ACM Press and IEEE Computer Society Press.
David A. Bader and Joseph J. J. SIMPLE: A methodology for programming high performance algorithms on clusters of symmetric multiprocessors (SMPs). Technical Report CS-TR-3798 and UMIACS-TR-97-48, Institute for Advanced Computer Studies, University of Maryland, College Park, MD, May 1997.
R. Samanta, A. Bilas, L. Iftode, and J. P. Singh. Home-based SVM protocols for SMP clusters: Design and performance. In Proc. of the 4th IEEE Symp. on High-Performance Computer Architecture (HPCA-4), February 1998
D. J. Scales, K. Gharachorloo, and A. Aggarwal. Fine-grain software distributed shared memory on SMP clusters. In Proc. of the 4ĥ IEEE Symp. on Highperformance Computer Architecture (HPCA-4), February 1998.
R. Stets, S. Dwarkadas, N. Hardavellas, G. Hunt, L. Kontothanassis, S. Parthasarathy, and Michael Scott. Cashmere-2L: Software coherent shared memory on a clustered remote-write network. In Proc. of the 16th ACM Symp. on Operating Systems Principles (SOSP-16), October 1997.
Andrew Eriichson, Neal Nuckolls, Greg Chesson, and John Hennessy. SoftFLASH: 340 Analyzing the performance of clustered distributed virtual shared memory. In Proceedings of the Seventh International Conference on Architectural Support for Programming Languages and Operating Systems, pages 210–220, Cambridge, Massachusetts, October1-5, 1996. ACM SIGARCH, SIGOPS, SIGPLAN, and the IEEE Computer Society.
Charlie Hu Honghui Lu and Willy Zwaenepoel. Openmp on networks of workstations. In Proc. of Super Computing 98, Orlando, 1998.
NAS Parallel Benchmark Home page. http://science.nas.nasa.gov/software/npb/. Technical report.
M. Ando K. Kazuto Y. Tanaka, M. Matsuda and M. Sato. Compas: A pentium pro pc-based smp cluster and its experience. In IPPS Workshop on Personal Computer Based Networks of Workstations, pages 486–497. LNCS, 1998
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1999 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Cappello, F., Richard, O., Etiemble, D. (1999). Performance of the NAS Benchmarks on a Cluster of SMP PCs Using a Parallelization of the MPI Programs with OpenMP. In: Malyshkin, V. (eds) Parallel Computing Technologies. PaCT 1999. Lecture Notes in Computer Science, vol 1662. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48387-X_36
Download citation
DOI: https://doi.org/10.1007/3-540-48387-X_36
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-66363-8
Online ISBN: 978-3-540-48387-8
eBook Packages: Springer Book Archive