Performance of the NAS Benchmarks on a Cluster of SMP PCs Using a Parallelization of the MPI Programs with OpenMP

  • Conference paper
Parallel Computing Technologies (PaCT 1999)

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 1662)

Abstract

The availability of multiprocessors and high performance networks offers the opportunity to build CLUMPs (Clusters of Multiprocessors) and use them as parallel computing platforms. The main distinctive feature of the CLUMP architecture compared with conventional parallel computers is its hybrid memory model (message passing between the nodes and shared memory inside the nodes). To be widely used, CLUMPs must be able to execute existing programs with few modifications. We investigate the performance of a programming approach based on the MPI standard for inter-multiprocessor communication and the OpenMP standard for intra-multiprocessor exchanges. The approach consists of the intra-node parallelization of the MPI programs with an OpenMP directive-based parallel compiler. The paper details the approach in the context of biprocessor PC CLUMPs and presents a performance evaluation for the NAS parallel benchmarks.

Copyright information

© 1999 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Cappello, F., Richard, O., Etiemble, D. (1999). Performance of the NAS Benchmarks on a Cluster of SMP PCs Using a Parallelization of the MPI Programs with OpenMP. In: Malyshkin, V. (eds) Parallel Computing Technologies. PaCT 1999. Lecture Notes in Computer Science, vol 1662. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48387-X_36

  • DOI: https://doi.org/10.1007/3-540-48387-X_36

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-66363-8

  • Online ISBN: 978-3-540-48387-8

  • eBook Packages: Springer Book Archive
