Skip to main content
Log in

A Software Suite for High-Performance Communications on Clusters of SMPs

  • Published:
Cluster Computing Aims and scope Submit manuscript

Abstract

A cluster, by opposition to a parallel computer, is a set of separate workstations interconnected by a high-speed network. The performances one can get on a cluster heavily depend on the performances of the lowest communication layers. In this paper, we address the special case where the cluster contains multi-processor machines. These shared-memory multi-processors desktop machines (SMPs) with 2 or 4 processors are now becoming very popular and present a high performance/price ratio. We present a software suite for achieving high-performance communications on a Myrinet-based cluster: BIP, BIP-SMP and MPI-BIP. The software suite supports single-processor (Intel PC and Digital Alpha) and multi-processor machines, as well as any combination of the two architectures.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Similar content being viewed by others

References

  1. N.J. Boden, D. Cohen, R.E. Felderman, A.E. Kulawik, C.L. Seitz, J.N. Seizovic and W.-K. Su, Myrinet - a gigabit-per-second localarea network, IEEE Micro 15(1) (February 1995) pp. 29–36. Myricom, http://www.myri.com.

    Google Scholar 

  2. T.E. Anderson, D.E. Culler, D.A. Patterson and the NOW team, A case for NOW (Networks of Workstations), IEEE Micro 15(1) (February 1995) 54–64.

    Google Scholar 

  3. W. Gropp, E. Lusk, N. Doss and A. Skjellum, A high-performance, portable implementation of the MPI message passing interface standard, Parallel Computing 22(6) (September 1996) 789–828.

    Google Scholar 

  4. L. Prylli and B. Tourancheau, BIP: A new protocol designed for high performance networking on myrinet, in: Workshop PC-NOW, IPPS/SPDP98, Orlando, USA (1998).

  5. L. Prylli, B. Tourancheau and R. Westrelin, An improved NIC program for high-performance MPI, in: International Conference on Supercomputing (ICS'99), Workshop on Cluster-based Computing, eds. N.P. Carter and S.S. Lumetta.

  6. L. Prylli, B. Tourancheau and R. Westrelin, The design for a high performance mpi implementation on the myrinet network, in: Euro-PVM/MPI'99 (1999).

  7. Myricom, The GM api, December 1998. http://www.myri.com/GM/ doc/gm_toc.html.

  8. P. Geoffray, L. Prylli and B. Tourancheau, BIP-SMP: High performance message passing over a cluster of commodity SMPs, in: Supercomputing' 99 (SC99).

  9. P. Husbands and J.C. Hoe, Mpi-start: Delivering network performance to numerical applications, in: SuperComputing (SC'98), Orlando, Florida, November 1998.

  10. S.S. Lumetta, A.M. Mainwaring and D.E. Culler, Multi-protocol active messages on a cluster of smp's, in: SuperComputing (SC'97), University of California at Berkeley, August 1997.

  11. T. von Eicken, Active messages: An efficient communication architecture for multiprocessors, Ph.D. thesis, University of California at Berkeley, November 1993.

  12. A. Singhal, D. Broniarczyk, F. Cerauskis, J. Price, L. Yuan, C. Cheng, D. Doblar, S. Fosth, N. Agarwal, K. Harvey, E. Hagersten and B. Liencres, Gigaplane: A high performance bus for large smps, in: Hot Interconnects IV, Stanford, California, August 1996, pp. 41-52.

  13. R.M. Butler and E.L. Lusk, Monitors, messages, and clusters: The p4 parallel programming system, Technical report, University of North Florida and Argonne National Laboratory, 1993. http://wwwfp. mcs.anl.gov/ lusk/p4/p4-paper/paper.html.

  14. S.S. Lumetta and D.E. Culler, Managing concurrent access for shared memory active messages, in: International Parallel Processing Symposium, Orlando, Florida, April 1998.

  15. X. Leroy, The linuxthreads library, June 1999. http://pauillac.inria. fr/ xleroy/linuxthreads/.

  16. NASA, NAS Parallel Benchmark 2.3. http://science.nas.nasa.gov/ Software/NPB/.

  17. J.J. Dongarra, Performance of various computers using standard linear equations software, Technical report CS-89-85, University of Tennessee Computer Science, 1999.

  18. R.C. Whaley and J.J. Dongarra, Automatically tuned linear algebra software, in: SuperComputing (SC'98) (1998).

  19. H. Brunst, CongDuc Pham and S. Fdida, Conservative simulation of load-balanced routing in a large ATM network model, in: 12th Parallel and Distributed Simulation Workshop (PADS'98), Banff, Canada, May 1998.

  20. K. Chandy and J. Misra, Distributed simulation: A case study in design and verification of distributed programs, IEEE Trans. Software Engrg. 5(5) (1979).

  21. J. Dongarra, Linpack benchmark-parallel, April 1999. http://performance. netlib.org/performance/html/linpack-parallel.data.col0.html.

  22. L. Prylli, Bip user reference manual, Technical report TR97-02, LIP/ENS-LYON, Septembre 1997.

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Geoffray, P., Pham, C. & Tourancheau, B. A Software Suite for High-Performance Communications on Clusters of SMPs. Cluster Computing 5, 353–363 (2002). https://doi.org/10.1023/A:1019756120212

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1023/A:1019756120212

Navigation