Skip to main content

Part of the book series: Lecture Notes in Computer Science ((LNPSE,volume 3666))

Abstract

In this paper we describe the implementation of the gather and allgather collectives on QsNetII. Results from a cluster of 980 4-CPU nodes show good latencies, bandwidths and scaling, with a 3920 process, 8-byte, gather completing in 88 microsecs.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Petrini, F., Coll, S., Frachtenberg, E., Hoisie, A.: Hardware - and Software Based Collective Communication on the Quadrics Network. In: Proceedings of the 2001 IEEE International Symposium on Network Computing and Applications (NCA 2001), Cambridge, Mass, October 8-10 (2001)

    Google Scholar 

  2. Addison, D., Beecroft, J., Hewson, D., McLaren, M., Petrini, F.: Quadrics QsNetII: A network for Supercomputing Applications. In: Hot Chips 15, Stanford University, CA (August 2003)

    Google Scholar 

  3. Beecroft, J., Addison, D., Hewson, D., McLaren, M., Petrini, F., Roweth, D.: Quadrics QsNetII: Pushing the Limit of the Design of High-Performance Networks for Supercomputers. In: IEEE Micro (2005) (to appear)

    Google Scholar 

  4. Roweth, D., Pittman, A., Beecroft, J.: Optimised Collectives on QsNetII, http://www.quadrics.com/documentation

  5. Snir, M., Otto, S., Huss-Lederman, S., Walker, D., Dongarra, J.: MPI: The Complete Reference, 2nd edn., September 1998. The MPI Core, vol. 1. The MIT Press, Cambridge (1998)

    Google Scholar 

  6. Meuer, H.W., Strohmaier, E., Dongarra, J.J., Simon, H.D.: Top500 Supercomputer Sites (June 2003), Available from www.top500.org

  7. Cray Man Page Collection: Shared Memory Access (SHMEM) S?2383?23, available from the Cray website, http://www.cray.com/craydoc

  8. Elan Programming Manual, Available from http://www.quadrics.com/documentation

  9. Benson, G.D., Chu, C.-W., Huang, Q., Caglar, S.G.: A Comparison of MPICH Allgather Algorithms on Switched Networks, October 2003. LNCS, pp. 335–343. Springer GmbH, Heidelberg (2003)

    Google Scholar 

  10. Petrini, F., Kerbyson, D., Pakin, S.: The Case of the Missing Supercomputer Performance: Achieving Optimal Performance on the 8,192 Processors of ASCI Q. In: IEEE/ACM SC 2003, Phoenix, AZ (November 2003)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Roweth, D., Addison, D. (2005). Optimised Gather Collectives on QsNetII . In: Di Martino, B., KranzlmĂĽller, D., Dongarra, J. (eds) Recent Advances in Parallel Virtual Machine and Message Passing Interface. EuroPVM/MPI 2005. Lecture Notes in Computer Science, vol 3666. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11557265_52

Download citation

  • DOI: https://doi.org/10.1007/11557265_52

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-29009-4

  • Online ISBN: 978-3-540-31943-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics