Skip to main content

Using Triggered Operations to Offload Collective Communication Operations

  • Conference paper
Book cover Recent Advances in the Message Passing Interface (EuroMPI 2010)

Part of the book series: Lecture Notes in Computer Science ((LNPSE,volume 6305))

Included in the following conference series:

Abstract

Efficient collective operations are a major component of application scalability. Offload of collective operations onto the network interface reduces many of the latencies that are inherent in network communications and, consequently, reduces the time to perform the collective operation. To support offload, it is desirable to expose semantic building blocks that are simple to offload and yet powerful enough to implement a variety of collective algorithms. This paper presents the implementation of barrier and broadcast leveraging triggered operations — a semantic building block for collective offload. Triggered operations are shown to be both semantically powerful and capable of improving performance.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Buntinas, D., Panda, D.K., Sadayappan, P.: Fast NIC-based barrier over Myrinet/GM. In: Proceedings of the International Parallel and Distributed Processing Symposium (April 2001)

    Google Scholar 

  2. Riesen, R.E., Pedretti, K.T., Brightwell, R., Barrett, B.W., Underwood, K.D., Hudson, T.B., Maccabe, A.B.: The Portals 4.0 message passing interface. Technical Report SAND2008-2639, Sandia National Laboratories (April 2008)

    Google Scholar 

  3. Underwood, K.D., Coffman, J., Larsen, R., Hemmert, K.S., Barrett, B.W., Brightwell, R., Levenhagen, M.: Enabling flexible collective communication offload with triggered operations. Submitted to Proceedings of the 2010 IEEE International Conference on Cluster Computing (September 2010)

    Google Scholar 

  4. Scott, S.L., Thorson, G.: Optimized routing in the Cray T3D. In: Bolding, K., Snyder, L. (eds.) PCRCW 1994. LNCS, vol. 853, pp. 281–294. Springer, Heidelberg (1994)

    Google Scholar 

  5. Yih Huang, P.K.M.: Efficient collective operations with ATM network interface support. In: Proceedings of the International Conference on Parallel Processing, August 1996, pp. 34–43 (1996)

    Google Scholar 

  6. Yu, W., Buntinas, D., Graham, R.L., Panda, D.K.: Efficient and scalable barrier over Quadrics and Myrinet with a new NIC-based collective message passing protocol. In: Proceedings of the Workshop on Communication Architecture for Clusters (April 2004)

    Google Scholar 

  7. Buntinas, D., Panda, D.K., Duato, J., Sadayappan, P.: Broadcast/multicast over Myrinet using NIC-assisted multidestination messages. In: Proceedings of the Fourth International Workshop on Communication, Architecture, and Applications for Network-Based Parallel Computing (January 2000)

    Google Scholar 

  8. Wagner, A., Jin, H.-W., Panda, D.K., Riesen, R.: NIC-based offload of dynamic user-defined modules for Myrinet clusters. In: Proceedings of the 2004 IEEE International Conference on Cluster Computing, September 2004, pp. 205–214 (2004)

    Google Scholar 

  9. Underwood, K.D., Levenhagen, M., Rodrigues, A.: Simulating Red Storm: Challenges and successes in building a system simulation. In: 21st International Parallel and Distributed Processing Symposium (IPDPS 2007) (March 2007)

    Google Scholar 

  10. Hoefler, T., Mehlan, T., Mietke, F., Rehm, W.: Fast barrier synchronization for InfiniBand. In: 20th International Parallel and Distributed Processing Symposium, IPDPS 2006. (April 2006)

    Google Scholar 

  11. Ferreira, K.B., Bridges, P., Brightwell, R.: Characterizing application sensitivity to OS interference using kernel-level noise injection. In: SC 2008: Proceedings of the, ACM/IEEE conference on Supercomputing, pp. 1–12. IEEE Press, Piscataway (2008)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Hemmert, K.S., Barrett, B., Underwood, K.D. (2010). Using Triggered Operations to Offload Collective Communication Operations. In: Keller, R., Gabriel, E., Resch, M., Dongarra, J. (eds) Recent Advances in the Message Passing Interface. EuroMPI 2010. Lecture Notes in Computer Science, vol 6305. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15646-5_26

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-15646-5_26

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-15645-8

  • Online ISBN: 978-3-642-15646-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics