Design of Kernel-Level Asynchronous Collective Communication

Nomura, Akihiro; Ishikawa, Yutaka

doi:10.1007/978-3-642-15646-5_10

Akihiro Nomura²⁰ &
Yutaka Ishikawa²⁰

Part of the book series: Lecture Notes in Computer Science ((LNPSE,volume 6305))

Included in the following conference series:

European MPI Users' Group Meeting

1058 Accesses
6 Citations

Abstract

Overlapping computation and communication, not only point-to-point but also collective communications, is an important technique to improve the performance of parallel programs. Since the current non-blocking collective communications have been mostly implemented using an extra thread to progress communication, they have extra overhead due to thread scheduling and context switching. In this paper, a new non- blocking communication facility, called KACC is proposed to provide fast asynchronous collective communications. KACC is implemented in the OS kernel interrupt context to perform non-blocking asynchronous collective operations without an extra thread. The experimental results show that the CPU time cost of this method is sufficiently small.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Dynamic Placement of Progress Thread for Overlapping MPI Non-blocking Collectives on Manycore Processor

Maximizing Communication–Computation Overlap Through Automatic Parallelization and Run-time Tuning of Non-blocking Collective Operations

Article 26 November 2016

Progress Thread Placement for Overlapping MPI Non-blocking Collectives Using Simultaneous Multi-threading

References

Argonne National Laboratory: MPICH2: High-performance and widely portable MPI, http://www.mcs.anl.gov/research/projects/mpich2/
Hoefler, T., Lumsdaine, A.: Design, Implementation, and Usage of LibNBC. Tech. rep., Open Systems Lab, Indiana University (August 2006)
Google Scholar
Hoefler, T., Lumsdaine, A., Rehm, W.: Implementation and Performance Analysis of Non-Blocking Collective Operations for MPI. In: Proceedings of the 2007 International Conference on High Performance Computing, Networking, Storage and Analysis, SC 2007. IEEE Computer Society/ACM (November 2007)
Google Scholar
Kashyap, V.: IP over InfiniBand (IPoIB) Architecture. RFC 4392 (Informational) (April 2006), http://www.ietf.org/rfc/rfc4392.txt
MPI Forum: Message passing interface , http://www.mpi-forum.org/
MPI Forum: MPIplans - an alternative for all other collectives proposals? https://svn.mpi-forum.org/trac/mpi-forum-web/wiki/MPIplans
Myricom, Inc.: IP over myrinet, http://www.myri.com/scs/documentation/mug/ip/
Petitet, A., Whaley, R.C., Dongarra, J., Cleary, A.: HPL - a portable implementation of the high-performance linpack benchmark for distributed-memory computers, http://www.netlib.org/benchmark/hpl/

Download references

Author information

Authors and Affiliations

Dept. of Computer Science, Graduate School of Information Science and Technology, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo, Japan
Akihiro Nomura & Yutaka Ishikawa

Authors

Akihiro Nomura
View author publications
You can also search for this author in PubMed Google Scholar
Yutaka Ishikawa
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

High Performance Computing Center Stuttgart (HLRS), Universität Stuttgart, Nobelstr. 19, 70569, Stuttgart, Germany
Rainer Keller
Parallel Software Technologies Laboratory, Department of Computer Science, University of Houston,
Edgar Gabriel
High Performance Computing Center Stuttgart, University of Stuttgart, Nobelstr. 19, 70569, Stuttgart, Germany
Michael Resch
Department of Electrical Engineering and Computer Science, University of Tennessee, 37996-3450, Knoxville, TN, USA
Jack Dongarra

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Nomura, A., Ishikawa, Y. (2010). Design of Kernel-Level Asynchronous Collective Communication. In: Keller, R., Gabriel, E., Resch, M., Dongarra, J. (eds) Recent Advances in the Message Passing Interface. EuroMPI 2010. Lecture Notes in Computer Science, vol 6305. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15646-5_10

Download citation

DOI: https://doi.org/10.1007/978-3-642-15646-5_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15645-8
Online ISBN: 978-3-642-15646-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics