ABSTRACT
We present the architecture of the Deep Computing Messaging Framework (DCMF), a message passing runtime designed for the Blue Gene/P machine and other HPC architectures. DCMF has been designed to easily support several programming paradigms such as the Message Passing Interface (MPI), Aggregate Remote Memory Copy Interface (ARMCI), Charm++, and others. This support is made possible as DCMF provides an application programming interface (API) with active messages and non-blocking collectives. DCMF is being open sourced and has a layered component based architecture with multiple levels of abstraction, allowing the members of the community to contribute new components to its design at the various layers. The DCMF runtime can be extended to other architectures through the development of architecture specific implementations of interface classes. The production DCMF runtime on Blue Gene/P takes advantage of the direct memory access (DMA) hardware to offload message passing work and achieve good overlap of computation and communication. We take advantage of the fact that the Blue Gene/P node is a symmetric multi-processor with four cache-coherent cores and use multi-threading to optimize the performance on the collective network. We also present a performance evaluation of the DCMF runtime on Blue Gene/P and show that it delivers performance close to hardware limits.
- Open Fabrics Alliance. http://www.openfabrics.orgGoogle Scholar
- N. R. Adiga et al. Blue Gene/L torus interconnection network. IBM J. Res. Dev., 49:265--276, (2005) Google ScholarDigital Library
- G. Almasi et al. Design and implementation of message-passing services for the Blue Gene/L supercomputer. IBM J. Res. Dev., 49:393--406, (2005) Google ScholarDigital Library
- M. Banikazemi, R. Govindaraju, R. Blackmore, and D. K. Panda. MPI-LAPI: An efficient implementation of MPI for IBM RS/6000 SP systems. IEEE Transactions on Parallel and Distributed Systems, 12(10):1081--1093, 2001 Google ScholarDigital Library
- C. Barton, C. Cascaval, S. Chatterjee, G. Almasi, Y. Zheng, M. Farreras, and J. Amaral. Shared memory programming for large scale machines. In Proceedings of ACM SIGPLAN Conference on Programming Language Design and Implementation, June 2006 Google ScholarDigital Library
- DCMF. http://dcmf.anl-external.org/wiki, 2008Google Scholar
- J. Dongarra, E. Strohmaier, H. Simon, and H. Meuer. www.top500.org, 2007. Date retrieved: 10 Jan 2008Google Scholar
- M. P. I. Forum. MPI-2: Extensions to the message-passing interface, 1997. http://www.mpiforum.org/docs/mpi-20-html/mpi2-report.htmlGoogle Scholar
- W. Gropp, E. Lusk, N. Doss, and A. Skjellum. Mpich: A high-performance, portable implementation of the mpi message passing interface standard. Parallel Computing, 22(6):789--828, September 1996 Google ScholarDigital Library
- IBM Blue Gene Team. Overview of the Blue Gene/P project. IBM J. Res. Dev., 52(1/2), January (2008). http://www.research.ibm.com/journal/rd/521/team.html Google ScholarDigital Library
- L. V. Kale, M. Bhandarkar, N. Jagathesan, e S. Krishnan, and J. Yelon. Converse: An Interoperable Framework for Parallel Programming. In Proceedings of the 10th International Parallel Processing Symposium, pages 212--217, Honolulu, Hawaii, April 1996 Google ScholarDigital Library
- L. V. Kale and S. Krishnan. Charm++: Parallel Programming with Message-Driven Objects. In G. V. Wilson and P. Lu, editors, Parallel Programming using C++, pages 175--213. MIT Press, 1996Google Scholar
- S. Kumar, C. Huang, G. Almasi, and L. V. Kale Achieving strong scaling with NAMD on Blue Gene/L. In Proceedings of IEEE International Parallel and Distributed Processing Symposium 2006, April 2006 Google ScholarDigital Library
- Myrinet Inc. Myrinet Express (MX), A High Performance Low Level Message Passing Interface for Myrinet, January 2006Google Scholar
- J. Nieplocha and B. Carpenter. ARMCI: A portable remote memory copy library for distributed array libraries and compiler run-time systems. Lecture Notes in Computer Science, 1586, 1999. Google ScholarDigital Library
- F. Petrini, W. chun Feng, A. Hoisie, S. Coll, and E. Frachtenberg. The quadrics network: high-performance clustering technology. IEEE Micro, 22(1):46--57, 2002. Google ScholarDigital Library
- W. Gropp and E. Lusk. MPICH ADI Implementation Reference Manual, August 1995.Google Scholar
Index Terms
- The deep computing messaging framework: generalized scalable message passing on the blue gene/P supercomputer
Recommendations
PAMI: A Parallel Active Message Interface for the Blue Gene/Q Supercomputer
IPDPS '12: Proceedings of the 2012 IEEE 26th International Parallel and Distributed Processing SymposiumThe Blue Gene/Q machine is the next generation in the line of IBM massively parallel supercomputers, designed to scale to 262144 nodes and sixteen million threads. With each BG/Q node having 68 hardware threads, hybrid programming paradigms, which use ...
Architecture of the Component Collective Messaging Interface
Different programming paradigms utilize a variety of collective communication operations, often with different semantics. We present the component collective messaging interface (CCMI) that can support asynchronous non-blocking collectives and is ...
Implementation and evaluation of shared-memory communication and synchronization operations in MPICH2 using the Nemesis communication subsystem
This paper presents the implementation of MPICH2 over the Nemesis communication subsystem and the evaluation of its shared-memory performance. We describe design issues as well as some of the optimization techniques we employed. We conducted a ...
Comments