The TH Express high performance interconnect networks

Pang, Zhengbin; Xie, Min; Zhang, Jun; Zheng, Yi; Wang, Guibin; Dong, Dezun; Suo, Guang

doi:10.1007/s11704-014-3500-9

The TH Express high performance interconnect networks

Research Article
Published: 06 June 2014

Volume 8, pages 357–366, (2014)
Cite this article

Frontiers of Computer Science Aims and scope Submit manuscript

Zhengbin Pang^1,2,
Min Xie²,
Jun Zhang²,
Yi Zheng²,
Guibin Wang²,
Dezun Dong^1,2 &
…
Guang Suo²

353 Accesses
37 Citations
3 Altmetric
Explore all metrics

Abstract

Interconnection network plays an important role in scalable high performance computer (HPC) systems. The TH Express-2 interconnect has been used in MilkyWay-2 system to provide high-bandwidth and low-latency interprocessor communications, and continuous efforts are devoted to the development of our proprietary interconnect. This paper describes the state-of-the-art of our proprietary interconnect, especially emphasizing on the design of network interface. Several key features are introduced, such as user-level communication, remote direct memory access, offload collective operation, and hardware reliable end-to-end communication, etc. The design of a low level message passing infrastructures and an upper message passing services are also proposed. The preliminary performance results demonstrate the efficiency of the TH interconnect interface.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

In-Network Monitoring Strategies for HPC Cloud

Human information processing in complex networks

Article 15 June 2020

INAM2: InfiniBand Network Analysis and Monitoring with MPI

Discover the latest articles and news from researchers in related subjects, suggested using machine learning.

References

Top500, http://www.top500.org, 2013
Liao K X, Xiao Q L, Yang Q C, Lu T Y. MilkyWay-2 supercomputer system and application. Submitted to Frontiers of Computer Science, 2013
Google Scholar
Pritchard H, Gorodetsky I, Buntinas D. A ugni-based mpich2 nemesis network module for the cray xe. In: Proceedings of the 18th European MPI Users’ Group Conference on Recent Advances in the Message Passing Interface. 2011, 110–119
Chapter Google Scholar
Xie M, Lu Y, Liu L, Cao H, Yang X. Implementation and evaluation of network interface and message passing services for Tianhe-1a supercomputer. In: Proceedings of the 19th IEEE Annual Symposium on High Performance Interconnects. 2011, 78–86
Google Scholar
Chun B N, Mainwaring A, Culler D E. Virtual network transport protocols for myrinet. IEEE Micro, 1998, 18(1): 53–63
Article Google Scholar
Araki S, Bilas A, Dubnicki C, Edler J, Konishi K, Philbin J. User-space communication: a quantitative study. In: Proceedings of the 1998 ACM/IEEE Conference on Supercomputing (CDROM). 1998, 1–16
Google Scholar
Bhoedjang R A, Ruhl T, Bal H E. User-level network interface protocols. Computer, 1998, 31(11): 53–60
Article Google Scholar
Schoinas I, Hill M D. Address translation mechanisms in network interfaces. In: Proceedings of the 4th International Symposium on High-Performance Computer Architecture. 1998, 219–230
Google Scholar
InfiniBand Architecture Specification: Release 1.0. InfiniBand Trade Association, 2000
Google Scholar
Graham R L, Poole S, Shamis P, Bloch G, Bloch N, Chapman H, Kagan M, Shahar A, Rabinovitz I, Shainer G. Overlapping computation and communication: Barrier algorithms and connectx-2 core-direct capabilities. In: Proceedings of the 2010 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum. 2010, 1–8
Google Scholar
Kandalla K, Subramoni H, Vienne J, Raikar S P, Tomko K, Sur S, Panda D K. Designing non-blocking broadcast with collective offload on infiniband clusters: A case study with hpl. In: Proceedings of the 19th IEEE Annual Symposium on High Performance Interconnects. 2011, 27–34
Google Scholar
MPICH2: High-performance and Widely Portable MPI. http://www.mcs.anl.gov/research/projects/mpich2/
Buntinas D, Goglin B, Goodell D, Mercier G, Moreaud S. Cacheefficient, intranode, large-message mpi communication with mpich2-nemesis. In: Proceedings of the 2009 International Conference on Parallel Processing. 2009, 462–469
Chapter Google Scholar
Lauria M, Pakin S, Chien A. Efficient layering for high speed communication: Fast messages 2. x. In: Proceedings of the 7th International Symposium on High Performance Distributed Computing. 1998, 10–20
Google Scholar
Liu J, Panda D K. Implementing efficient and scalable flow control schemes in MPI over infiniband. In: Proceedings of the 2004 International Parallel and Distributed Processing Symposium. 2004, 183b
Google Scholar
Tezuka H, O’Carroll F, Hori A, Ishikawa Y. Pin-down cache: a virtual memory management technique for zero-copy communication. In: Proceedings of the 1998 Symposium on Parallel and Distributed Processing. 1998, 308–314
Google Scholar
MVAPICH: MPI over InfiniBand, 10GigE/iWARP and RoCE, 2013
Vetter J S, Mueller F. Communication characteristics of large-scale scientific applications for contemporary cluster architectures. Journal of Parallel and Distributed Computing, 2003, 63(9): 853–865
Article MATH Google Scholar
Chiu G. The IBM blue gene project. IBM Journal of Research and Development, 2013, 57(1): 1–6
Google Scholar
Chen D, Eisley N A, Heidelberger P, Senger R M, Sugawara Y, Kumar S, Salapura V, Satterfield D L, Steinmacher-Burow B, Parker J J. The IBM blue gene/q interconnection fabric. IEEE Micro, 2012, 32(1): 32–43
Article MATH Google Scholar
Ajima Y, Takagi Y, Inoue T, Hiramoto S, Shimizu T. The tofu interconnect. In: Proceedings of the 19th IEEE Annual Symposium on High Performance Interconnects. 2011, 87–94
Google Scholar
Alverson R, Roweth D, Kaplan L. The gemini system interconnect. In: Proceedings of the 18th IEEE Annual Symposium on High Performance Interconnects. 2010, 83–87
Google Scholar
Schroeder B, Gibson G A. Understanding failures in petascale computers. In: Journal of Physics: Conference Series. 2007, Article 012022
Google Scholar
Graham R L, Poole S, Shamis P, Bloch G, Bloch N, Chapman H, Kagan M, Shahar A, Rabinovitz I, Shainer G. Connectx-2 infiniband management queues: first investigation of the new support for network offloaded collective operations. In: Proceedings of the 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing. 2010, 53–62
Google Scholar
Subramoni H, Kandalla K, Sur S, Panda D K. Design and evaluation of generalized collective communication primitives with overlap using connectx-2 offload engine. In: Proceedings of the 18th IEEE Annual Symposium on High Performance Interconnects. 2010, 40–49
Google Scholar
Arimilli B, Arimilli R, Chung V, Clark S, Denzel W, Drerup B, Hoefler T, Joyner J, Lewis J, Li J. The percs high-performance interconnect. In: Proceedings of the 18th IEEE Annual Symposium on High Performance Interconnects. 2010, 75–82
Google Scholar

Download references

Author information

Authors and Affiliations

Science and Technology on Parallel and Distributed Processing Laboratory, National University of Defense Technology, Changsha, 410073, China
Zhengbin Pang & Dezun Dong
College of Computer, National University of Defense Technology, Changsha, 410073, China
Zhengbin Pang, Min Xie, Jun Zhang, Yi Zheng, Guibin Wang, Dezun Dong & Guang Suo

Authors

Zhengbin Pang
View author publications
Search author on:PubMed Google Scholar
Min Xie
View author publications
Search author on:PubMed Google Scholar
Jun Zhang
View author publications
Search author on:PubMed Google Scholar
Yi Zheng
View author publications
Search author on:PubMed Google Scholar
Guibin Wang
View author publications
Search author on:PubMed Google Scholar
Dezun Dong
View author publications
Search author on:PubMed Google Scholar
Guang Suo
View author publications
Search author on:PubMed Google Scholar

Corresponding author

Correspondence to Guibin Wang.

Additional information

Zhengbin Pang received the BS, MS, and PhD degrees in computer science from National University of Defense Technology (NUDT), China. He is a professor in College of Computer, NUDT. His research interests include parallel and distributed computing, and high performance computer systems.

Min Xie is a professor in College of Computer at National University of Defense Technology (NUDT), China. His research interests include high-speed interconnects, system software and parallel and distributed computing. He has a PhD in computer science from NUDT.

Jun Zhang received the MS degree in computer science from National University of Defense Technology (NUDT), China. Currently he is an assistant professor at the university. His research interests include high speed communication and ASIC design.

Yi Zheng received the PhD degrees in computer science from National University of Defense Technology (NUDT), China. Currently he is an associate professor at the university. His research interests including high performance computer architecture and high performance networks.

Guibin Wang received the BS, MS, and PhD degrees from National University of Defense Technology (NUDT), China in 2004, 2007, and 2011, respectively. Currently, he is an assistant professor in College of Computer, NUDT. His research interests include high-performance computer systems, heterogeneous parallel systems.

Dezun Dong received the BS, MS, and PhD degrees from the National University of Defense Technology (NUDT), China in 2002, 2004, and 2010, respectively. Currently, he is an associate professor in College of Computer, NUDT, China. His research interests include high-performance computer systems, distributed computing, and wireless networks. He is a member of ACM and IEEE.

Guang Suo received his BS in computer science from National University of Defense Technology (NUDT), China in 2003, and received his MS and PhD in computer science from NUDT in 2005 and 2009, respectively. He is an assistant professor in Institute of Computers, NUDT. He has played an important role in the implementation and optimization of MPI library of MilkWay supercomputers. His research interests are in parallel copmuting, operating system, and HPC runtime systems.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Pang, Z., Xie, M., Zhang, J. et al. The TH Express high performance interconnect networks. Front. Comput. Sci. 8, 357–366 (2014). https://doi.org/10.1007/s11704-014-3500-9

Download citation

Received: 16 December 2013
Accepted: 06 March 2014
Published: 06 June 2014
Issue Date: June 2014
DOI: https://doi.org/10.1007/s11704-014-3500-9

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

The TH Express high performance interconnect networks

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

In-Network Monitoring Strategies for HPC Cloud

Human information processing in complex networks

INAM2: InfiniBand Network Analysis and Monitoring with MPI

Explore related subjects

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now