skip to main content
research-article

WCTT bounds for MPI primitives in the PaterNoster NoC

Published: 03 November 2016 Publication History

Abstract

This paper applies several variants of application independent time-division multiplexing to MPI primitives and investigates their applicability for different scopes of communication. Thereby, the scopes are characterized by the size of the network-on-chip, the number of participating nodes and the message size sent to each receiver or received from each sender, respectively. The evaluation shows that none of the observed variants feature the lowest worst-case traversal time in all situations. Instead there are multiple schedule variants which each perform best in a different scope of communication parameters.

References

[1]
M. Frieb, A. Stegmeier, J. Mische, and T. Ungerer. Employing MPI Collectives for Timing Analysis on Embedded Multi-Cores. In 16th International Workshop on Worst-Case Execution Time Analysis (WCET), 2016.
[2]
K. Goossens, A. Azevedo, K. Chandrasekar, et al. Virtual execution platforms for mixed-time-criticality systems: The CompSOC architecture and design flow. ACM SIGBED Review, 10(3):23--34, 2013.
[3]
K. Goossens and A. Hansson. The Aethereal Network on Chip After Ten Years: Goals, Evolution, Lessons, and Future. In 47th Design Automation Conference, 2010.
[4]
T. Hoefler, T. Mehlan, et al. A survey of barrier algorithms for coarse grained supercomputers. 2004.
[5]
Intel Labs. SCC external architecture specification (EAS). Technical report, Intel Corparation, 2010.
[6]
A. Kanevsky, A. Skjellum, and A. Rounbehler. MPI/RT-an emerging standard for high-performance real-time systems. In 31th Hawaii International Conference on System Sciences, pages 157--166. IEEE, 1998.
[7]
P. K. McKinley, Y. jia Tsai, and D. F. Robinson. Collective communication in wormhole-routed massively parallel computers. Computer, 28(12):39--50, Dec 1995.
[8]
J. Mische and T. Ungerer. Guaranteed Service Independent of the Task Placement in NoCs with Torus Topology. In 22Nd International Conference on Real-Time Networks and Systems, RTNS '14, pages 151--160. ACM, 2014.
[9]
MPI-forum. MPI: A Message-Passing Interface Standard Version 3.0, 2012. available at http://www.mpi-forum.org/docs/mpi-3.1/mpi31-report.pdf.
[10]
S. Oral and A. D. George. Multicast performance modeling and evaluation for high-speed unidirectional torus networks. Microprocessors and Microsystems, 28(9):477--489, 2004.
[11]
W. Puffitsch, E. Noulard, and C. Pagetti. Mapping a multi-rate synchronous language to a many-core processor. In 19th Real-Time and Embedded Technology and Applications Symposium (RTAS), pages 293--302, 2013.
[12]
D. F. Robinson, P. K. McKinley, and B. H. Cheng. Optimal multicast communication in wormhole-routed torus networks. IEEE Transactions on Parallel and Distributed Systems, 6(10):1029--1042, 1995.
[13]
D. F. Robinson, P. K. McKinley, and B. H. Cheng. Path-based multicast communication in wormhole-routed unidirectional torus networks. Journal of Parallel and Distributed Computing, 45(2):104--121, 1997.
[14]
J. Scheller. Real-time operating systems for many-core platforms. Mém. de mast. Toulouse, France: ISAE/ONERA, 2012.
[15]
M. Schoeberl. A Time-Triggered Network-on-Chip. In 2007 International Conference on Field Programmable Logic and Applications, pages 377--382, Aug 2007.
[16]
M. Schoeberl, F. Brandner, J. Sparsø, and E. Kasapaki. A Statically Scheduled Time-Division-Multiplexed Network-on-Chip for Real-Time Systems. In Proceedings of the 2012 IEEE/ACM Sixth International Symposium on Networks-on-Chip, NOCS '12, pages 152--160, Washington, DC, USA, 2012. IEEE Computer Society.
[17]
A. Skjellum, A. Kanevsky, Y. S. Dandass, J. Watts, S. Paavola, D. Cottel, G. Henley, L. S. Hebert, Z. Cui, and A. Rounbehler. The Real-Time Message Passing Interface Standard (MPI/RT-1.1). Concurrency and Computation: Practice and Experience, 16(S1), 2004.
[18]
R. B. Sørensen, W. Puffitsch, M. Schoeberl, and J. Sparsø. Message passing on a time-predictable multicore processor. In 18th International Symposium on Real-Time Distributed Computing (ISORC), pages 51--59. IEEE, 2015.
[19]
R. A. Stefan, A. Molnos, and K. Goossens. dAElite: A TDM NoC Supporting QoS, Multicast, and Fast Connection Set-Up. IEEE Transactions on Computers, 63(3):583--594, March 2014.
[20]
R. Thakur and W. D. Gropp. Improving the performance of collective operations in MPICH. In Recent Advances in Parallel Virtual Machine and Message Passing Interface, pages 257--267. Springer, 2003.
[21]
Y.-C. Tseng, D. K. Panda, and T.-H. Lai. A trip-based multicasting model in wormhole-routed networks with virtual channels. IEEE Transactions on Parallel and Distributed Systems, 7(2):138--150, Feb 1996.

Cited By

View all
  • (2018)Analysing Real-Time Behaviour of Collective Communication Patterns in MPIProceedings of the 26th International Conference on Real-Time Networks and Systems10.1145/3273905.3273906(137-147)Online publication date: 10-Oct-2018
  • (2017)Reduced Complexity Many-Core: Timing Predictability Due to Message-PassingArchitecture of Computing Systems - ARCS 201710.1007/978-3-319-54999-6_11(139-151)Online publication date: 4-Mar-2017

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM SIGBED Review
ACM SIGBED Review  Volume 13, Issue 4
Special Issue on 14th International Workshop on RealTime Networks (RTN 2016)
September 2016
65 pages
EISSN:1551-3688
DOI:10.1145/3015037
Issue’s Table of Contents

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 03 November 2016
Published in SIGBED Volume 13, Issue 4

Check for updates

Author Tags

  1. MPI primitives
  2. TDM
  3. WCTT
  4. real-time NoC

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)2
  • Downloads (Last 6 weeks)0
Reflects downloads up to 05 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2018)Analysing Real-Time Behaviour of Collective Communication Patterns in MPIProceedings of the 26th International Conference on Real-Time Networks and Systems10.1145/3273905.3273906(137-147)Online publication date: 10-Oct-2018
  • (2017)Reduced Complexity Many-Core: Timing Predictability Due to Message-PassingArchitecture of Computing Systems - ARCS 201710.1007/978-3-319-54999-6_11(139-151)Online publication date: 4-Mar-2017

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media