skip to main content
10.1145/3273905.3273906acmotherconferencesArticle/Chapter ViewAbstractPublication PagesrtnsConference Proceedingsconference-collections
research-article

Analysing Real-Time Behaviour of Collective Communication Patterns in MPI

Published: 10 October 2018 Publication History

Abstract

Worst-case execution time (WCET) analysis is crucial for designing real-time systems. While the WCET of tasks in a single core system can be upper bounded in isolation, the tasks in a manycore system are subject to shared memory interferences which impose high overestimation of the WCET bounds. However, manycore-based massively parallel applications will enter the area of real-time systems in the years ahead. Explicit message-passing and a clear separation of computation and communication facilitates WCET analysis for those programs. Thereby, the separation is especially ensured if applying collective communication.
We propose a process of analysing state-of-the-art communication patterns with respect to worst case timing. As MPI is the standard for performing collective communication, we use it to show how to evaluate the timing behaviour in detail. We compare different communication patterns and show the tremendous impact of choosing an appropriate one.

References

[1]
Clément Ballabriga, Hugues Cassé, Christine Rochange, and Pascal Sainrat. 2011. OTAWA: An Open Toolbox for Adaptive WCET Analysis. In Software Technologies for Embedded and Ubiquitous Systems. LNCS, Vol. 6399. Springer Berlin Heidelberg, 35--46.
[2]
Gregory D. Benson, Cho-Wai Chu, Qing Huang, and Sadik G. Caglar. 2003. A comparison of MPICH allgather algorithms on switched networks. In Recent Advances in Parallel Virtual Machine and Message Passing Interface: 10th European PVM/MPI User's Group Meeting, Venice, Italy, September 29 - October 2, 2003. Proceedings. Springer, 335--343.
[3]
David Culler, Richard Karp, David Patterson, Abhijit Sahay, Klaus Erik Schauser, Eunice Santos, Ramesh Subramonian, and Thorsten Von Eicken. 1993. LogP: Towards a realistic model of parallel computation. In ACM Sigplan Notices, Vol. 28. ACM, 1--12.
[4]
Martin Frieb, Alexander Stegmeier, Jörg Mische, and Theo Ungerer. 2016. Employing MPI Collectives for Timing Analysis on Embedded Multi-Cores. In 16th International Workshop on Worst-Case Execution Time Analysis (WCET).
[5]
Edgar Gabriel, Graham E. Fagg, George Bosilca, Thara Angskun, Jack J. Dongarra, Jeffrey M. Squyres, Vishal Sahay, Prabhanjan Kambadur, Brian Barrett, Andrew Lumsdaine, Ralph H. Castain, David J. Daniel, Richard L. Graham, and Timothy S. Woodall. 2004. Open MPI: Goals, Concept, and Design of a Next Generation MPI Implementation. In Recent Advances in Parallel Virtual Machine and Message Passing Interface: 11th European PVM/MPI Users' Group Meeting Budapest, Hungary, September 19 - 22, 2004. Proceedings, Dieter Kranzlmüller, Péter Kacsuk, and Jack Dongarra (Eds.). Springer Berlin Heidelberg, Berlin, Heidelberg, 97--104.
[6]
Kees Goossens and Andreas Hansson. 2010. The Aethereal Network on Chip After Ten Years: Goals, Evolution, Lessons, and Future. In 47th Design Automation Conference.
[7]
T. Hoefler, L. Cerquetti, T. Mehlan, F. Mietke, and W. Rehm. 2005. A practical approach to the rating of barrier algorithms using the LogP model and Open MPI. In 2005 International Conference on Parallel Processing Workshops (ICPPW'05). 562--569.
[8]
T. Hoefler, A. Lumsdaine, and W. Rehm. 2007. Implementation and performance analysis of non-blocking collective operations for MPI. In Proceedings of the 2007 ACM/IEEE Conference on Supercomputing, 2007. SC '07. 1--10.
[9]
Jiri Jaros, Milos Ohlidal, and Vaclav Dvorak. 2006. Complexity of collective communications on NoCs. In International Symposium on Parallel Computing in Electrical Engineering, 2006. PAR ELEC 2006. IEEE, 127--133.
[10]
Arkady Kanevsky, Anthony Skjellum, and Anna Rounbehler. 1998. MPI/RT - an emerging standard for high-performance real-time systems. In 31th Hawaii International Conference on System Sciences. IEEE, 157--166.
[11]
Jiuxing Liu, Balasubramanian Chandrasekaran, Jiesheng Wu, Weihang Jiang, Sushmitha Kini, Weikuan Yu, Darius Buntinas, Peter Wyckoff, and D K. Panda. 2003. Performance Comparison of MPI Implementations over InfiniBand, Myrinet and Quadrics. In Proceedings of the 2003 ACM/IEEE Conference on Supercomputing (SC '03). ACM, New York, NY, USA.
[12]
Jörg Mische, Martin Frieb, Alexander Stegmeier, and Theo Ungerer. 2017. Reduced Complexity Many-Core: Timing Predictability Due to Message-Passing. In Architecture of Computing Systems - ARCS 2017: 30th International Conference, Vienna, Austria, April 3--6, 2017, Proceedings. Springer International Publishing, Cham, 139--151.
[13]
Jörg Mische and Theo Ungerer. 2014. Guaranteed Service Independent of the Task Placement in NoCs with Torus Topology. In 22nd International Conference on Real-Time Networks and Systems (RTNS '14). ACM, Article 151, 10 pages.
[14]
MPI-forum. 2015. MPI: A Message-Passing Interface Standard Version 3.1. (2015). available at http://www.mpi-forum.org/docs/mpi-3.1/mpi31-report.pdf.
[15]
Sarp Oral and Alan D. George. 2004. Multicast performance modeling and evaluation for high-speed unidirectional torus networks. Microprocessors and Microsystems 28, 9 (2004), 477--489.
[16]
Jelena Pješivac-Grbović, Thara Angskun, George Bosilca, Graham E. Fagg, Edgar Gabriel, and Jack J. Dongarra. 2007. Performance analysis of MPI collective operations. Cluster Computing 10, 2 (2007), 127--143.
[17]
Eberle A. Rambo and Rolf Ernst. 2015. Worst-case Communication Time Analysis of Networks-on-chip with Shared Virtual Channels. In Proceedings of the 2015 Design, Automation and Test in Europe Conference and Exhibition (DATE '15). EDA Consortium, San Jose, CA, USA, 537--542. http://dl.acm.org/citation.cfm?id=2755753.2755874
[18]
Christine Rochange, Armelle Bonenfant, Pascal Sainrat, Mike Gerdes, Julian Wolf, Theo Ungerer, Zlatko Petrov, and Frantisek Mikulu. 2010. WCET Analysis of a Parallel 3D Multigrid Solver Executed on the MERASA Multi-Core. In 10th International Workshop on Worst-Case Execution Time Analysis (WCET 2010) (OpenAccess Series in Informatics (OASIcs)), Björn Lisper (Ed.), Vol. 15. Schloss Dagstuhl--Leibniz-Zentrum fuer Informatik, Dagstuhl, Germany, 90--100.
[19]
M. Schoeberl. 2007. A Time-Triggered Network-on-Chip. In 2007 International Conference on Field Programmable Logic and Applications. 377--382.
[20]
Martin Schoeberl, Florian Brandner, Jens Sparsø, and Evangelia Kasapaki. 2012. A Statically Scheduled Time-Division-Multiplexed Network-on-Chip for Real-Time Systems. In Proceedings of the 2012 IEEE/ACM Sixth International Symposium on Networks-on-Chip (NOCS '12). IEEE Computer Society, Washington, DC, USA, 152--160.
[21]
Anthony Skjellum, Arkady Kanevsky, Yoginder S. Dandass, Jerrell Watts, Steve Paavola, Dennis Cottel, Greg Henley, L. Shane Hebert, Zhenqian Cui, and Anna Rounbehler. 2004. The Real-Time Message Passing Interface Standard (MPI/RT-1.1). Concurrency and Computation: Practice and Experience 16, S1 (2004).
[22]
Rasmus Bo Sørensen, Wolfgang Puffitsch, Martin Schoeberl, and Jens Sparsø. 2015. Message Passing on a Time-predictable Multi-core Processor. In 18th International Symposium on Real-Time Distributed Computing (ISORC). IEEE, 51--59.
[23]
R. A. Stefan, A. Molnos, and K. Goossens. 2014. dAElite: A TDM NoC Supporting QoS, Multicast, and Fast Connection SetUp. IEEE Trans. Comput. 63, 3 (March 2014), 583--594.
[24]
Alexander Stegmeier, Jörg Mische, Martin Frieb, and Theo Ungerer. 2016. WCTT Bounds for MPI Primitives in the PaterNoster NoC. In 14th International Workshop on Real-Time Networks (RTN), Vol. 13. ACM, New York, NY, USA, 25--30.
[25]
Open MPI Development Team. 2016. Open MPI main development repository. https://github.com/open-mpi/ompi. (2016).
[26]
S. Tobuschat and R. Ernst. 2017. Efficient Latency Guarantees for Mixed-Criticality Networks-on-Chip. In 2017 IEEE Real-Time and Embedded Technology and Applications Symposium (RTAS). 113--122.
[27]
S. Tobuschat and R. Ernst. 2017. Real-time communication analysis for Networks-on-Chip with backpressure. In Design, Automation Test in Europe Conference Exhibition (DATE), 2017. 590--595.
[28]
Bibo Tu, Jianping Fan, Jianfeng Zhan, and Xiaofang Zhao. 2012. Performance analysis and optimization of MPI collective operations on multi-core clusters. The Journal of Supercomputing 60, 1 (01 Apr 2012), 141--162.

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
RTNS '18: Proceedings of the 26th International Conference on Real-Time Networks and Systems
October 2018
277 pages
ISBN:9781450364638
DOI:10.1145/3273905
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

In-Cooperation

  • University of Poitiers: University of Poitiers

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 10 October 2018

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. MPI
  2. Manycore
  3. Real Time
  4. WCET

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

RTNS '18
RTNS '18: 26th International Conference on Real-Time Networks and Systems
October 10 - 12, 2018
Chasseneuil-du-Poitou, France

Acceptance Rates

RTNS '18 Paper Acceptance Rate 25 of 52 submissions, 48%;
Overall Acceptance Rate 119 of 255 submissions, 47%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 76
    Total Downloads
  • Downloads (Last 12 months)6
  • Downloads (Last 6 weeks)2
Reflects downloads up to 05 Mar 2025

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media