DOI: 10.1145/2488551.2488562

Runtime message uniquification for accurate communication analysis on incomplete MPI event traces

Published: 15 September 2013

Abstract

Communication analysis of parallel applications based on event traces depends on correct matching of associated MPI send and receive events. Selective monitoring techniques, however, may result in incomplete MPI event traces, and in that case current matching strategies fail. In this paper, we introduce an additional unique identifier for each message so that its MPI events can be distinguished from those of other messages. This makes it possible to identify missing MPI events and to match all remaining MPI events correctly. An overhead study with a real-life application and a benchmark suite demonstrates the applicability and benefits of this approach.
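The matching problem described in the abstract can be illustrated with a small sketch. It is not the paper's implementation: the `Event` record, the `msg_id` field, and both matching functions are hypothetical names introduced here, and the sketch assumes the identifier is already present on both the send and the receive event (how it is generated and transported at runtime is not covered). It compares order-based matching per (source, destination, tag) channel with identifier-based matching on a trace whose first send event was not recorded.

```python
from collections import defaultdict, deque
from typing import NamedTuple, Optional


class Event(NamedTuple):
    kind: str                      # "send" or "recv"
    src: int                       # sending rank
    dst: int                       # receiving rank
    tag: int                       # MPI message tag
    msg_id: Optional[int] = None   # hypothetical per-message unique identifier


def match_fifo(events):
    """Order-based matching: pair sends and receives per channel in FIFO order."""
    pending = defaultdict(deque)   # channel -> indices of unmatched sends
    pairs = []
    for i, ev in enumerate(events):
        key = (ev.src, ev.dst, ev.tag)
        if ev.kind == "send":
            pending[key].append(i)
        elif pending[key]:
            pairs.append((pending[key].popleft(), i))
    return pairs


def match_by_id(events):
    """Identifier-based matching: pair events that carry the same msg_id.

    Returns (pairs, unmatched), where unmatched lists the indices of events
    whose partner event is missing from the trace.
    """
    sends, recvs = {}, {}
    for i, ev in enumerate(events):
        (sends if ev.kind == "send" else recvs)[ev.msg_id] = i
    pairs = sorted((sends[m], recvs[m]) for m in sends.keys() & recvs.keys())
    unmatched = sorted(
        sends.get(m, recvs.get(m)) for m in sends.keys() ^ recvs.keys()
    )
    return pairs, unmatched


# Three messages from rank 0 to rank 1; the send event of message 1 was
# not recorded (for example, monitoring was switched off on the sender).
trace = [
    Event("send", 0, 1, 0, msg_id=2),
    Event("send", 0, 1, 0, msg_id=3),
    Event("recv", 0, 1, 0, msg_id=1),
    Event("recv", 0, 1, 0, msg_id=2),
    Event("recv", 0, 1, 0, msg_id=3),
]

print(match_fifo(trace))
print(match_by_id(trace))
```

On this trace the order-based matcher pairs the send of message 2 with the receive of message 1 and the send of message 3 with the receive of message 2, so every reported pair is wrong. The identifier-based matcher pairs messages 2 and 3 correctly and reports the receive of message 1 as an event whose partner is missing.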


Published In

EuroMPI '13: Proceedings of the 20th European MPI Users' Group Meeting
September 2013
289 pages
ISBN:9781450319034
DOI:10.1145/2488551

Sponsors

  • ARCOS: Computer Architecture and Technology Area, Universidad Carlos III de Madrid

Publisher

Association for Computing Machinery

New York, NY, United States

Author Tags

  1. communication analysis
  2. event tracing
  3. message matching
  4. message passing interface
  5. point-to-point communication

Qualifiers

  • Research-article

Conference

EuroMPI '13
Sponsor:
  • ARCOS
EuroMPI '13: 20th European MPI Users' Group Meeting
September 15 - 18, 2013
Madrid, Spain

Acceptance Rates

EuroMPI '13 Paper Acceptance Rate 22 of 47 submissions, 47%;
Overall Acceptance Rate 66 of 139 submissions, 47%

Cited By

  • (2017) Using adaptive runtime filtering to support an event-based performance analysis. Concurrency and Computation: Practice and Experience, 29:7. DOI: 10.1002/cpe.4094. Online publication date: 24-Feb-2017
  • (2016) OTFX: An In-memory Event Tracing Extension to the Open Trace Format 2. Algorithms and Architectures for Parallel Processing, pages 3-17. DOI: 10.1007/978-3-319-49956-7_1. Online publication date: 19-Nov-2016
  • (2015) MPI-focused Tracing with OTFX. Proceedings of the 22nd European MPI Users' Group Meeting, pages 1-8. DOI: 10.1145/2802658.2802664. Online publication date: 21-Sep-2015
  • (2015) Adaptive Runtime Filtering. Proceedings of the 2015 IEEE 18th International Conference on Computational Science and Engineering (CSE), pages 262-269. DOI: 10.1109/CSE.2015.43. Online publication date: 21-Oct-2015
  • (2015) Towards Detailed Exascale Application Analysis — Selective Monitoring and Visualisation. Solving Software Challenges for Exascale, pages 122-129. DOI: 10.1007/978-3-319-15976-8_9. Online publication date: 19-Feb-2015
