skip to main content
10.1145/2907294.2907311acmconferencesArticle/Chapter ViewAbstractPublication PageshpdcConference Proceedingsconference-collections
short-paper
Public Access

Parallel Execution Profiles

Published: 31 May 2016 Publication History

Abstract

Observing the relative behavior of an application's threads is critical to identifying performance bottlenecks and understanding their root causes. We present parallel execution profiles (PEPs), which capture the relative behavior of parallel threads in terms of the user selected code regions they execute. The user annotates the program to identify code regions of interest. The PEP divides the execution time of a multithreaded application into time intervals or a sequence of frames during which the code regions being executed in parallel by application threads remain the same. PEPs can be easily analyzed to compute execution times spent by the application in interesting behavior states. This helps user understand the severity of common performance problems such as excessive waiting on events by threads, threads contending for locks, and the presence of straggler threads.

References

[1]
L. Adhianto, S. Banerjee, M. Fagan, M. Krentel, G. Marin, J. Mellor-Crummey, and N. R. Tallent. Hpctoolkit: Tools for performance analysis of optimized parallel programs. Concurrency and Computation: Practice and Experience, 22(6):685--701, 2010.
[2]
T. E. Anderson and E. D. Lazowska. Quartz: A tool for tuning parallel program performance. SIGMETRICS Perform. Eval. Rev., 18(1):115--125, Apr. 1990.
[3]
D. Bohme, F. Wolf, B. R. De Supinski, M. Schulz, and M. Geimer. Scalable critical-path based performance analysis. In Parallel & Distributed Processing Symposium (IPDPS), 2012 IEEE 26th International, pages 1330--1340. IEEE, 2012.
[4]
F. David, G. Thomas, J. Lawall, and G. Muller. Continuously measuring critical section pressure with the free-lunch profiler. In Proceedings of the 2014 ACM International Conference on Object Oriented Programming Systems Languages & Applications, OOPSLA '14, pages 291--307, New York, NY, USA, 2014. ACM.
[5]
S. L. Graham, P. B. Kessler, and M. K. Mckusick. Gprof: A call graph execution profiler. In Proceedings of the 1982 SIGPLAN Symposium on Compiler Construction, SIGPLAN '82, pages 120--126, New York, NY, USA, 1982. ACM.
[6]
J. K. Hollingsworth and B. P. Miller. Parallel program performance metrics: A comprison and validation. In Proceedings of the 1992 ACM/IEEE Conference on Supercomputing, Supercomputing '92, pages 4--13, Los Alamitos, CA, USA, 1992. IEEE Computer Society Press.
[7]
D. Jeon, S. Garcia, C. Louie, and M. B. Taylor. Kismet: Parallel speedup estimates for serial programs. In Proceedings of the 2011 ACM International Conference on Object Oriented Programming Systems Languages and Applications, OOPSLA '11, pages 519--536, New York, NY, USA, 2011. ACM.
[8]
M. Kambadur, K. Tang, and M. A. Kim. Parashares: Finding the important basic blocks in multithreaded programs. In Euro-Par 2014 Parallel Processing, pages 75--86. Springer, 2014.

Cited By

View all
  • (2020)Tails in the cloud: a survey and taxonomy of straggler management within large-scale cloud data centresThe Journal of Supercomputing10.1007/s11227-020-03241-x76:12(10050-10089)Online publication date: 1-Dec-2020
  • (2017)A non-intrusive, operating system independent spinlock profiler for embedded multicore systemsProceedings of the Conference on Design, Automation & Test in Europe10.5555/3130379.3130454(322-325)Online publication date: 27-Mar-2017
  • (2017)A non-intrusive, operating system independent spinlock profiler for embedded multicore systemsDesign, Automation & Test in Europe Conference & Exhibition (DATE), 201710.23919/DATE.2017.7927009(322-325)Online publication date: Mar-2017

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
HPDC '16: Proceedings of the 25th ACM International Symposium on High-Performance Parallel and Distributed Computing
May 2016
302 pages
ISBN:9781450343145
DOI:10.1145/2907294
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

In-Cooperation

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 31 May 2016

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. contention
  2. intrusion
  3. load imbalance
  4. overhead
  5. parallel behaviors
  6. stragglers
  7. waiting

Qualifiers

  • Short-paper

Funding Sources

Conference

HPDC'16
Sponsor:

Acceptance Rates

HPDC '16 Paper Acceptance Rate 20 of 129 submissions, 16%;
Overall Acceptance Rate 166 of 966 submissions, 17%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)58
  • Downloads (Last 6 weeks)10
Reflects downloads up to 01 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2020)Tails in the cloud: a survey and taxonomy of straggler management within large-scale cloud data centresThe Journal of Supercomputing10.1007/s11227-020-03241-x76:12(10050-10089)Online publication date: 1-Dec-2020
  • (2017)A non-intrusive, operating system independent spinlock profiler for embedded multicore systemsProceedings of the Conference on Design, Automation & Test in Europe10.5555/3130379.3130454(322-325)Online publication date: 27-Mar-2017
  • (2017)A non-intrusive, operating system independent spinlock profiler for embedded multicore systemsDesign, Automation & Test in Europe Conference & Exhibition (DATE), 201710.23919/DATE.2017.7927009(322-325)Online publication date: Mar-2017

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media