skip to main content
10.1145/2479871.2479901acmconferencesArticle/Chapter ViewAbstractPublication PagesicpeConference Proceedingsconference-collections
research-article

Parallelism profiling and wall-time prediction for multi-threaded applications

Published:21 April 2013Publication History

ABSTRACT

A detailed and accurate characterization of the parallelism of applications is essential for predicting their wall-time on different platforms, both for an application running in isolation and for a set of consolidated applications executing on the same platform. However, prevailing profilers are often based on sampling and do not provide exact information on the parallelism of the profiled application. In this paper we present a novel profiler that logs all thread scheduling activities within the operating system kernel. These logs enable us to accurately characterize applications' parallelism on a given platform by computing the number of threads that are active at each moment. We also present a simple mathematical prediction model to estimate wall-time for program execution on a k2-core machine using profiles collected using a k1-core machine (of the same architecture and running at the same clock speed). We use our profiler to assess the parallelism of several CPU-bound DaCapo benchmarks and evaluate the accuracy of our prediction model.

References

  1. D. Ansaloni, L. Y. Chen, E. Smirni, and W. Binder. Model-driven consolidation of Java workloads on multicores. In Proc. of DSN, pages 229--234, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. W. Binder, J. Hulaas, P. Moret, and A. Villazón. Platform-independent profiling in a virtual execution environment. Softw., Pract. Exper., 39(1):47--79, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. S. M. Blackburn, R. Garner, C. Hoffmann, A. M. Khang, K. S. McKinley, R. Bentzur, A. Diwan, D. Feinberg, D. Frampton, S. Z. Guyer, M. Hirzel, A. Hosking, M. Jump, H. Lee, J. E. B. Moss, A. Phansalkar, D. Stefanović, T. VanDrunen, D. von Dincklage, and B. Wiedermann. The DaCapo benchmarks: Java benchmarking development and analysis. SIGPLAN Not., 41(10):169--190, Oct. 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. L. Y. Chen, D. Ansaloni, E. Smirni, A. Yokokawa, and W. Binder. Achieving application-centric performance targets via consolidation on multicores: Myth or reality? In Proc. of HPDC, pages 229--234, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. J. Cook, J. Cook, and W. Alkohlani. A statistical performance model of the Opteron processor. SIGMETRICS Perform. Eval. Rev., 38(4):75--80, Mar. 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. T. Dey, W. Wang, J. Davidson, and M. Soffa. Characterizing multi-threaded applications based on shared-resource contention. In Proc. of ISPASS, pages 76--86, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. A. Georges, D. Buytaert, and L. Eeckhout. Statistically rigorous Java performance evaluation. SIGPLAN Not., 42(10):57--76, Oct. 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. A. Grama, G. Karypis, V. Kumar, and A. Gupta. Introduction to Parallel Computing. Pearson Education, 2003.Google ScholarGoogle Scholar
  9. M. D. Hill and M. R. Marty. Amdahl's law in the multicore era. IEEE COMPUTER, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. D. Jeon, S. Garcia, C. Louie, and M. B. Taylor. Kismet: parallel speedup estimates for serial programs. SIGPLAN Not., 46(10):519--536, Oct. 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. T. Mytkowicz, A. Diwan, M. Hauswirth, and P. F. Sweeney. Producing wrong data without doing anything obviously wrong! SIGPLAN Not., 44(3):265--276, Mar. 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. A. Peternier, D. Bonetta, W. Binder, and C. Pautasso. Overseer: Low-level hardware monitoring and management for Java. In Proc. of PPPJ, pages 143--146, Denmark, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. N. Tallent and J. Mellor-Crummey. Effective performance measurement and analysis of multithreaded applications. In Proc. of PPoPP, pages 229--240, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. A. Wong, D. Rexachs, and E. Luque. Pas2p tool, parallel application signature for performance prediction. In Proc. of PARA, pages 293--302, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. R. Yang, J. Antony, and A. P. Rendell. A simple performance model for multithreaded applications executing on non-uniform memory access computers. In Proc. of HPCC, pages 79--86, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Parallelism profiling and wall-time prediction for multi-threaded applications

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          ICPE '13: Proceedings of the 4th ACM/SPEC International Conference on Performance Engineering
          April 2013
          446 pages
          ISBN:9781450316361
          DOI:10.1145/2479871

          Copyright © 2013 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 21 April 2013

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article

          Acceptance Rates

          ICPE '13 Paper Acceptance Rate28of64submissions,44%Overall Acceptance Rate252of851submissions,30%

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader