skip to main content
10.1145/2751205.2751220acmconferencesArticle/Chapter ViewAbstractPublication PagesicsConference Proceedingsconference-collections
research-article
Public Access

COMPASS: A Framework for Automated Performance Modeling and Prediction

Authors Info & Claims
Published:08 June 2015Publication History

ABSTRACT

Flexible, accurate performance predictions offer numerous benefits such as gaining insight into and optimizing applications and architectures. However, the development and evaluation of such performance predictions has been a major research challenge, due to the architectural complexities. To address this challenge, we have designed and implemented a prototype system, named COMPASS, for automated performance model generation and prediction. COMPASS generates a structured performance model from the target application's source code using automated static analysis, and then, it evaluates this model using various performance prediction techniques. As we demonstrate on several applications, the results of these predictions can be used for a variety of purposes, such as design space exploration, identifying performance tradeoffs for applications, and understanding sensitivities of important parameters. COMPASS can generate these predictions across several types of applications from traditional, sequential CPU applications to GPU-based, heterogeneous, parallel applications. Our empirical evaluation demonstrates a maximum overhead of 4%, flexibility to generate models for 9 applications, speed, ease of creation, and very low relative errors across a diverse set of architectures.

References

  1. A. Alexandrov, M. F. Ionescu, K. E. Schauser, and C. Scheiman, "LogGP: Incorporating long messages into the LogP model," in Proceedings of the Seventh Annual ACM Symposium on Parallel Algorithms and Architectures, ser. SPAA '95, 1995, pp. 95--105. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. A. Bhattacharyya and T. Hoefler, "Pemogen: Automatic adaptive performance modeling during program runtime," in Proceedings of the 23rd International Conference on Parallel Architectures and Compilation, ser. PACT '14. New York, NY, USA: ACM, 2014, pp. 393--404. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. C. Chan, D. Unat et al., "Software design space exploration for exascale combustion co-design," in Supercomputing. Springer, 2013, pp. 196--212.Google ScholarGoogle Scholar
  4. D. Culler, R. Karp et al., "LogP: Towards a realistic model of parallel computation," in Proceedings of the Fourth ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, ser. PPOPP '93, 1993, pp. 1--12. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. C. Dave, H. Bae et al., "Cetus: A source-to-source compiler infrastructure for multicores," IEEE Computer, vol. 42, no. 12, pp. 36--42, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. C. L. Janssen, H. Adalsteinsson, and J. P. Kenny, "Using simulation to design extremescale applications and architectures: Programming model exploration," SIGMETRICS Performance Evaluation Review, vol. 38, no. 4, pp. 4--8, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. I. Karlin, A. Bhatele et al., "LULESH programming model and performance ports overview," Lawrence Livermore National Laboratory (LLNL), Livermore, CA, Tech. Rep. LLNL-TR-608824, 2012.Google ScholarGoogle Scholar
  8. B. C. Lee and D. M. Brooks, "Accurate and efficient regression modeling for microarchitectural performance and power prediction," in Proceedings of the 12th international conference on Architectural support for programming languages and operating systems. San Jose, California, USA: ACM, 2006, pp. 185--194. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. S. Lee and J. Vetter, "OpenARC: Open Accelerator Research Compiler for Directive-Based, Efficient Heterogeneous Computing," in HPDC '14: Proceedings of the ACM Symposium on High-Performance Parallel and Distributed Computing, Short Paper, June 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. C.-K. Luk, R. Cohn et al., "Pin: building customized program analysis tools with dynamic instrumentation," in Proceedings of the 2005 ACM SIGPLAN conference on Programming language design and implementation, ser. PLDI '05. New York, NY, USA: ACM, 2005, pp. 190--200. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. M. M. K. Martin, D. J. Sorin et al., "Multifacet's general execution-driven multiprocessor simulator (gems) toolset," SIGARCH Comput. Archit. News, vol. 33, no. 4, pp. 92--99, Nov. 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. OpenACC, "OpenACC: Directives for Accelerators," {Online}. Available: http://www.openacc-standard.org, 2011, (Accessed Jan. 16, 2015).Google ScholarGoogle Scholar
  13. D. J. Quinlan, "Rose: Compiler support for object-oriented frameworks." Parallel Processing Letters, vol. 10, no. 2/3, pp. 215--226, 2000.Google ScholarGoogle ScholarCross RefCross Ref
  14. A. F. Rodrigues, K. S. Hemmert et al., "The structural simulation toolkit," SIGMETRICS Performance Evaluation Review, vol. 38, no. 4, pp. 37--42, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. K. Spafford and J. S. Vetter, "Aspen: A domain specific language for performance modeling," in SC12: ACM/IEEE International Conference for High Performance Computing, Networking, Storage, and Analysis, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. N. R. Tallent and A. Hoisie, "Palm: Easing the burden of analytical performance modeling," in to appear, 2014.Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. L. G. Valiant, "A bridging model for parallel computation," Communications of the ACM, vol. 33, no. 8, pp. 103--111, 1990. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. J. S. Vetter, R. Glassbrook et al., "Keeneland: Bringing heterogeneous GPU computing to the computational science community," IEEE Computing in Science and Engineering, vol. 13, no. 5, pp. 90--95, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. W. Wu and B. C. Lee, "Inferred models for dynamic and sparse hardware-software spaces," in Microarchitecture (MICRO), 2012 45th Annual IEEE/ACM International Symposium on. IEEE, 2012, pp. 413--424. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. COMPASS: A Framework for Automated Performance Modeling and Prediction

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      ICS '15: Proceedings of the 29th ACM on International Conference on Supercomputing
      June 2015
      446 pages
      ISBN:9781450335591
      DOI:10.1145/2751205

      Copyright © 2015 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 8 June 2015

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      ICS '15 Paper Acceptance Rate40of160submissions,25%Overall Acceptance Rate584of2,055submissions,28%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader