skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Assessing a mini-application as a performance proxy for a finite element method engineering application

Journal Article · · Concurrency and Computation. Practice and Experience
DOI:https://doi.org/10.1002/cpe.3587· OSTI ID:1061183

The performance of a large-scale, production-quality science and engineering application (‘app’) is often dominated by a small subset of the code. Even within that subset, computational and data access patterns are often repeated, so that an even smaller portion can represent the performance-impacting features. If application developers, parallel computing experts, and computer architects can together identify this representative subset and then develop a small mini-application (‘miniapp’) that can capture these primary performance characteristics, then this miniapp can be used to both improve the performance of the app as well as provide a tool for co-design for the high-performance computing community. However, a critical question is whether a miniapp can effectively capture key performance behavior of an app. This study provides a comparison of an implicit finite element semiconductor device modeling app on unstructured meshes with an implicit finite element miniapp on unstructured meshes. The goal is to assess whether the miniapp is predictive of the performance of the app. Finally, single compute node performance will be compared, as well as scaling up to 16,000 cores. Results indicate that the miniapp can be reasonably predictive of the performance characteristics of the app for a single iteration of the solver on a single compute node.

Research Organization:
Sandia National Lab. (SNL-NM), Albuquerque, NM (United States)
Sponsoring Organization:
USDOE National Nuclear Security Administration (NNSA)
Grant/Contract Number:
AC04-94AL85000
OSTI ID:
1061183
Report Number(s):
SAND2012-7956J
Journal Information:
Concurrency and Computation. Practice and Experience, Vol. 27, Issue 17; ISSN 1532-0626
Publisher:
WileyCopyright Statement
Country of Publication:
United States
Language:
English
Citation Metrics:
Cited by: 4 works
Citation information provided by
Web of Science

References (15)

S12---The HPC Challenge (HPCC) benchmark suite conference January 2006
The LINPACK Benchmark: past, present and future
  • Dongarra, Jack J.; Luszczek, Piotr; Petitet, Antoine
  • Concurrency and Computation: Practice and Experience, Vol. 15, Issue 9 https://doi.org/10.1002/cpe.728
journal January 2003
Exascale design space exploration and co-design journal January 2014
Assessing the role of mini-applications in predicting key performance characteristics of scientific and engineering applications journal January 2015
Quantifying Architectural Requirements of Contemporary Extreme-Scale Scientific Applications book January 2014
Performance of a parallel algebraic multilevel preconditioner for stabilized finite element semiconductor device modeling journal September 2009
Preconditioning Techniques for Large Linear Systems: A Survey journal November 2002
Improving multigrid performance for unstructured mesh drift-diffusion simulations on 147,000 cores: IMPROVING MULTIGRID PERFORMANCE FOR UNSTRUCTURED MESH DRIFT-DIFFUSION journal May 2012
An overview of the Trilinos project journal September 2005
Efficient Expression Templates for Operator Overloading-Based Automatic Differentiation book January 2012
Characterization of Scientific Workloads on Systems with Multi-Core Processors conference October 2006
Towards large-scale multi-socket, multicore parallel simulations: Performance of an MPI-only semiconductor device simulator journal September 2010
Navigating an Evolutionary Fast Path to Exascale
  • Barrett, R. F.; Hammond, S. D.; Vaughan, C. T.
  • 2012 SC Companion: High Performance Computing, Networking, Storage and Analysis (SCC), 2012 SC Companion: High Performance Computing, Networking Storage and Analysis https://doi.org/10.1109/SC.Companion.2012.55
conference November 2012
GMRES: A Generalized Minimal Residual Algorithm for Solving Nonsymmetric Linear Systems journal July 1986
Bi-CGSTAB: A Fast and Smoothly Converging Variant of Bi-CG for the Solution of Nonsymmetric Linear Systems journal March 1992

Cited By (1)

Performance drop at executing communication-intensive parallel algorithms journal January 2020