
Title: On the arithmetic intensity of high-order finite-volume discretizations for hyperbolic systems of conservation laws

Journal Article · International Journal of High Performance Computing Applications
 [1];  [1]
  1. Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States)

It has been conjectured that higher-order discretizations for partial differential equations will have advantages over the lower-order counterparts commonly used today. The reasoning is that the increase in arithmetic operations will be more than offset by the reduction in data transfers and the increase in concurrent floating-point units. To evaluate this conjecture, the arithmetic intensity of a class of high-order finite-volume discretizations for hyperbolic systems of conservation laws is analyzed theoretically for spatial discretizations of orders three through eight in arbitrary dimensions. Three cache models are considered: the limiting cases of no cache and infinite cache, and a finite-sized cache model. The models are validated experimentally by measuring floating-point operations and data transfers on an IBM Blue Gene/Q node. Theory and experiments demonstrate that high-order finite-volume methods can provide the increases in arithmetic intensity needed to make better use of on-node floating-point capability.
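
For readers unfamiliar with the metric, the following is a minimal sketch of how arithmetic intensity (floating-point operations per byte of data transferred) is commonly computed, and how it feeds a roofline-style performance bound. It is not taken from the article: the function names, the use of the roofline bound, and every numeric value are assumptions chosen purely for illustration.

```python
"""Illustrative sketch: arithmetic intensity and a roofline-style bound.

All numeric values below are hypothetical placeholders, not measurements
from the article or from any particular machine.
"""


def arithmetic_intensity(flops: float, bytes_moved: float) -> float:
    """Return floating-point operations per byte of data transferred."""
    if bytes_moved <= 0:
        raise ValueError("bytes_moved must be positive")
    return flops / bytes_moved


def roofline_bound(intensity: float, peak_flops: float, peak_bandwidth: float) -> float:
    """Upper bound on attainable FLOP/s: the smaller of the compute peak
    and the rate at which memory bandwidth can feed the kernel."""
    return min(peak_flops, intensity * peak_bandwidth)


if __name__ == "__main__":
    # Hypothetical machine: 100 GFLOP/s peak compute, 25 GB/s memory bandwidth.
    peak_flops = 100.0e9
    peak_bandwidth = 25.0e9

    # Hypothetical per-cell costs (FLOPs, bytes) for two made-up kernels.
    kernels = [
        ("lower-order (hypothetical)", 50.0, 100.0),
        ("higher-order (hypothetical)", 800.0, 100.0),
    ]
    for label, flops, nbytes in kernels:
        ai = arithmetic_intensity(flops, nbytes)
        bound = roofline_bound(ai, peak_flops, peak_bandwidth)
        print(f"{label}: intensity = {ai:.2f} FLOP/byte, bound = {bound / 1e9:.1f} GFLOP/s")
```

Under these made-up numbers, the kernel with more operations per byte moves from the bandwidth-limited regime to the compute-limited one, which is the kind of effect the conjecture in the abstract refers to.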

Research Organization:
Lawrence Livermore National Laboratory (LLNL), Livermore, CA (United States)
Sponsoring Organization:
USDOE National Nuclear Security Administration (NNSA); USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR)
Grant/Contract Number:
AC52-07NA27344
OSTI ID:
1860892
Report Number(s):
LLNL-JRNL-716535; 859004
Journal Information:
International Journal of High Performance Computing Applications, Vol. 33, Issue 1; ISSN 1094-3420
Publisher:
SAGE
Country of Publication:
United States
Language:
English
