skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Porting a 3D seismic modeling code (SW4) to CORAL machines

Journal Article · · IBM Journal of Research and Development
 [1];  [1];  [1]
  1. Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States)

Seismic waves fourth order (SW4) solves the seismic wave equations on Cartesian and curvilinear grids using large compute clusters with O (100,000) cores. In this article we discuss the porting of SW4 to run on the CORAL architecture using the RAJA performance portability abstraction layer. The performances of key kernels using RAJA and CUDA are compared to estimate the performance penalty of using the portability abstraction layer. Code changes required for efficiency on GPUs and minimizing time spent in Message Passing Interface (MPI) are discussed. This article describes a path for efficiently porting large code bases to GPU-based machines while avoiding the pitfalls of a new architecture in the early stages of its deployment. Current bottlenecks in the code are discussed along with possible architectural or software mitigations. SW4 runs 28× faster on one 4-GPU CORAL node than on a CTS-1 node (Dual Intel Xeon E5-2695 v4). SW4 is now in routine use on problems of unprecedented resolution (203 billion grid points) and scale on 1,200 nodes of Summit.

Research Organization:
Lawrence Livermore National Laboratory (LLNL), Livermore, CA (United States)
Sponsoring Organization:
USDOE National Nuclear Security Administration (NNSA)
Grant/Contract Number:
AC52-07NA27344
OSTI ID:
1762887
Report Number(s):
LLNL-JRNL-787242; 984005
Journal Information:
IBM Journal of Research and Development, Vol. 64, Issue 3/4; ISSN 0018-8646
Publisher:
IEEECopyright Statement
Country of Publication:
United States
Language:
English

References (12)

Toward Exascale Earthquake Ground Motion Simulations for Near-Fault Engineering Analysis journal January 2017
Caliper: Performance Introspection for HPC Software Stacks
  • Boehme, David; Gamblin, Todd; Beckingsale, David
  • SC16: International Conference for High Performance Computing, Networking, Storage and Analysis https://doi.org/10.1109/SC.2016.46
conference November 2016
Broadband (0–5 Hz) Fully Deterministic 3D Ground‐Motion Simulations of a Magnitude 7.0 Hayward Fault Earthquake: Comparison with Empirical Ground‐Motion Models and 3D Path and Site Effects from Source Normalized Intensities journal February 2019
A first order approximation to the optimum checkpoint interval journal September 1974
Simulating the Weak Death of the Neutron in a Femtoscale Universe with Near-Exascale Computing conference November 2018
Direct simulation Monte Carlo on petaflop supercomputers and beyond journal August 2019
Preparation and optimization of a diverse workload for a large-scale heterogeneous system
  • Karlin, Ian; Park, Yoonho; de Supinski, Bronis R.
  • SC '19: The International Conference for High Performance Computing, Networking, Storage, and Analysis, Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis https://doi.org/10.1145/3295500.3356192
conference November 2019
A Fourth Order Accurate Finite Difference Scheme for the Elastic Wave Equation in Second Order Formulation journal September 2011
Wave propagation in anisotropic elastic materials and curvilinear coordinates using a summation-by-parts finite difference method journal October 2015
Accelerated application development: The ORNL Titan experience journal August 2015
Super-Grid Modeling of the Elastic Wave Equation in Semi-Bounded Domains journal October 2014
An evaluation of the CORAL interconnects
  • Zimmer, Christopher; Atchley, Scott; Pankajakshan, Ramesh
  • SC '19: The International Conference for High Performance Computing, Networking, Storage, and Analysis, Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis https://doi.org/10.1145/3295500.3356166
conference November 2019

Cited By (1)

Inter-loop optimization in RAJA using loop chains conference June 2021

Similar Records

Porting the WAVEWATCH III (v6.07) wave action source terms to GPU
Journal Article · Fri Mar 03 00:00:00 EST 2023 · Geoscientific Model Development (Online) · OSTI ID:1762887

Graphics processing unit accelerated phase field dislocation dynamics: Application to bi-metallic interfaces
Journal Article · Sat Oct 14 00:00:00 EDT 2017 · Advances in Engineering Software · OSTI ID:1762887

Exploring utilization options of heterogeneous architectures for multi-physics simulations
Journal Article · Wed May 22 00:00:00 EDT 2019 · Parallel Computing · OSTI ID:1762887