Porting a 3D seismic modeling code (SW4) to CORAL machines
- Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States)
Seismic waves fourth order (SW4) solves the seismic wave equations on Cartesian and curvilinear grids using large compute clusters with O(100,000) cores. In this article we discuss the porting of SW4 to run on the CORAL architecture using the RAJA performance portability abstraction layer. The performance of key kernels written in RAJA and in CUDA is compared to estimate the performance penalty of using the portability abstraction layer. Code changes required for efficiency on GPUs and for minimizing time spent in the Message Passing Interface (MPI) are discussed. This article describes a path for efficiently porting large code bases to GPU-based machines while avoiding the pitfalls of a new architecture in the early stages of its deployment. Current bottlenecks in the code are discussed along with possible architectural or software mitigations. SW4 runs 28× faster on one 4-GPU CORAL node than on a CTS-1 node (dual Intel Xeon E5-2695 v4). SW4 is now in routine use on problems of unprecedented resolution (203 billion grid points) and scale on 1,200 nodes of Summit.
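The idea behind a portability abstraction layer can be illustrated with a minimal C++ sketch: the loop body is written once as a lambda, and an execution-policy tag selects how the iteration space is traversed. Everything below is an assumption made for illustration. The names `seq_policy`, `blocked_policy`, `forall`, and `second_derivative` are hypothetical stand-ins and are not the RAJA API, and the kernel is a generic fourth-order centered difference for a 1D second derivative, not SW4's actual operator.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical execution-policy tags (illustrative stand-ins, not RAJA types).
struct seq_policy {};
struct blocked_policy { static constexpr std::size_t block = 64; };

// Sequential traversal of [begin, end).
template <typename Body>
void forall(seq_policy, std::size_t begin, std::size_t end, Body body) {
    for (std::size_t i = begin; i < end; ++i) body(i);
}

// Blocked traversal of [begin, end): loosely mimics how a GPU back end
// would split the range into thread blocks, but runs on the host here.
template <typename Body>
void forall(blocked_policy, std::size_t begin, std::size_t end, Body body) {
    for (std::size_t b = begin; b < end; b += blocked_policy::block) {
        const std::size_t stop = std::min(end, b + blocked_policy::block);
        for (std::size_t i = b; i < stop; ++i) body(i);
    }
}

// Generic fourth-order centered difference for d2u/dx2 on a uniform grid
// with spacing h. Boundary points (two on each side) are left at zero.
// The kernel body is written once; Policy picks the traversal.
template <typename Policy>
std::vector<double> second_derivative(const std::vector<double>& u, double h) {
    assert(h > 0.0);
    std::vector<double> d2u(u.size(), 0.0);
    if (u.size() < 5) return d2u;  // stencil needs five points

    const double scale = 1.0 / (12.0 * h * h);
    const double* in = u.data();
    double* out = d2u.data();

    forall(Policy{}, std::size_t{2}, u.size() - 2, [=](std::size_t i) {
        out[i] = scale * (-in[i - 2] + 16.0 * in[i - 1] - 30.0 * in[i]
                          + 16.0 * in[i + 1] - in[i + 2]);
    });
    return d2u;
}
```

Calling `second_derivative<seq_policy>(u, h)` and `second_derivative<blocked_policy>(u, h)` runs the same kernel body under two traversals. Comparing timings of such a policy-driven kernel against a hand-written one is, in spirit, how the overhead of an abstraction layer can be estimated.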
- Research Organization:
- Lawrence Livermore National Laboratory (LLNL), Livermore, CA (United States)
- Sponsoring Organization:
- USDOE National Nuclear Security Administration (NNSA)
- Grant/Contract Number:
- AC52-07NA27344
- OSTI ID:
- 1762887
- Report Number(s):
- LLNL-JRNL-787242; 984005
- Journal Information:
- IBM Journal of Research and Development, Vol. 64, Issue 3/4; ISSN 0018-8646
- Publisher:
- IEEE
- Country of Publication:
- United States
- Language:
- English