Optimization and performance evaluation of the IDR iterative Krylov solver on GPUs

Language
en
Document Type
Article
Issue Date
2020-04-21
Issue Year
2018
Authors
Anzt, Hartwig
Kreutzer, Moritz
Ponce, Eduardo
Peterson, Gregory D.
Wellein, Gerhard
Dongarra, Jack
Editor
Abstract

In this paper, we present an optimized GPU implementation for the induced dimension reduction algorithm. We improve data locality, combine it with an efficient sparse matrix vector kernel, and investigate the potential of overlapping computation with communication as well as the possibility of concurrent kernel execution. A comprehensive performance evaluation is conducted using a suitable performance model. The analysis reveals efficiency of up to 90%, which indicates that the implementation achieves performance close to the theoretically attainable bound.

Journal Title
The International Journal of High Performance Computing Applications
Volume
32
Issue
2
Citation
The International Journal of High Performance Computing Applications 32. 2 (2018): 220 - 230. <https://journals.sagepub.com/doi/full/10.1177/1094342016646844>
Zugehörige ORCIDs