Accelerating the Conjugate Gradient Algorithm with GPUs in CFD Simulations

Anzt, Hartwig; Baboulin, Marc; Dongarra, Jack; Fournier, Yvan; Hulsemann, Frank; Khabou, Amal; Wang, Yushan

doi:10.1007/978-3-319-61982-8_5

Hartwig Anzt¹⁷,
Marc Baboulin¹⁸,
Jack Dongarra¹⁷,
Yvan Fournier¹⁹,
Frank Hulsemann¹⁹,
Amal Khabou¹⁸ &
…
Yushan Wang¹⁸

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 10150))

Included in the following conference series:

International Conference on Vector and Parallel Processing

489 Accesses
2 Citations

Abstract

This paper illustrates how GPU computing can be used to accelerate computational fluid dynamics (CFD) simulations. For sparse linear systems arising from finite volume discretization, we evaluate and optimize the performance of Conjugate Gradient (CG) routines designed for manycore accelerators and compare against an industrial CPU-based implementation. We also investigate how the recent advances in preconditioning, such as iterative Incomplete Cholesky (IC, as symmetric case of ILU) preconditioning, match the requirements for solving real world problems.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

GPU Acceleration of the FINE/FR CFD Solver in a Heterogeneous Environment with OpenACC Directives

Multi GPU Implementation to Accelerate the CFD Simulation of a 3D Turbo-Machinery Benchmark Using the RapidCFD Library

Acceleration of Turbomachinery Steady Simulations on GPU

References

Aliaga, J.I., Pérez, J., Quintana-Ortí, E.S.: Systematic fusion of CUDA kernels for iterative sparse linear system solvers. In: Träff, J.L., Hunold, S., Versaci, F. (eds.) Euro-Par 2015. LNCS, vol. 9233, pp. 675–686. Springer, Heidelberg (2015). doi:10.1007/978-3-662-48096-0_52
Chapter Google Scholar
Aliaga, J.I., Perez, J., Quintana-Orti, E.S., Anzt, H.: Reformulated conjugate gradient for the energy-aware solution of linear systems on GPUs. In: 2013 42nd International Conference on Parallel Processing (ICPP), pp. 320–329, October 2013
Google Scholar
Anzt, H., Chow, E., Dongarra, J.: Iterative sparse triangular solves for preconditioning. In: Träff, J.L., Hunold, S., Versaci, F. (eds.) Euro-Par 2015. LNCS, vol. 9233, pp. 650–661. Springer, Heidelberg (2015). doi:10.1007/978-3-662-48096-0_50
Chapter Google Scholar
Anzt, H., Tomov, S., Dongarra, J.: Energy efficiency and performance frontiers for sparse computations on GPU supercomputers. In: Proceedings of the Sixth International Workshop on Programming Models and Applications for Multicores and Manycores, PMAM 2015, pp. 1–10. ACM, New York (2015)
Google Scholar
Archambeau, F., Méchitoua, N., Sakiz, M.: Code Saturne: A Finite Volume Code for the computation of turbulent incompressible flows - Industrial Applications. Int. J. Finite 1(1) (2004)
Google Scholar
Chow, E., Anzt, H., Dongarra, J.: Asynchronous iterative algorithm for computing incomplete factorizations on GPUs. In: Kunkel, J.M., Ludwig, T. (eds.) ISC High Performance 2015. LNCS, vol. 9137, pp. 1–16. Springer, Cham (2015). doi:10.1007/978-3-319-20119-1_1
Chapter Google Scholar
Chow, E., Patel, A.: Fine-grained parallel incomplete LU factorization. SIAM J. Sci. Comput. 37, C169–C193 (2015)
Article MathSciNet MATH Google Scholar
MAGMA Web page. http://icl.cs.utk.edu/magma/index.html
NVIDIA Corporation. CUDA C best practices guide. http://docs.nvidia.com/cuda/cuda-c-best-practices-guide/
NVIDIA Corporation. CUDA Toolkit Documentation v7.5, September 2015
Google Scholar
Rupp, K., Rudolf, F., Weinbub, J.: ViennaCL - a high level linear algebra library for GPUs and multi-core CPUs. In: International Workshop on GPUs and Scientific Applications, pp. 51–56 (2010)
Google Scholar
Saad, Y.: Iterative Methods for Sparse Linear Systems. Society for Industrial and Applied Mathematics, Philadelphia (2003)
Book MATH Google Scholar

Download references

Acknowledgements

This work was funded by the contract P02220 between Université Paris-Sud and EDF. We are grateful to Karl Rupp (TU Wien) for his support in using the ViennaCL library.

Author information

Authors and Affiliations

Innovative Computing Laboratory, University of Tennessee, Knoxville, USA
Hartwig Anzt & Jack Dongarra
Laboratoire de Recherche en Informatique, Université Paris-Sud, Orsay, France
Marc Baboulin, Amal Khabou & Yushan Wang
EDF R&D, Clamart, France
Yvan Fournier & Frank Hulsemann

Authors

Hartwig Anzt
View author publications
You can also search for this author in PubMed Google Scholar
Marc Baboulin
View author publications
You can also search for this author in PubMed Google Scholar
Jack Dongarra
View author publications
You can also search for this author in PubMed Google Scholar
Yvan Fournier
View author publications
You can also search for this author in PubMed Google Scholar
Frank Hulsemann
View author publications
You can also search for this author in PubMed Google Scholar
Amal Khabou
View author publications
You can also search for this author in PubMed Google Scholar
Yushan Wang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Amal Khabou .

Editor information

Editors and Affiliations

University of Porto, Porto, Portugal
Inês Dutra
University of Porto, Porto, Portugal
Rui Camacho
University of Porto, Porto, Portugal
Jorge Barbosa
Lawrence Berkeley National Laboratory, Berkeley, California, USA
Osni Marques

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Anzt, H. et al. (2017). Accelerating the Conjugate Gradient Algorithm with GPUs in CFD Simulations. In: Dutra, I., Camacho, R., Barbosa, J., Marques, O. (eds) High Performance Computing for Computational Science – VECPAR 2016. VECPAR 2016. Lecture Notes in Computer Science(), vol 10150. Springer, Cham. https://doi.org/10.1007/978-3-319-61982-8_5

Download citation

DOI: https://doi.org/10.1007/978-3-319-61982-8_5
Published: 14 July 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-61981-1
Online ISBN: 978-3-319-61982-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics