research-article

Unstructured grid applications on GPU: performance analysis and improvement

Authors:
Lizandro Solano-Quinde

Iowa State University, Ames, IA

Iowa State University, Ames, IA
View Profile

,
Zhi Jian Wang

Iowa State University, Ames, IA

Iowa State University, Ames, IA
View Profile

,
Brett Bode

University of Illinois, Urbana, IL

University of Illinois, Urbana, IL
View Profile

,
Arun K. Somani

Iowa State University, Ames, IA

Iowa State University, Ames, IA
View Profile

GPGPU-4: Proceedings of the Fourth Workshop on General Purpose Processing on Graphics Processing UnitsMarch 2011Article No.: 13Pages 1–8https://doi.org/10.1145/1964179.1964197

Published:05 March 2011Publication History

GPGPU-4: Proceedings of the Fourth Workshop on General Purpose Processing on Graphics Processing Units

Pages 1–8

ABSTRACT

Performance of applications running on GPUs is mainly affected by hardware occupancy and global memory latency. Scientific applications that rely on analysis using unstructured grids could benefit from the high performance capabilities provided by GPUs, however, its memory access pattern and algorithm limit the potential benefits.

In this paper we analyze the algorithm for unstructured grid analysis on the basis of hardware occupancy and memory access efficiency. In general, the algorithm can be divided into three stages: cell-oriented analysis, edge-oriented analysis and information update, which present different memory access patterns. Based on the analysis we modify the algorithm to make it suitable for GPUs. The proposed algorithm aims for high hardware occupancy and efficient global memory access. Finally, through implementation we show that our design achieves up to 88 times speedup compared to the sequential CPU version.

References

Owens, J. D, Houston, M., Luebke, D., Green, S., Stone, J. E., Phillips, J. C., GPU Computing, Processdings of the IEEE, Vol. 95(5) pp. 879--899, 2008.Google ScholarCross Ref
NVidia, NVIDIA CUDA Programming Guide v.2.3.1, Aug. 2009.Google Scholar
Owens, J. D, Luebke, D., Govindaraju, N., Harris, M., Kruger, J., Lefohn, A. E., Purcell, T., A Survey of General-Purpose Computation on Graphics Hardware, Computer Graphics Forum, Vol. 26(1) pp. 80--113, Mar. 2007.Google ScholarCross Ref
Hu, H., Turner, E., Parallel CFD Computing Using Shared Memory OpenMP, Lecture Notes on Computer Science, pp. 1137--1146, 2001. Google ScholarDigital Library
Mavriplis, D. J., Unstructured Grid Techniques, Anual Review of Fluid Mechanics, Vol. 29, pp. 473--514, Jan, 1997.Google ScholarCross Ref
Kaushik, D. K., Keyes, D. E., Efficient Parallelization of an Unstructured Grid Solver: A Memory-Centric Approach, Istambul Technical University, 1999.Google Scholar
Asanovic, K., Bodik, R., Catanzaro, B., Gebis, J., Husbands, P., Keutzer, K., Patterson, D., Plischker, W., Shalf, J., Williams, S., Yelick, K., The Landscape of Parallel Computing Research: A View from Berkeley, Electrical Engineering and Computer Sciences, University of California, Berkeley, Technical Report No. UCB/EECS-2006-183, Dec. 18, 2006Google Scholar
Corrigan, A., Camelli, F., Rainald, L., Running Unstructured Grid Based CFD Solvers on Modern Graphics Hardware, 19th AIAA Computational Fluid Dynamics, Jun, 2009.Google Scholar
Guo, W., Jin, C., Jianhua, Li., High performance lattice Boltzmann algorithms for fluid flows, International Symposium on Information Science and Engineering, 2008. Google ScholarDigital Library
Nickolls, J., Dally, W., The GPU Computing Era, Micro, IEEE, Vol: 30, 2, pp: 56--69, Mar. 2010. Google ScholarDigital Library
Wang, Z. J., Gao, H., A unifying lifting collocation penalty formulation including the discontinuous Galerkin, spectral volume/difference methods for conservation laws on mixed grids, Journal of Computational Physics, Vol: 228, 21, Nov. 2009. Google ScholarDigital Library

Index Terms

Unstructured grid applications on GPU: performance analysis and improvement

Recommendations

Vectorizing Unstructured Mesh Computations for Many-core Architectures
PMAM'14: Proceedings of Programming Models and Applications on Multicores and Manycores

Achieving optimal performance on the latest multi-core and many-core architectures depends more and more on making efficient use of the hardware's vector processing capabilities. While auto-vectorizing compilers do not require the use of vector ...
Read More
Vectorizing Unstructured Mesh Computations for Many-core Architectures
PMAM'14: Proceedings of Programming Models and Applications on Multicores and Manycores

Achieving optimal performance on the latest multi-core and many-core architectures depends more and more on making efficient use of the hardware's vector processing capabilities. While auto-vectorizing compilers do not require the use of vector ...
Read More
A performance study of general-purpose applications on graphics processors using CUDA

Graphics processors (GPUs) provide a vast number of simple, data-parallel, deeply multithreaded cores and high memory bandwidths. GPU architectures are becoming increasingly programmable, offering the potential for dramatic speedups for a variety of ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in

GPGPU-4: Proceedings of the Fourth Workshop on General Purpose Processing on Graphics Processing Units
March 2011
101 pages
ISBN:9781450305693
DOI:10.1145/1964179

Copyright © 2011 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 5 March 2011
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
CUDA
GPGPU
GPU
unstructured grid
Qualifiers
- research-article
Conference

Acceptance Rates
Overall Acceptance Rate57of129submissions,44%
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 17
  Total Citations
  View Citations
- 371
  Total Downloads
- Downloads (Last 12 months)10
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Unstructured grid applications on GPU: performance analysis and improvement

GPGPU-4: Proceedings of the Fourth Workshop on General Purpose Processing on Graphics Processing Units

ABSTRACT

References

Cited By

Index Terms

Recommendations

Vectorizing Unstructured Mesh Computations for Many-core Architectures

Vectorizing Unstructured Mesh Computations for Many-core Architectures

A performance study of general-purpose applications on graphics processors using CUDA

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Unstructured grid applications on GPU: performance analysis and improvement

GPGPU-4: Proceedings of the Fourth Workshop on General Purpose Processing on Graphics Processing Units

ABSTRACT

References

Cited By

Index Terms

Recommendations

Vectorizing Unstructured Mesh Computations for Many-core Architectures

Vectorizing Unstructured Mesh Computations for Many-core Architectures

A performance study of general-purpose applications on graphics processors using CUDA

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media