skip to main content
10.1145/1964179.1964197acmotherconferencesArticle/Chapter ViewAbstractPublication PagesgpgpuConference Proceedingsconference-collections
research-article

Unstructured grid applications on GPU: performance analysis and improvement

Published:05 March 2011Publication History

ABSTRACT

Performance of applications running on GPUs is mainly affected by hardware occupancy and global memory latency. Scientific applications that rely on analysis using unstructured grids could benefit from the high performance capabilities provided by GPUs, however, its memory access pattern and algorithm limit the potential benefits.

In this paper we analyze the algorithm for unstructured grid analysis on the basis of hardware occupancy and memory access efficiency. In general, the algorithm can be divided into three stages: cell-oriented analysis, edge-oriented analysis and information update, which present different memory access patterns. Based on the analysis we modify the algorithm to make it suitable for GPUs. The proposed algorithm aims for high hardware occupancy and efficient global memory access. Finally, through implementation we show that our design achieves up to 88 times speedup compared to the sequential CPU version.

References

  1. Owens, J. D, Houston, M., Luebke, D., Green, S., Stone, J. E., Phillips, J. C., GPU Computing, Processdings of the IEEE, Vol. 95(5) pp. 879--899, 2008.Google ScholarGoogle ScholarCross RefCross Ref
  2. NVidia, NVIDIA CUDA Programming Guide v.2.3.1, Aug. 2009.Google ScholarGoogle Scholar
  3. Owens, J. D, Luebke, D., Govindaraju, N., Harris, M., Kruger, J., Lefohn, A. E., Purcell, T., A Survey of General-Purpose Computation on Graphics Hardware, Computer Graphics Forum, Vol. 26(1) pp. 80--113, Mar. 2007.Google ScholarGoogle ScholarCross RefCross Ref
  4. Hu, H., Turner, E., Parallel CFD Computing Using Shared Memory OpenMP, Lecture Notes on Computer Science, pp. 1137--1146, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Mavriplis, D. J., Unstructured Grid Techniques, Anual Review of Fluid Mechanics, Vol. 29, pp. 473--514, Jan, 1997.Google ScholarGoogle ScholarCross RefCross Ref
  6. Kaushik, D. K., Keyes, D. E., Efficient Parallelization of an Unstructured Grid Solver: A Memory-Centric Approach, Istambul Technical University, 1999.Google ScholarGoogle Scholar
  7. Asanovic, K., Bodik, R., Catanzaro, B., Gebis, J., Husbands, P., Keutzer, K., Patterson, D., Plischker, W., Shalf, J., Williams, S., Yelick, K., The Landscape of Parallel Computing Research: A View from Berkeley, Electrical Engineering and Computer Sciences, University of California, Berkeley, Technical Report No. UCB/EECS-2006-183, Dec. 18, 2006Google ScholarGoogle Scholar
  8. Corrigan, A., Camelli, F., Rainald, L., Running Unstructured Grid Based CFD Solvers on Modern Graphics Hardware, 19th AIAA Computational Fluid Dynamics, Jun, 2009.Google ScholarGoogle Scholar
  9. Guo, W., Jin, C., Jianhua, Li., High performance lattice Boltzmann algorithms for fluid flows, International Symposium on Information Science and Engineering, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Nickolls, J., Dally, W., The GPU Computing Era, Micro, IEEE, Vol: 30, 2, pp: 56--69, Mar. 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Wang, Z. J., Gao, H., A unifying lifting collocation penalty formulation including the discontinuous Galerkin, spectral volume/difference methods for conservation laws on mixed grids, Journal of Computational Physics, Vol: 228, 21, Nov. 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Unstructured grid applications on GPU: performance analysis and improvement

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in
          • Published in

            cover image ACM Other conferences
            GPGPU-4: Proceedings of the Fourth Workshop on General Purpose Processing on Graphics Processing Units
            March 2011
            101 pages
            ISBN:9781450305693
            DOI:10.1145/1964179

            Copyright © 2011 ACM

            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 5 March 2011

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • research-article

            Acceptance Rates

            Overall Acceptance Rate57of129submissions,44%

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader