skip to main content
10.1145/3407947.3407960acmotherconferencesArticle/Chapter ViewAbstractPublication Pageshp3cConference Proceedingsconference-collections
research-article

Parallelization and Optimization of a Combustion Simulation Application on GPU Platform

Authors Info & Claims
Published:06 August 2020Publication History

ABSTRACT

TURF sim (Target Unsteady Reacting Flow simulation) is a CFD application that solves engine combustion problems on structured grids. In this paper, PGI CUDA Fortran is used to implement the CPU + GPU heterogeneous parallelization. To reduce the data transfer overhead between the CPU and GPU and the MPI communication overhead, we design a packing/unpacking based method to decrease the volume of data transferred and the number of data transfer, and use the Pinned memory to improve data transfer rate. To overlap the CPU computation, GPU computation and CPU-GPU data transfer, we reorganize the computing process, and utilize the asynchronous data transfer and the stream processing technology. We also employ loop transformations to optimize the register utilization and task partitions on the GPU. Performance evaluation is performed on two GPU-based platforms. The experimental results show that our optimization techniques for GPU are effective and the resulting GPU version significantly outperforms the original pure CPU version. The GPU version, when using a NVIDIA GTX1060 GPU, achieves a 2.96X performance acceleration over the pure CPU version runs on an Intel i7-8700 CPU (6-cores). When using a NVIDIA V100 GPU, the GPU version achieves a 1.88X performance acceleration over the pure CPU version runs on two Intel Xeon Skylake Gold CPUs (24-cores).

References

  1. Anderson, W.K., Gropp, W.D., Kaushik, D.K., Keyes, D.E. and Smith, B.F. (1999) Achieving High Sustained Performance in an Unstructured Mesh CFD Application. Proc. 1999 ACM/IEEE Conf. on Supercomputing, Portland, OR, USA, November 13-18, p. 69. ACM, New York, USA.Google ScholarGoogle Scholar
  2. Gabriel Staffelbach. High performance computing for combustion applications. Proceedings of the 2006 ACM/IEEE conference on Supercomputing (SC 06). Tampa, Florida, November 11-17, 2006. Article No. 56.Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Andres, E., Widhalm, M. and Caloto, A. (2009) Achieving High Speed CFD Simulations: Optimization, Parallelization, and FPGA Acceleration for the Unstructured DLR TAU Code. Proc. 47th AIAA Aerospace Sciences Meeting Including The New Horizons Forum and Aerospace Exposition, Orlando, FL, January 5-8, pp. 8745--8764. AIAA, VA, USAGoogle ScholarGoogle Scholar
  4. Ji-dong ZHAI, Wen-guang CHEN. Avision of post-exascale programming. Frontiers of Information Technology & Electronic Engineering. 2018 19(10): 1261--1266.Google ScholarGoogle Scholar
  5. Stan POSEY, Simon SEE and Michael WANG. GPU Progress and Directions in Applied CFD. Eleventh International Conference on CFD in the Minerals and Process Industries CSIRO, Melbourne, Australia 7-9 December 2015: 1--6.Google ScholarGoogle Scholar
  6. Menshov, I., Pavlukhin, P. Highly scalable implementation of an implicit matrix-free solver for gas dynamics on GPU-accelerated clusters. J Supercomput 73, 631--638 (2017) doi:10.1007/s11227-016-1800-1.Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Xiangke Liao, Liquan Xiao, Canqun Yang, Yutong Lu:Mlky Way-2 supercomputer: system and application[J]. Frontiers of Computer Science.8(3): 345--356(2014).Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Xu Chuanfu, Deng Xiaogang, et al. Collaborating CPU and GPU for large-scale high-order CFD simulations with complex grids on the TianHe-1A supercomputer. JOURNAL OF COMPUTATIONAL PHYSICS, 2014, 278: 275--297Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Zifei Ji, Bing Wang and Huiqiang Zhang. Steady State Characteristics of Scramjet Engines Using Hydrogen for High Mach Numbers. International Space Planes and Hypersonic Systems and Technologies Conferences. 6-9 March 2017, Xiamen, China.Google ScholarGoogle Scholar
  10. Hengjie Guo, Yanfei Li, Bo Wang, Huiqiang Zhang, Hongming Xu. Numerical investigation on flashing jet behaviors of single-hole GDI injector. International Journal of Heat and Mass Transfer. 130 (2019) 50-59.Google ScholarGoogle ScholarCross RefCross Ref
  11. Yonggang Che, Meifang Yang, Chuanfu Xu, Yutong Lu. Petascale scramjet combustion simulation on the Tianhe-2 heterogeneous supercomputer. Parallel Computing. 2018, 77: 101--117.Google ScholarGoogle ScholarCross RefCross Ref
  12. Hemanth Kolla, Jacqueline H. Chen. Turbulent Combustion Simulations with High-Performance Computing. Energy, Environment, and Sustainability, 2018: 73--97.Google ScholarGoogle Scholar

Index Terms

  1. Parallelization and Optimization of a Combustion Simulation Application on GPU Platform

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      HP3C 2020: Proceedings of the 2020 4th International Conference on High Performance Compilation, Computing and Communications
      June 2020
      191 pages
      ISBN:9781450376914
      DOI:10.1145/3407947

      Copyright © 2020 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 6 August 2020

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
      • Research
      • Refereed limited
    • Article Metrics

      • Downloads (Last 12 months)14
      • Downloads (Last 6 weeks)3

      Other Metrics

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader