A dataparallel implementation of an explicit method for the three-dimensional compressible Navier-Stokes equations

https://doi.org/10.1016/0167-8191(90)90093-OGet rights and content

Abstract

The fluid flow in a three-dimensional twisted channel is modeled by both the compressible Navier-Stokes equations, and the Euler equations. A three stage Runge-Kutta method is used for integrating the system of equations in time. A second-order accurate, centered difference scheme is used for spatial derivatives of the flux variables. For both the Euler and the Navier-Stokes equations artificial viscosity introduced through fourth-order centered differences is used to stabilize the numeric scheme. By using lower order difference approximations on or close to the boundary than in the interior, the difference stencils can be evaluated at all grid points concurrently. A few different difference molecules for the boundaries, and different factorizations of the fourth-order difference operators were evaluated. With the appropriate factorization of the difference stencils, six variables per lattice point suffice for the evaluation of the difference stencils occurring in the code. The three fourth-order stencils we investigated, including three different factorizations of one of these stencils, account for three out these six variables. The convergence rate for all stencils and their factorizations is approximately the same for the first 1000–1500 steps at which point the residual has reached a value of 10−2–10−3. From this point on the convergence rate for one of the factorizations of the fourth-order stencil is approximately twice that of one of the unfactored stencils.

A performance of 1.05 Gflops/s was demonstrated on 65 536 processor Connection Machine system with 512 Mbytes of primary storage. The performance scales in proportion to the number of processors. The performance on 8k processor configurations was 135 Mflops/s, on 16k processors 265 Mflops/s and 525 Mflops/s on 32k processors. The efficiency is independent of the machine size. The evaluation of the boundary conditions accounted for less than 5% of the total time. A performance improvement by a factor of about three is expected with optimized implementations of functional kernels such as convolution, and matrix-vector multiplication.

References (28)

  • Lisp release notes

    (1987)
  • Programming in C

    (1987)
  • M.Y Chan

    Dilation-2 embeddings of grids into hypercubes

  • M.Y Chan

    Embeddings of 3-dimensional grids into optimal hypercubes

    (1988)
    M.Y Chan
  • M.M Denneau et al.

    The switching network of the TF-1 parallel supercomputer

    Supercomputing Mag.

    (1988)
  • J.J Dongarra et al.

    An extended set of Fortran basic linear algebra subprograms

  • R Enander et al.

    Numerical simulation of fluid flow in a twisted channel

  • L.E Eriksson

    Boundary conditions for artificial dissipation operators

  • B Gustafsson

    The convergence rate for differential approximations to general mixed initial boundary value problem

    SIAM J Numer. Anal.

    (1981)
  • I Havel et al.

    B-valuations of graphs

    Czech. Math. J.

    (1972)
  • W.D Henshaw et al.

    On the smallest scale for the incompressible Navier-Stokes equations

    ICASE Report No. 88-8

    (1988)
  • W.D Hillis

    The Connection machine

    (1985)
  • C.-T Ho et al.

    On the embedding of arbitrary meshes in Boolean cubes with expansion two dilation two

  • C.-T Ho et al.

    Embedding meshes in boolean cubes by graph decomposition

    J. Parallel Distr. Comput.

    (1989)
    C.-T Ho et al.
  • Cited by (12)

    • Implementation and performance of a data parallel algorithm for the dynamic subgrid-scale stress model on a massively parallel computer

      1996, American Society of Mechanical Engineers, Fluids Engineering Division (Publication) FED
    View all citing articles on Scopus

    This project was partially sponsored by the Swedish Board of Technical Development (STU), reg-no 61008767, by Jacob Letterstedts fond, The Royal Swedish Academy of Sciences, by the Office of Naval Research under Contract No. N00014-86-K-0310, and by the United States Air Force under contract AFOSR-89-0382.

    View full text