Skip to main content
Log in

A Novel Parallel Computing Strategy for Compact Difference Schemes with Consistent Accuracy and Dispersion

  • Published:
Journal of Scientific Computing Aims and scope Submit manuscript

Abstract

In this paper, based on the boundary approximation approach for parallelization of the compact difference schemes, a novel strategy for the sub-domain boundary approximation schemes is proposed to maintain consistent accuracy and dispersion with the compact scheme in the interior points. In this strategy, not only the order of accuracy of the sub-domain boundary scheme is the same as the interior scheme, but the coefficient of the first truncation error term is also equal to that of the internal scheme. Furthermore, to realize the consistent dispersion performance for a class of high order upwind compact schemes, which usually include two expressions, we modify the opposite expression to be the sub-domain boundary scheme. As an example of application, the present strategy is applied to a fourth-order upwind compact scheme, and its accuracy is verified by a numerical test. The resolution and efficiency of the newly proposed parallel method are examined by four numerical examples, including propagation of a wave-packet, convection of isentropic vortex, Rayleigh–Taylor instability problems, and propagation of Gauss pulse. The results obtained demonstrate that the present strategy for compact difference schemes has the feasibility to solve the flow problems with high accuracy, resolution and efficiency in parallel computation.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

Availability of Data and Materials

The data generated during and analyzed during the current study are available from the corresponding author on reasonable request.

References

  1. Lele, S.K.: Compact finite difference schemes with spectral-like resolution. J. Comput. Phys. 103(1), 16–42 (1992)

    Article  MathSciNet  Google Scholar 

  2. Bodony, D.J., Lele, S.K.: Current status of jet noise predictions using large-eddy simulation. AIAA J. 46(2), 364–380 (2008)

    Article  Google Scholar 

  3. Schaupp, C., Sesterhenn, J., Friedrich, R.: On a method for direct numerical simulation of shear layer/compression wave interaction for aeroacoustic investigations. Comput. Fluids 37(4), 463–474 (2008). https://doi.org/10.1016/j.compfluid.2007.02.007

    Article  MATH  Google Scholar 

  4. Fu, D.X., Ma, Y.W., Li, X.L.: Direct numerical simulation of three-dimensional Richtmyer–Meshkov instability. Chin. Phys. Lett. 25(1), 188–190 (2008). https://doi.org/10.1088/0256-307X/25/1/051

    Article  Google Scholar 

  5. Laizet, S., Lamballais, E.: High-order compact schemes for incompressible flows: a simple and efficient method with quasi-spectral accuracy. J. Comput. Phys. 228(16), 5989–6015 (2009). https://doi.org/10.1016/j.jcp.2009.05.010

    Article  MathSciNet  MATH  Google Scholar 

  6. Laizet, S., Li, N.: Incompact3d: a powerful tool to tackle turbulence problems with up to O(105) computational cores. Int. J. Numer. Methods Fluids 67(11), 1735–1757 (2011). https://doi.org/10.1002/fld.2480

    Article  MATH  Google Scholar 

  7. Tian, Z.F., Yu, P.X.: An efficient compact difference scheme for solving the streamfunction formulation of the incompressible NavierStokes equations. J. Comput. Phys. 230(17), 6404–6419 (2011). https://doi.org/10.1016/j.jcp.2010.12.031

    Article  MathSciNet  MATH  Google Scholar 

  8. Oguic, R., Viazzo, S., Poncet, S.: A parallelized multidomain compact solver for incompressible turbulent flows in cylindrical geometries. J. Comput. Phys. 300, 710–731 (2015). https://doi.org/10.1016/j.jcp.2015.08.003

    Article  MathSciNet  MATH  Google Scholar 

  9. Yu, P.X., Tian, Z.F., Zhang, H.: A rational high-order compact difference method for the steady-state stream functionvorticity formulation of the NavierStokes equations. Comput. Math. Appl. 73(7), 1461–1484 (2017). https://doi.org/10.1016/j.camwa.2017.01.024

    Article  MathSciNet  MATH  Google Scholar 

  10. Yu, P.X., Tian, Z.F.: An upwind compact difference scheme for solving the streamfunction-velocity formulation of the unsteady incompressible Navier-Stokes equation. Comput. Math. Appl. 75(9), 3224–3243 (2018). https://doi.org/10.1016/j.camwa.2018.01.043

    Article  MathSciNet  MATH  Google Scholar 

  11. Harten, A., Lax, P.D., Leer, B.V.: On upstream differencing and Godunov-type schemes for hyperbolic conservation laws. SIAM Rev. 25(1), 35–61 (1983)

    Article  MathSciNet  Google Scholar 

  12. Fu, D.X., Ma, Y.W.: High resolution scheme. In: Hafez, M., Oshima, K. (eds.) Computational Fluid Dynamics Review, pp. 234–250. Wiley, New York (1995)

    Google Scholar 

  13. Tolstykh, A.I., Lipavskii, M.V.: On performance of methods with third-and fifth-order compact upwind differencing. J. Comput. Phys. 140(2), 205–232 (1998)

    Article  MathSciNet  Google Scholar 

  14. Ma, Y.W., Fu, D.X., Kobayashi, T., Taniguchi, N.: Numerical solution of the incompressible Navier–Stokes equations with an upwind compact difference scheme. Int. J. Numer. Methods Fluids 30(5), 509–521 (1999)

    Article  MathSciNet  Google Scholar 

  15. Li, X.L., Ma, Y.W., Fu, D.X.: High efficient method for incompressible N-S equations and analysis of two-dimensional turbulent channel flow. Acta Mechanica Sinica (2001) (in Chinese)

  16. He, Z.W., Li, X.L., Fu, D.X., Ma, Y.W.: A 5th order monotonicity-preserving upwind compact difference scheme. Sci. China (Phys. Mech. Astron.) 54(3), 511–522 (2011)

    Article  Google Scholar 

  17. Tam, C.K., Webb, J.C.: Dispersion-relation-preserving finite difference schemes for computational acoustics. J. Comput. Phys. 107(2), 262–281 (1993)

    Article  MathSciNet  Google Scholar 

  18. Tian, Z.F., Liang, X., Yu, P.X.: A higher order compact finite difference algorithm for solving the incompressible Navier–Stokes equations. Int. J. Numer. Methods Eng. 88(6), 511–532 (2011). https://doi.org/10.1002/nme.3184

    Article  MathSciNet  MATH  Google Scholar 

  19. Qin, Q., Xia, Z.A., Tian, Z.F.: High accuracy numerical investigation of double-diffusive convection in a rectangular enclosure with horizontal temperature and concentration gradients. Int. J. Heat Mass Transf. 71, 405–423 (2014). https://doi.org/10.1016/j.ijheatmasstransfer.2013.12.035

    Article  Google Scholar 

  20. Sun, X.: Application and accuracy of the parallel diagonal dominant algorithm. Parallel Comput. 21(8), 1241–1267 (1995). https://doi.org/10.1016/0167-8191(95)00018-J

    Article  MathSciNet  MATH  Google Scholar 

  21. Gaitonde, D., Visbal, M.: Further development of a Navier–Stokes solution procedure based on higher-order formulas. In: 37th Aerospace Sciences Meeting and Exhibit 557 (1999)

  22. Povitsky, A., Morris, P.J.: A higher-order compact method in space and time based on parallel implementation of the Thomas algorithm. J. Comput. Phys. 161(1), 182–203 (2000). https://doi.org/10.1006/jcph.2000.6497

    Article  MathSciNet  MATH  Google Scholar 

  23. Ladeinde, F., Cai, X., Visbal, M.R., Gaitonde, D.: Parallel implementation of curvilinear high-order formulas. Int. J. Comput. Fluid Dyn. 17(6), 467–485 (2003). https://doi.org/10.1080/10618560310001615060

    Article  MATH  Google Scholar 

  24. Sengupta, T.K., Dipankar, A., Rao, A.K.: A new compact scheme for parallel computing using domain decomposition. J. Comput. Phys. 220(2), 654–677 (2007)

    Article  Google Scholar 

  25. Chao, J., Haselbacher, A., Balachandar, S.: A massively parallel multi-block hybrid compact-WENO scheme for compressible flows. J. Comput. Phys. 228(19), 7473–7491 (2009)

    Article  MathSciNet  Google Scholar 

  26. Kim, J.W., Sandberg, R.D.: Efficient parallel computing with a compact finite difference scheme. Comput. Fluids 58, 70–87 (2012)

    Article  MathSciNet  Google Scholar 

  27. Kim, J.W.: Quasi-disjoint pentadiagonal matrix systems for the parallelization of compact finite-difference schemes and filters. J. Comput. Phys. 241, 168–194 (2013). https://doi.org/10.1016/j.jcp.2013.01.046

    Article  MathSciNet  MATH  Google Scholar 

  28. Capuano, F., Mastellone, A.: Parallelization of compact finite-volume schemes for turbulent compressible flow. In: Proceedings of 27th International Conference on Parallel Computational Fluid Dynamics, 2015. (2015)

  29. Abide, S., Binous, M.S., Zeghmati, B.: An efficient parallel high-order compact scheme for the 3D incompressible Navier–Stokes equations. Int. J. Comput. Fluid Dyn. 31(4–5), 214–229 (2017). https://doi.org/10.1080/10618562.2017.1326592

    Article  MathSciNet  Google Scholar 

  30. Naik, N.H., Naik, V.K., Nicoules, M.: Parallelization of a class of implicit finite difference schemes in computational fluid dynamics. Int. J. High Speed Comput. 5(01), 1–50 (1993)

    Article  Google Scholar 

  31. Terekhov, A.V.: A highly scalable parallel algorithm for solving Toeplitz tridiagonal systems of linear equations. J. Parallel Distrib. Comput. 87, 102–108 (2016)

    Article  Google Scholar 

  32. Larsson, J., Gustafsson, B.: Stability criteria for hybrid difference methods. J. Comput. Phys. 227(5), 2886–2898 (2008)

    Article  MathSciNet  Google Scholar 

  33. Povitsky, A., Wolfshtein, M.: Multidomain implicit numerical scheme. Int. J. Numer. Methods Fluids 25(5), 547–566 (1997)

    Article  MathSciNet  Google Scholar 

  34. Yu, S., Tsai, Y., Hsieh, K.: Runge–Kutta methods combined with compact difference schemes for the unsteady Euler equations. In: 28th Joint Propulsion Conference and Exhibit, 19923210 (1992)

  35. Van Leer, B.: Flux-vector splitting for the Euler equations. In: Eighth International Conference on Numerical Methods in Fluid Dynamics, pp. 507-512. Springer, Berlin (1982)

  36. Sharp, D.H.: An overview of Rayleigh–Taylor instability. Physica D 12(1), 3–18 (1984). https://doi.org/10.1016/0167-2789(84)90510-4

    Article  MathSciNet  MATH  Google Scholar 

  37. Samtaney, R., Pullin, D.I.: On initial value and self similar solutions of the compressible Euler equations. Phys. Fluids 8(10), 2650–2655 (1996). https://doi.org/10.1063/1.869050

    Article  MathSciNet  MATH  Google Scholar 

  38. Shi, J., Zhang, Y., Shu, C.: Resolution of high order WENO schemes for complicated flow structures. J. Comput. Phys. 186(2), 690–696 (2003)

    Article  MathSciNet  Google Scholar 

  39. Waddell, J.T., Niederhaus, C.E., Jacobs, J.W.: Experimental study of Rayleigh—Taylor instability: low Atwood number liquid systems with single-mode initial perturbations. Phys. Fluids 13(5), 1263–1273 (2001). https://doi.org/10.1063/1.1359762

    Article  MATH  Google Scholar 

  40. Tian, Z.F.: A rational high-order compact ADI method for unsteady convection–diffusion equations. Comput. Phys. Commun. 182(3), 649–662 (2011). https://doi.org/10.1016/j.cpc.2010.11.013

    Article  MathSciNet  MATH  Google Scholar 

  41. Carpenter, M.H., Gottlieb, D., Abarbanel, S.: Stable and accurate boundary treatments for compact, high-order finite-difference schemes. Appl. Numer. Math. 12(1–3), 55–87 (1993)

    Article  MathSciNet  Google Scholar 

  42. Carpenter, M.H., Gottlieb, D., Abarbanel, S.: Time-stable boundary conditions for finite-difference schemes solving hyperbolic systems: methodology and application to high-order compact schemes. J. Comput. Phys. 111(2), 220–236 (1994)

    Article  MathSciNet  Google Scholar 

  43. Carpenter, M. H., Kennedy, C. A.: Fourth-order 2N-storage Runge–Kutta schemes Nasa Technical Memorandum, 1–26 (1994)

  44. Kennedy, C.A., Carpenter, M.H., Lewis, R.M.: Low-storage, explicit Runge–Kutta schemes for the compressible Navier–Stokes equations. Appl. Numer. Math. 35(3), 177–219 (2000)

    Article  MathSciNet  Google Scholar 

Download references

Acknowledgements

This work was supported in part by the National Natural Science Foundation of China under Grant 11502054 and 11872151, National Science and Technology Major Project, Ministry of Science and Technology of China, Grant/Award Number: 2017-II-0007-0021, Special Program for Applied Research on Super Computation of the NSFC-Guangdong Joint Fund (the fourth phase). The authors are grateful to the referees for helpful comments and suggestions which improved the quality of the paper considerably.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Peixiang Yu.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendices

Appendix A

The following is to apply the compact fourth-order central difference scheme in conjunction with sixth-order central difference scheme for parallel calculation of the first partial derivative in the same way as Sect. 2.2.3.

If Eq. (24) is assumed to be a sixth-order accuracy scheme, a sixth-order sub-domain boundary scheme at \(i=1\) can be constructed with the coefficients \(a_k\) solved as follow

$$\begin{aligned} \begin{matrix} {{a}_{-3}}=-\frac{{1}}{{60}}, &{} {{a}_{-2}} =\frac{{3}}{{20}}, &{} {{a}_{-1}}=-\frac{{3}}{{4}}, &{} {{a}_{1}}=\frac{{3}}{{4}}, &{} {{a}_{{2}}}=- \frac{{3}}{{20}}, &{} {{a}_{3}}=\frac{1}{{60}}. \\ \end{matrix} \end{aligned}$$
(74)

The expression of truncation errors is

$$\begin{aligned} {{F}_{1}^{\small {6th}}}=\left. \frac{\partial {{u}}}{\partial {{x}}}\right| _1-\frac{{{h}^{6}}}{140}\left. \frac{{{\partial }^{7}}{{u}}}{\partial {{x}}^{7}}\right| _1+O\left( {{h}^{8}} \right) . \end{aligned}$$
(75)

Comparisons of the effects that the sub-domain boundary scheme has on the \(L_{\infty }\)-norm errors between numerical results and exact results as well as the convergence order are shown in Table 8.

Table 8 Comparisons of the \(L_{\infty }\)-norm and the convergence order of serial computing method and the parallel computing setup with 3 and 8 processes using sixth-order central difference scheme as sub-domain boundary approximation. “Ratio” refers to the ratio of parallel computing results to serial computing results

In general, the parallel differencing systems with sixth-order sub-domain boundary scheme provide fourth-order accuracy throughout the domain. However, the \(L_{\infty }\)-norm errors calculated by the parallel algorithm using sixth-order sub-domain boundary scheme is always 1.2 times the serial results, while that of the parallel algorithm using fourth-order sub-domain boundary scheme with consistent coefficient of first truncation error term are almost equal to the serial results. According to the Eq. (21), since \(\alpha _{-1}=\alpha _{1}=\frac{1}{4}\) and \(C_e=0\), then

$$\begin{aligned} G_a=G+\frac{1}{5} C {{(\text{ j }k)}^{r+1}}{{h}^{r}}+O({{h}^{r+1}}) \end{aligned}$$
(76)

which indicates that the error of the actual approximation near the boundary is at least 20% greater than the serial one. This is exactly verified by the numerical results.

Appendix B

The following is to extended our approach to match the second truncation error terms in addition and see if it improves the result for parallel calculation of the first partial derivative in the same way as Sect. 2.2.3.

An explicit central difference scheme based on eight-point stencils is taken into account here. For instance, when \(i=1\) it can be expressed as

$$\begin{aligned} {{F}_{1}}=\frac{1}{h}\sum \limits _{n=-4}^{4}{{{a}_{n}}{{u}_{1+n}}}. \end{aligned}$$
(77)

If Eq. (77) is assumed to be a fourth-order accuracy scheme and the approximation \(F_1\) has the same coefficient of first and second truncation error term as Eq. (22), the coefficients \(a_n\) can be solved that

$$\begin{aligned}&\begin{matrix} {{a}_{-4}}=\frac{1}{432},&{{a}_{-3}}=-\frac{1}{36},&{{a}_{-2}}=\frac{37}{216},&{{a}_{-1}}=-\frac{83}{108}, \end{matrix}\nonumber \\&\begin{matrix} {{a}_{0}}=0, &{} {{a}_{1}}=\frac{83}{108}, &{} {{a}_{2}}=-\frac{37}{216}, &{} {{a}_{3}}=\frac{1}{36}, &{} {{a}_{4}}=-\frac{1}{432}. \\ \end{matrix} \end{aligned}$$
(78)

Comparisons of the effects that the sub-domain boundary scheme has on the \(L_{\infty }\)-norm errors between numerical results and exact results as well as the convergence order are shown in Table 9.

Table 9 Comparisons of the \(L_{\infty }\)-norm and the convergence order of serial computing method and the parallel computing setup with 3 and 8 processes using eight-stencil fourth-order central difference scheme as sub-domain boundary approximation. “Ratio” refers to the ratio of parallel computing results to serial computing results

Compared to Table 1, the extension to match the second truncation error terms does improve the results, especially when the grids size is small. This shows that for the case of small number of grid points per process, in order to minimize the discrepancies and improve the results, if enlarging the size of sub-boundary scheme stencils is available, we can expand our approach to match the second truncation error term or even more.

Appendix C

If the explicit schemes is employed as the sub-domain boundary schemes for the parallelization of CCU45, in order to meet the proposition in Sect. 2.1 for consistent accuracy and to have DRP property, the explicit schemes based on a seven-point stencil is required. Consider the following explicit schemes:

  1. (a)

    EC7 scheme:

    $$\begin{aligned} {F}_{bound\ i}=\frac{1}{h}\sum _{n=-3}^{3}a_{n}u_{i+n}. \end{aligned}$$
    (79)
  2. (b)

    E7U4 scheme:

    $$\begin{aligned} {F}_{bound\ i}=\frac{1}{h}\sum _{n=-4}^{2}b_{n}u_{i+n}. \end{aligned}$$
    (80)
  3. (c)

    E7U5 scheme:

    $$\begin{aligned} {F}_{bound\ i}=\frac{1}{h}\sum _{n=-5}^{1}c_{n}u_{i+n}. \end{aligned}$$
    (81)

The integrated error \(E(d_0)\) is defined as follows.

$$\begin{aligned} E\left( d_0 \right) =\int _{0}^{r\pi }{{\left| \omega -{{k}_{i}}\left( d_0 \right) \right| }^{2}} {W\left( \omega \right) } d\omega \end{aligned}$$
(82)

where r is a factor to determine the optimization integration range, \(d_0\) represents \(b_0\) or \(c_0\), and \(W(\omega )\) is a weighting function.

In fact, the EC7 scheme is a central difference scheme and its dispersion relation can not be optimized by changing \(a_0\). One can impose the conditions that E7U4 and E7U5 schemes are accurate to fourth-order and keep the coefficient of first truncation error term consistent with Eq. (49). These leave one of the coefficients, note \(b_0\) or \(c_0\), as a free parameter. This parameter can then be chosen to minimize the error integral E of Eq. (82). After carrying out the constrained minimization, the coefficients can be found, which are shown in Tables 10, 11 and 12.

Table 10 Coefficients of Eq. (79) for the EC7 scheme
Table 11 Coefficients of Eq. (80) for the E7U4 scheme
Table 12 Coefficients of Eq. (81) for the E7U5 scheme

With the use of Fourier transform, the modified wavenumber defined in Eq. (50) \(k_e\left( \omega \right) =k_{r}\left( \omega \right) +\text{ j } k_{i}\left( \omega \right) \) of the schemes can be obtained. The profiles of the imaginary part \(k_{i}\) of each scheme as well as the CCU45 scheme are shown in Fig. 4. Under the consideration of not employing too many halo points for data transfer, we find that it is difficult to improve the resolution characteristics of the explicit sub-domain boundary scheme by optimization with one or two extra halo points.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Chen, J., Yu, P., Ouyang, H. et al. A Novel Parallel Computing Strategy for Compact Difference Schemes with Consistent Accuracy and Dispersion. J Sci Comput 86, 5 (2021). https://doi.org/10.1007/s10915-020-01383-x

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s10915-020-01383-x

Keywords

Navigation