Skip to main content

Analyzing the Parallel Scalability of an Implicit Unstructured Mesh CFD Code

  • Conference paper
  • First Online:
High Performance Computing — HiPC 2000 (HiPC 2000)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1970))

Included in the following conference series:

Abstract

In this paper, we identify the scalability bottlenecks of an unstructured grid CFD code (PETSc-FUN3D) by studying the impact of several algorithmic and architectural parameters and by examiningdif ferent programmingmodels. We discuss the basic performance characteristics of this PDE code with the help of simple performance models developed in our earlier work, presentingprimarily experimental results. In addition to achievingg ood per-processor performance (which has been addressed in our cited work and without which scalability claims are suspect) we strive to improve the implementation and convergence scalability of PETSc-FUN3D on thousands of processors.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. W. K. Anderson and D. L. Bonhaus. An implicit upwind algorithm for computing turbulent flows on unstructured grids. Computers and Fluids, 23:1–21, 1994.

    Article  MATH  Google Scholar 

  2. W. K. Anderson, W. D. Gropp, D. K. Kaushik D. E. Keyes, and B. F. Smith. Achievinghig high sustained performance in an unstructured mesh CFD application. In Proceedings of SC’99. IEEE Computer Society, 1999. Gordon Bell Prize Award Paper in Special Category.

    Google Scholar 

  3. W. K. Anderson, R. D. Rausch, and D. L. TBonhaus. Implicit/multigrid algorithms for incompressible turbulent flows on unstructured grids. J. Computational Physics, 128:391–408, 1996.

    Article  MATH  Google Scholar 

  4. S. Balay, W. D. Gropp, L. C. McInnes, and B. F. Smith. The Portable Extensible Toolkit for Scientific Computing(PETSc) version 28. http://www.mcs.anl.gov/petsc/petsc.html, 2000.

  5. S. W. Bova, C. P. Breshears, C. E. Cuicchi, Z. Demirbilek, and H. A. Gabb. Dual-level parallel analysis of harbor wave response usingMPI and OpenMP. Int. J. High Performance Computing Applications, 14:49–64, 2000.

    Google Scholar 

  6. W. D. Gropp, D. K. Kaushik, D. E. Keyes, and B. F. Smith. Toward realistic performance bounds for implicit CFD codes. In D. Keyes, A. Ecer, J. Periaux, N. Satofuka, and P. Fox, editors, Proceedings of Parallel CFD’99, pages 233–240. Elsevier, 1999.

    Google Scholar 

  7. W. D. Gropp, D. K. Kaushik, D. E. Keyes, and B. F. Smith. Performance modelingand tuningof an unstructured mesh CFD application. In Proceedings of SC2000. IEEE Computer Society, 2000.

    Google Scholar 

  8. W. D. Gropp, L. C. McInnes, M. D. Tidriri, and D. E. Keyes. Globalized Newton-Krylov-Schwarz algorithms and software for parallel implicit CFD. Int. J. High Performance Computing Applications, 14:102–136, 2000.

    Article  Google Scholar 

  9. William Gropp, Ewing Lusk, and Anthony Skjellum. Using MPI: Portable Parallel Programming with the Message Passing Interface, 2nd edition. MIT Press, Cambridge, MA, 1999.

    Google Scholar 

  10. William D. Gropp and Ewing Lusk. Reproducible measurements of MPI performance characteristics. In Jack Dongarra, Emilio Luque, and Tomàs Margalef, editors, Recent Advances in Parallel Virtual Machine and Message Passing Interface, volume 1697 of Lecture Notes in Computer Science, pages 11–18. Springer Verlag, 1999. 6th European PVM/MPI Users’ Group Meeting, Barcelona, Spain, September 1999.

    Chapter  Google Scholar 

  11. P. D. Hough, T. G. Kolda, and V. J. Torczon. Asynchronous parallel pattern search for nonlinear optimization. Technical Report SAND2000-8213, Sandia National Laboratories, Livermore, January 2000. Submitted to SIAM J. Scientific Computation.

    Google Scholar 

  12. G. Karypis and V. Kumar. A fast and high quality scheme for partitioning irregular graphs. SIAM J. Scientific Computing, 20:359–392, 1999.

    Article  MATH  MathSciNet  Google Scholar 

  13. D. J. Mavriplis. Parallel unstructured mesh analysis of high-lift configurations. Technical Report 2000-0923, AIAA, 2000.

    Google Scholar 

  14. J. D. McCalpin. STREAM: Sustainable memory bandwidth in high performance computers. Technical report, University of Virginia, 1995. http://www.cs.virginia.edu/stream.

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2000 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Gropp, W.D., Kaushik, D.K., Keyes, D.E., Smith, B.F. (2000). Analyzing the Parallel Scalability of an Implicit Unstructured Mesh CFD Code. In: Valero, M., Prasanna, V.K., Vajapeyam, S. (eds) High Performance Computing — HiPC 2000. HiPC 2000. Lecture Notes in Computer Science, vol 1970. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44467-X_36

Download citation

  • DOI: https://doi.org/10.1007/3-540-44467-X_36

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-41429-2

  • Online ISBN: 978-3-540-44467-1

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics