skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: A low-communication, parallel algorithm for solving PDEs based on range decomposition

Journal Article · · Numerical Linear Algebra with Applications
DOI:https://doi.org/10.1002/nla.2041· OSTI ID:1533195

This work proposes a new, low-communication algorithm for solving PDEs on massively parallel computers. The range decomposition (RD) algorithm exposes coarse-grain parallelism by applying nested iteration and adaptive mesh refinement locally before performing a global communication step. Just a few such steps are observed to be sufficient to obtain accuracy within a small multiple of discretization error. The target applications are petascale and exascale machines, where hierarchical parallelism is required and traditional parallel numerical PDE communication patterns are costly because of message latency. The RD algorithm uses a partition of unity to equally distribute the error, and thus, the work. The computational advantages of this approach are that the decomposed problems can be solved in parallel without any communication until the partitioned solutions are summed. This offers potential advantages in the paradigm of expensive communication but very cheap computation. This paper introduces the method and explains the details of the communication step. Two performance models are developed, showing that the latency cost associated with a traditional parallel implementation of nested iteration is proportional to log(P)2, whereas the RD method reduces the communication latency to log(P), while maintaining similar bandwidth costs. Numerical results for two problems, Laplace and advection diffusion, demonstrate the enhanced performance, and a heuristic argument explains why the method converges quickly.

Research Organization:
Texas A & M Univ., College Station, TX (United States); Univ. of Colorado, Boulder, CO (United States)
Sponsoring Organization:
USDOE National Nuclear Security Administration (NNSA); USDOE Office of Science (SC); National Science Foundation (NSF); US Air Force Office of Scientific Research (AFOSR)
Grant/Contract Number:
NA0002376; FG02-03ER25574; B614452; FA95501210478
OSTI ID:
1533195
Journal Information:
Numerical Linear Algebra with Applications, Vol. 24, Issue 3; ISSN 1070-5325
Publisher:
WileyCopyright Statement
Country of Publication:
United States
Language:
English
Citation Metrics:
Cited by: 4 works
Citation information provided by
Web of Science

References (11)

Theh?p version of the finite element method for elliptic equations of order2m journal January 1988
A New Paradigm for Parallel Adaptive Meshing Algorithms journal January 2000
First-Order System Least Squares for Second-Order Partial Differential Equations: Part I journal December 1994
XSEDE: Accelerating Scientific Discovery journal September 2014
BoomerAMG: A parallel algebraic multigrid solver and preconditioner journal April 2002
First-Order System Least Squares for Second-Order Partial Differential Equations: Part II journal April 1997
Algebraic Multigrid Domain and Range Decomposition (AMG-DD/AMG-RD) journal January 2015
Parallel Adaptive Multilevel Methods with Full Domain Partitions journal March 2004
Modeling the performance of an algebraic multigrid cycle on HPC platforms conference January 2011
Parallel adaptive mesh refinement for first-order system least squares: PARALLEL AMR FOR FOSLS-AMG journal January 2012
Efficiency Based Adaptive Local Refinement for First-Order System Least-Squares Formulations journal January 2011

Cited By (1)

Preparing sparse solvers for exascale computing
  • Anzt, Hartwig; Boman, Erik; Falgout, Rob
  • Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences, Vol. 378, Issue 2166 https://doi.org/10.1098/rsta.2019.0053
journal January 2020