论文信息 - Efficient Parallelization of an Unstructured Grid Solver : A Memory-Centric Approach

Efficient Parallelization of an Unstructured Grid Solver : A Memory-Centric Approach

For an unstructured grid computational fluid dynamics computation typical of many large-scale partial differential equations requiring implicit treatment, we describe coding practices that lead to high implementation efficiency for standard computational and communication kernels, in both uniprocessor and parallel senses. Moreover, a family of Newton-like preconditioned Krylov algorithms whose convergence rate degrades only slightly with increasing parallel granularity, relying primarily on sparse Jacobian-vector multiplications, can be expressed in terms of these kernels. A combination of the three (uniprocessor performance, parallel scalability, and algorithmic scalability) is required for overall high performance on the largest scale problems that a given generation of parallel platforms supports.

D. K. Kaushik | D. E. Keyes | D. Keyes | D. Kaushik

[1] William Gropp,et al. Parallel Implicit PDE Computations , 1997, Parallel CFD.

[2] James Demmel,et al. Optimizing matrix multiply using PHiPAC: a portable, high-performance, ANSI C coding methodology , 1997, ICS '97.

[3] David E. Keyes,et al. On the Interaction of Architecture and Algorithm in the Domain-based Parallelization of an Unstructu , 1997 .

[4] Vipin Kumar,et al. A Fast and High Quality Multilevel Scheme for Partitioning Irregular Graphs , 1998, SIAM J. Sci. Comput..

[5] W. K. Anderson,et al. Implicit/Multigrid Algorithms for Incompressible Turbulent Flows on Unstructured Grids , 1995 .

[6] D. Keyes,et al. Toward Realistic Performance Bounds for Implicit CFD , 1999 .

[7] W. K. Anderson,et al. An implicit upwind algorithm for computing turbulent flows on unstructured grids , 1994 .

[8] C. Kelley,et al. Convergence Analysis of Pseudo-Transient Continuation , 1998 .

[9] E. Cuthill,et al. Reducing the bandwidth of sparse symmetric matrices , 1969, ACM '69.

[10] D. Keyes. How Scalable is Domain Decomposition in Practice , 1998 .

[11] Barry F. Smith,et al. Methods for Compressible and Incompressible Flows on Unstructured Grids , 1999 .

[12] David E. Keyes,et al. Three Parallel Programming Paradigms: Comparisons on an Archetypal PDE Computation , 1999, Scalable Comput. Pract. Exp..