The design and implementation of a parallel unstructured Euler solver using software primitives

We are concerned with the implementation of a three-dimensional unstructured-grid Euler solver on massively parallel distributed-memory computer architectures. The goal is to minimize solution time by achieving high computational rates with a numerically efficient algorithm. An unstructured multigrid algorithm with an edge-based data structure has been adopted, and a number of optimizations have been devised and implemented to accelerate the parallel computational rates. The implementation is carried out by creating a set of software tools, which provide an interface between the parallelization issues and the sequential code, while providing a basis for future automatic run-time compilation support

[1]  Joel H. Saltz,et al.  Principles of runtime support for parallel processors , 1988, ICS '88.

[2]  James D. Schoeffler Microprocessor Architecture , 1975, IEEE Transactions on Industrial Electronics and Control Instrumentation.

[3]  Alok Choudhary,et al.  Runtime compilation techniques for data partitioning and communication schedule reuse , 1993, Supercomputing '93.

[4]  Joel H. Saltz,et al.  Applying the CHAOS/PARTI library to irregular problems in computational chemistry and computational aerodynamics , 1993, Proceedings of Scalable Parallel Libraries Conference.

[5]  P. Sadayappan,et al.  Compile-Time Charactirization Recurrent Patterns in Irregular Computations , 1993, 1993 International Conference on Parallel Processing - ICPP'93.

[6]  Geoffrey C. Fox,et al.  Runtime Support and Compilation Methods for User-Specified Irregular Data Distributions , 1995, IEEE Trans. Parallel Distributed Syst..

[7]  Neal Margulis,et al.  I860 Microprocessor Architecture , 1990 .

[8]  Joel H. Saltz,et al.  Runtime and language support for compiling adaptive irregular programs on distributed‐memory machines , 1995, Softw. Pract. Exp..

[9]  D WilliamsRoy Performance of dynamic load balancing algorithms for unstructured mesh calculations , 1991 .

[10]  M. Gerndt,et al.  SUPERB support for irregular scientific computations , 1992, Proceedings Scalable High Performance Computing Conference SHPCC-92..

[11]  Wayne Smith,et al.  A scalable high-performance environment for fluid flow analysis on unstructured grids , 1994, Proceedings of Supercomputing '94.

[12]  Joel H. Saltz,et al.  Adaptive runtime support for direct simulation Monte Carlo methods on distributed memory architectures , 1994, Proceedings of IEEE Scalable High Performance Computing Conference.

[13]  Alex Pothen,et al.  PARTITIONING SPARSE MATRICES WITH EIGENVECTORS OF GRAPHS* , 1990 .

[14]  E. Cuthill,et al.  Reducing the bandwidth of sparse symmetric matrices , 1969, ACM '69.

[15]  D. Mavriplis Three dimensional unstructured multigrid for the Euler equations , 1991 .

[16]  Harry Berryman,et al.  Multiprocessors and run-time compilation , 1991, Concurr. Pract. Exp..

[17]  Johan De Keyser,et al.  Adaptive Irregular Multiple Grids on a Distributed Memory Multiprocessor , 1991, EDMCC.

[18]  Michael J. Quinn,et al.  Block data decomposition for partial-homogeneous parallel networks , 1994, 1994 Proceedings of the Twenty-Seventh Hawaii International Conference on System Sciences.

[19]  Joel H. Saltz,et al.  The Dybbuk runtime system , 1993, Digest of Papers. Compcon Spring.

[20]  Shahid H. Bokhari,et al.  A Partitioning Strategy for Nonuniform Problems on Multiprocessors , 1987, IEEE Transactions on Computers.

[21]  Horst D. Simon,et al.  Partitioning of unstructured problems for parallel processing , 1991 .

[22]  T. Barth,et al.  Efficient massively parallel Euler solver for two-dimensional unstructured grids , 1992 .

[23]  Harry Berryman,et al.  Execution time support for adaptive scientific algorithms on distributed memory machines , 1991, Concurr. Pract. Exp..

[24]  Harry Berryman,et al.  Distributed Memory Compiler Design for Sparse Problems , 1995, IEEE Trans. Computers.

[25]  Joel H. Saltz,et al.  Parallelizing Molecular Dynamics Programs for Distributed Memory Machines: An Application of the Cha , 1994 .

[26]  Youcef Saad,et al.  A Basic Tool Kit for Sparse Matrix Computations , 1990 .

[27]  Youcef Saad,et al.  A Basic Tool Kit for Sparse Matrix Computations , 1990 .

[28]  A Jameson,et al.  CALCULATION OF IN VISCID TRANSONIC FLOW OVER A COMPLETE AIRCRAFT , 1986 .

[29]  Neal Margulis i860 microprocessor internal architecture , 1990 .