An efficient 3D grid based scheduling for heterogeneous systems

The cost/performance ratio of networks of workstations has been constantly improving. This trend is expected to continue in the near future. The aggregate peak rate of such systems often matches or exceeds the peak rate offered by the fastest parallel computers. This has motivated research toward using a network of computers, interconnected via a fast network (cluster system) or a simple Local Area Network (LAN) (distributed system), for high performance concurrent computations. Some of the important research issues arise such as (i) Problem partitioning and virtual interconnection topology mapping; (ii) Execution scheduling and load balancing.Past results exist for grid partitioning (into subdomains) and mapping to parallel and distributed systems. In our work we consider the problem of grid partitioning of a 3D domain arising in aircraft CFD simulations in order to schedule tasks for load balanced execution on a heterogeneous distributed system. This problem has additional restrictions on how to partition the grid. Past work for this problem were on parallel systems with only few processor configurations. We derive heuristic algorithms for: (1) homogeneous systems with any number of processors; (2) heterogeneous systems taking into account the processor speed and memory capacity. We implement our algorithms on a dedicated network of workstations (using MPI) and test them with a CFD simulation code (TURNS--Transonic Unsteady Rotor Navier Stokes).

[1]  Anthony T. Chronopoulos,et al.  Parallel Iterative S-Step Methods for Unsymmetric Linear Systems , 1996, Parallel Comput..

[2]  Roger C. Strawn,et al.  Parallelization of a Three-Dimensional Flow Solver for Euler Rotorcraft Aerodynamics Predictions , 1996 .

[3]  Daniel Grosu Some Performance Metrics for Heterogeneous Distributed Systems , 1996, PDPTA.

[4]  J. Baeder,et al.  TURNS - A free-wake Euler/Navier-Stokes numerical method for helicopter rotors , 1993 .

[5]  Gary L. Miller,et al.  A unified geometric approach to graph separators , 1991, [1991] Proceedings 32nd Annual Symposium of Foundations of Computer Science.

[6]  David M. Nicol,et al.  Rectilinear Partitioning of Irregular Data Parallel Computations , 1994, J. Parallel Distributed Comput..

[7]  Jack Dongarra,et al.  Numerical Linear Algebra for High-Performance Computers , 1998 .

[8]  Anthony T. Chronopoulos,et al.  Parallel Newton-Krylov method for rotary-wing flowfield calculations , 1999 .

[9]  Gary B. Lamont,et al.  Asymmetric Load Balancing on a Heterogeneous Cluster of PCs , 1999, PDPTA.

[10]  Lie Wang,et al.  Towards a fast implementation of spectral nested dissection , 1992, Proceedings Supercomputing '92.

[11]  Meng-Sing Liou,et al.  Preconditioned implicit solvers for the Navier-Stokes equations on distributed-memory machines , 1994 .

[12]  Vipin Kumar,et al.  A Parallel Algorithm for Multilevel Graph Partitioning and Sparse Matrix Ordering , 1998, J. Parallel Distributed Comput..

[13]  William Gropp,et al.  Users guide for mpich, a portable implementation of MPI , 1996 .

[14]  Vipin Kumar,et al.  A Fast and High Quality Multilevel Scheme for Partitioning Irregular Graphs , 1998, SIAM J. Sci. Comput..

[15]  A. Jameson,et al.  Lower-upper Symmetric-Gauss-Seidel method for the Euler and Navier-Stokes equations , 1988 .

[16]  Michael J. Quinn,et al.  Non-uniform 2-D grid partitioning for heterogeneous parallel architectures , 1995, Proceedings of 9th International Parallel Processing Symposium.

[17]  Thomas Sterling,et al.  An assessment of Beowulf-class computing for NASA requirements: initial findings from the first NASA workshop on Beowulf-class clustered computing , 1998, 1998 IEEE Aerospace Conference Proceedings (Cat. No.98TH8339).

[18]  T Decker,et al.  A Distributed Load Balancing Algorithm for Heterogeneous Parallel Computing Systems , 1998 .

[19]  Vipin Kumar,et al.  Graph partitioning for dynamic, adaptive and multi-phase scientific simulations , 2001, Proceedings 42nd IEEE Symposium on Foundations of Computer Science.

[20]  William Gropp,et al.  User''s Guide for mpich , 1996 .

[21]  Robert van Engelen,et al.  Graph Partitioning for High Performance Scienti c Simulations , 2000 .

[22]  Dana A. Knoll,et al.  Comparison of standard and matrix-free implementations of several Newton-Krylov solvers , 1994 .