Optimal Scheduling for UET/UET-UCT Generalized n-Dimensional Grid Task Graphs

Then-dimensional grid is one of the most representative patterns of data flow in parallel computation. Many scientific algorithms, which require nearest neighbor communication in a lattice space, are modeled by a task graph with the properties of a simple or enhanced grid. The two most frequently used scheduling models for grids are the unit execution time-zero communication delay (UET) and the unit execution time?unit communication time (UET-UCT). In this paper we introduce an enhanced model of then-dimensional grid by adding extra diagonal edges and allowing unequal boundaries for each dimension. For this generalized grid topology we establish the optimal makespan for both cases of UET/UET-UCT grids. Then we give a closed formula that calculates the minimum number of processors required to achieve the optimal makespan. Finally, we propose a low-complexity optimal time and processor scheduling strategy for both cases.

[1]  Richard M. Karp,et al.  The Organization of Computations for Uniform Recurrence Equations , 1967, JACM.

[2]  Leslie Lamport,et al.  The parallel execution of DO loops , 1974, CACM.

[3]  Jeffrey D. Ullman,et al.  NP-Complete Scheduling Problems , 1975, J. Comput. Syst. Sci..

[4]  S. Kung,et al.  VLSI Array processors , 1985, IEEE ASSP Magazine.

[5]  Dan I. Moldovan,et al.  Partitioning and Mapping Algorithms into Fixed Size Systolic Arrays , 1986, IEEE Transactions on Computers.

[6]  Dan I. Moldovan,et al.  ADVIS: A Software Package for the Design of Systolic Arrays , 1987, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems.

[7]  Victor J. Rayward-Smith,et al.  UET scheduling with unit interprocessor communication delays , 1987, Discret. Appl. Math..

[8]  Thomas Kailath,et al.  Regular iterative algorithms and their implementation on processor arrays , 1988, Proc. IEEE.

[9]  Paul G. Spirakis,et al.  Lower bounds and efficient algorithms for multiprocessor scheduling of dags with communication delays , 1989, SPAA '89.

[10]  P. Chrétienne A polynomial algorithm to optimally schedule tasks on a virtual distributed system under tree-like precedence constraints , 1989 .

[11]  Mihalis Yannakakis,et al.  Towards an Architecture-Independent Analysis of Parallel Algorithms , 1990, SIAM J. Comput..

[12]  Alain Darte Regular partitioning for synthesizing fixed-size systolic arrays , 1991, Integr..

[13]  Weijia Shang,et al.  Time Optimal Linear Schedules for Algorithms with Uniform Dependencies , 1991, IEEE Trans. Computers.

[14]  Philippe Chrétienne,et al.  C.P.M. Scheduling with Small Communication Delays and Task Duplication , 1991, Oper. Res..

[15]  Yves Robert,et al.  Linear Scheduling Is Nearly Optimal , 1991, Parallel Process. Lett..

[16]  Christophe Picouleau Etude de problemes d'optimisation dans les systemes distribues , 1992 .

[17]  Tao Yang,et al.  On the Granularity and Clustering of Directed Acyclic Task Graphs , 1993, IEEE Trans. Parallel Distributed Syst..

[18]  Christian Lengauer,et al.  A Systolizing Compilation Scheme for Nested Loops with Linear Bounds , 1993, Functional Programming, Concurrency, Simulation and Automated Reasoning.

[19]  Dan I. Moldovan,et al.  Parallel processing - from applications to systems , 1993 .

[20]  Yves Robert,et al.  Mapping Uniform Loop Nests Onto Distributed Memory Architectures , 1993, Parallel Comput..

[21]  Roland Zink,et al.  GRIDS - A programming system for grid-based technical and scientific applications on parallel systems , 1994, Future Gener. Comput. Syst..

[22]  Tao Yang,et al.  DSC: Scheduling Parallel Tasks on an Unbounded Number of Processors , 1994, IEEE Trans. Parallel Distributed Syst..

[23]  Eugene L. Lawler,et al.  Scheduling In and Out Forests in the Presence of Communication Delays , 1996, IEEE Trans. Parallel Distributed Syst..

[24]  Nectarios Koziris,et al.  Optimal Time and Efficient Space Free Scheduling For Nested Loops , 1996, Comput. J..

[25]  Optimal Schedules for d-D Grid Graphs with Communication Delays (Extended Abstract) , 1996, STACS.

[26]  Nectarios Koziris,et al.  Mapping nested loops onto distributed memory multiprocessors , 1997, Proceedings 1997 International Conference on Parallel and Distributed Systems.

[27]  Nectarios Koziris,et al.  Lower Time and Processor Bounds for Efficient Mapping of Uniform Dependence Algorithms into Systolic Arrays , 1997, Parallel Algorithms Appl..

[28]  Nectarios Koziris,et al.  Automatic Hardware Synthesis of Nested Loops Using UET Grids and VHDL , 1997, HPCN Europe.

[29]  Tao Yang,et al.  Heuristic Algorithms for Scheduling Iterative Task Computations on Distributed Memory Machines , 1997, IEEE Trans. Parallel Distributed Syst..

[30]  Evripidis Bampis,et al.  Optimal Schedules for d-D Grid Graphs with Communication Delays , 1998, Parallel Comput..