Scheduling a Divisible Task in a Two-dimensional Toroidal Mesh

Abstract In this paper, a problem of scheduling an arbitrarily divisible task is considered. Taking into account both communication delays and computation time we propose a scheduling method which minimizes total execution time. We focus on two dimensional processor networks assuming a circuit-switching routing mechanism. The scheduling method uses a scattering scheme proposed in Peters and Syska (IEEE Trans. Parallel Distributed Systems 7(3) (1996) 246–255) to distribute parts of the task to processors in a minimum time. We show how to model and solve this problem with a set of algebraic equations. A solution of the latter allows one to analyze the performance of the network depending on various actual parameters of the task and the parallel machine. Though the method is defined for a particular architecture and scattering scheme it can be generalized to analyze other architectures of parallel computer systems.

[1]  Joseph G. Peters,et al.  Circuit-Switched Broadcasting in Torus Networks , 1996, IEEE Trans. Parallel Distributed Syst..

[2]  Ranga Vemuri,et al.  An integrated multicomponent synthesis environment for MCMs , 1993, Computer.

[3]  Thomas G. Robertazzi,et al.  Distributed computation with communication delay (distributed intelligent sensor networks) , 1988 .

[4]  Craig B. Stunkel,et al.  The SP1 high-performance switch , 1994, Proceedings of IEEE Scalable High Performance Computing Conference.

[5]  Thomas G. Robertazzi,et al.  Closed Form Solutions for Bus and Tree Networks of Processors Load Sharing A Divisible Job , 1993, 1993 International Conference on Parallel Processing - ICPP'93.

[6]  Yousef Saad,et al.  Data communication in parallel architectures , 1989, Parallel Comput..

[7]  Thomas G. Robertazzi,et al.  An optimum load sharing strategy for divisible jobs with time-varying processor and channel speed , 1995 .

[8]  Denis Trystram,et al.  Implementation of Parallel Numerical Routines Using Broadcast Communication Schemes , 1990, CONPAR.

[9]  F. Leighton,et al.  Introduction to Parallel Algorithms and Architectures: Arrays, Trees, Hypercubes , 1991 .

[10]  Debasish Ghose,et al.  Distributed Computation with Communication Delays: Asymptotic Performance Analysis , 1994, J. Parallel Distributed Comput..

[11]  Philippe Michallon,et al.  Schemas de communications globales dans les reseaux de processeurs : application a la grille torique. (Global communication schemes in processor networks ; application in torus) , 1994 .

[12]  Christophe Calvin,et al.  Minimisation du sur-coût des communications dans la parallélisation des algorithmes numériques. (Minimization of the overhead of communications in the parallelization of numerical algorithms) , 1995 .

[13]  Jacek Blazewicz,et al.  Scheduling Divisible Jobs on Hypercubes , 1995, Parallel Comput..

[14]  Lionel M. Ni,et al.  A survey of wormhole routing techniques in direct networks , 1993, Computer.

[15]  Kai Hwang,et al.  Advanced computer architecture - parallelism, scalability, programmability , 1992 .

[16]  Phillip Krueger,et al.  ob Scheduling is More Important than Processor Allocation for Hypercube Computers , 1994, IEEE Trans. Parallel Distributed Syst..

[17]  Krithi Ramamritham,et al.  Distributed Scheduling of Tasks with Deadlines and Resource Requirements , 1989, IEEE Trans. Computers.

[18]  Frédéric Guinand Ordonnancement avec communications pour architectures multiprocesseurs dans divers modèles d'exécution , 1995 .