Scheduling parallel applications using malleable tasks on clusters

Scheduling is a central issue for implementing applications on parallel and distributed systems. This problem has been intensively studied for conventional parallel systems. Clusters of SMP (symmetric Multi-Processors) are a cost effective alternative to parallel supercomputers which are more and more popular. New characteristics are influencing the execution of parallel applications, like for instance the hierarchical structure and the heterogeneity of the processors. Communications between SMP usually need some important latencies that create large communication delays. Designing efficient software that take full advantage of such systems remains difficult. The model of malleable task (MT) was introduced some years ago and has been proved to be an efficient way for implementing parallel applications on conventional systems. Here, the target application is considered at a larger level of granularity than in other models (corresponding typically to numerical routines) where the tasks can themselves be executed in parallel. In this paper, we are interested in designing efficient lowcost scheduling algorithms for implementing parallel applications for clusters. We first discuss the problems that occur while scheduling an application on parallel systems and give a classification of applications leading to various scheduling problems. For each of these problems, we will use the same methodology for optimizing the resource utilization of parallel programs.

[1]  Han Hoogeveen,et al.  Three, four, five, six, or the complexity of scheduling with communication delays , 1994, Oper. Res. Lett..

[2]  Rodolphe Giroudeau,et al.  L'impact des delais de communications hierarchiques sur la complexite et l'approximation des problemes d'ordonnancement , 2000 .

[3]  Anja Feldmann,et al.  Optimal online scheduling of parallel jobs with dependencies , 1993, STOC.

[4]  Hugh Garraway Parallel Computer Architecture: A Hardware/Software Approach , 1999, IEEE Concurrency.

[5]  Thomas Decker,et al.  Parallel Real Root Isolation Using the Descartes Method , 1999, HiPC.

[6]  Denis Trystram,et al.  Dynamic Load Balancing for Ocean Circulation Model with Adaptive Meshing , 1999, Euro-Par.

[7]  Klaus Jansen,et al.  Linear Time Approximation Schemes for Shop Scheduling Problems , 2000 .

[8]  Denis Trystram,et al.  An approximation algorithm for scheduling trees of malleable tasks , 2002, Eur. J. Oper. Res..

[9]  Philip S. Yu,et al.  Approximate algorithms scheduling parallelizable tasks , 1992, SPAA '92.

[10]  Ronald L. Graham,et al.  Bounds on Multiprocessing Timing Anomalies , 1969, SIAM Journal of Applied Mathematics.

[11]  Denis Trystram,et al.  Efficient approximation algorithms for scheduling malleable tasks , 1999, SPAA '99.

[12]  Cynthia A. Phillips,et al.  Optimal Time-Critical Scheduling via Resource Augmentation , 1997, STOC '97.

[13]  David S. Johnson,et al.  Computers and Intractability: A Guide to the Theory of NP-Completeness , 1978 .

[14]  Pierre-François Dutot,et al.  Scheduling on hierarchical clusters using malleable tasks , 2001, SPAA '01.

[15]  Victor J. Rayward-Smith,et al.  UET scheduling with unit interprocessor communication delays , 1987, Discret. Appl. Math..

[16]  RENAUD LEPÈRE,et al.  Approximation Algorithms for Scheduling Malleable Tasks Under Precedence Constraints , 2001, Int. J. Found. Comput. Sci..

[17]  David B. Shmoys,et al.  Using dual approximation algorithms for scheduling problems: Theoretical and practical results , 1985, 26th Annual Symposium on Foundations of Computer Science (sfcs 1985).

[18]  Jacek Blazewicz,et al.  Schedulling Malleable Task with Convex Processing Speed Functions , 2000, Computación y Sistemas.

[19]  Jacek Blazewicz,et al.  Approximation Algorithms for Scheduling Independent Malleable Tasks , 2001, Euro-Par.

[20]  Joseph Y.-T. Leung,et al.  Complexity of Scheduling Parallel Task Systems , 1989, SIAM J. Discret. Math..

[21]  Walter Ludwig,et al.  Algorithms for scheduling malleable and nonmalleable parallel tasks , 1996, Technical Report / University of Wisconsin, Madison / Computer Sciences Department.

[22]  A. Steinberg,et al.  A Strip-Packing Algorithm with Absolute Performance Bound 2 , 1997, SIAM J. Comput..

[23]  Evripidis Bampis,et al.  On the complexity of scheduling with large communication delays , 1996 .

[24]  Robert E. Tarjan,et al.  Performance Bounds for Level-Oriented Two-Dimensional Packing Algorithms , 1980, SIAM J. Comput..

[25]  Klaus Jansen,et al.  Linear-Time Approximation Schemes for Scheduling Malleable Parallel Tasks , 1999, SODA '99.

[26]  Cynthia A. Phillips,et al.  Optimal Time-Critical Scheduling via Resource Augmentation (Extended Abstract) , 1997, Symposium on the Theory of Computing.