Analysis of Tasks Reallocation in a Dedicated Grid Environment

In this paper, we study the impact of tasks reallocation onto a multi-cluster environment where clusters are heterogeneous and use different resources management policies. In this context, we propose a reallocation mechanism that migrates waiting jobs from one cluster to another. We performed simulations using real traces to study benefits of reallocations. We compared two algorithms providing the reallocation mechanism, each with several heuristics to schedule jobs. Results show that in some cases it is possible to obtain a substantial gain on the average job response time (more than a factor of two). In the other cases, the reallocation mechanism is beneficial most of the time, making of great interest the implementation of a reallocation mechanism in a Grid framework.

[1]  Franck Cappello,et al.  Grid'5000: A Large Scale And Highly Reconfigurable Experimental Grid Testbed , 2006, Int. J. High Perform. Comput. Appl..

[2]  Yves Caniou,et al.  Simbatch: An API for Simulating and Predicting the Performance of Parallel Resources Managed by Batch Systems , 2008, Euro-Par Workshops.

[3]  Uwe Schwiegelshohn,et al.  Online scheduling in grids , 2008, 2008 IEEE International Symposium on Parallel and Distributed Processing.

[4]  Dan Tsafrir,et al.  Workload sanitation for performance evaluation , 2006, 2006 IEEE International Symposium on Performance Analysis of Systems and Software.

[5]  P. Sadayappan,et al.  Scheduling of Parallel Jobs in a Heterogeneous Multi-site Environement , 2003, JSSPP.

[6]  Uwe Schwiegelshohn,et al.  Analysis of first-come-first-serve parallel job scheduling , 1998, SODA '98.

[7]  Jianhui Yue,et al.  Global Backfilling Scheduling in Multiclusters , 2004, AACC.

[8]  Ibm Redbooks,et al.  Workload Management With Loadleveler , 2001 .

[9]  Eddy Caron,et al.  Diet: A Scalable Toolbox to Build Network Enabled Servers on the Grid , 2006, Int. J. High Perform. Comput. Appl..

[10]  Mark J. Clement,et al.  Core Algorithms of the Maui Scheduler , 2001, JSSPP.

[11]  Rajkumar Buyya,et al.  Rescheduling co-allocation requests based on flexible advance reservations and processor remapping , 2008, 2008 9th IEEE/ACM International Conference on Grid Computing.

[12]  David A. Lifka,et al.  The ANL/IBM SP Scheduling System , 1995, JSSPP.

[13]  F. Desprez,et al.  Meta-scheduling and Task Reallocation in a Grid Environment , 2009, 2009 Third International Conference on Advanced Engineering Computing and Applications in Sciences.

[14]  Henri Casanova,et al.  SimGrid: A Generic Framework for Large-Scale Distributed Experiments , 2008, Tenth International Conference on Computer Modeling and Simulation (uksim 2008).

[15]  R. F. Freund,et al.  Dynamic Mapping of a Class of Independent Tasks onto Heterogeneous Computing Systems , 1999, J. Parallel Distributed Comput..

[16]  Marta Beltrán,et al.  The Impact of Workload Variability on Load Balancing Algorithms , 2009, Scalable Comput. Pract. Exp..

[17]  Uwe Schwiegelshohn,et al.  Parallel Job Scheduling - A Status Report , 2004, JSSPP.

[18]  Julita Corbalán,et al.  A Job Self-scheduling Policy for HPC Infrastructures , 2007, JSSPP.

[19]  Alexandru Iosup,et al.  Trace-based evaluation of job runtime and queue wait time predictions in grids , 2009, HPDC '09.

[20]  Mitsuhisa Sato,et al.  Ninf: A Network Based Information Library for Global World-Wide Computing Infrastructure , 1997, HPCN Europe.

[21]  R. F. Freund,et al.  Dynamic matching and scheduling of a class of independent tasks onto heterogeneous computing systems , 1999, Proceedings. Eighth Heterogeneous Computing Workshop (HCW'99).

[22]  Larry Rudolph,et al.  Metrics and Benchmarking for Parallel Job Scheduling , 1998, JSSPP.

[23]  Georges Da Costa,et al.  2005 IEEE International Symposium on Cluster Computing and the Grid , 2005, CCGRID.

[24]  Wolfgang Gentzsch,et al.  Sun Grid Engine: towards creating a compute power grid , 2001, Proceedings First IEEE/ACM International Symposium on Cluster Computing and the Grid.