An Enhanced Data-aware Scheduling Algorithm for Batch-mode Data-intensive Jobs on Data Grid

This paper proposes an enhanced data-aware scheduling algorithm for batch-mode data-intensive jobs on the Gfarm data grid, implemented using the LSF plug-in mechanism. Batch-mode data-intensive jobs are categorized into two types. Both types are analyzed in detail and handled by our scheduling algorithm. Finally, an example is given to evaluate the algorithm, and some conclusions are drawn from the analysis.
