Improving job scheduling performance with parallel access to replicas in Data Grid environment

Data Grid has evolved to be the solution for data-intensive applications, such as High Energy Physics (HEP), astrophysics, and computational genomics. These applications usually have large input of data to be analyzed and these input data are widely replicated across Data Grid to improve the performance. The job scheduling performance on traditional computing jobs can be studied using queuing theory. However, with the addition of data transfer, the job scheduling performance is too complex to be modeled. In this research, we study the impact of data transfer on the performance of job scheduling in the Data Grid environment. We have proposed a parallel downloading system that supports replicating data fragments and parallel downloading of replicated data fragments, to improve the job scheduling performance. The performance of the parallel downloading system is compared with non-parallel downloading system, using three scheduling heuristics: Shortest Turnaround Time (STT), Least Relative Load (LRL) and Data Present (DP). Our simulation results show that the proposed parallel download approach greatly improves the Data Grid performance for all three scheduling algorithms, in terms of the geometric mean of job turnaround time. The advantage of parallel downloading system is most evident when the Data Grid has relatively low network bandwidth and relatively high computing power.

[1]  Kurt Stockinger,et al.  OptorSim-A Grid Simulator for Studying Dynamic Data Replication Strategies , 2003 .

[2]  George Kingsley Zipf,et al.  Human behavior and the principle of least effort , 1949 .

[3]  Ami Marowka,et al.  The GRID: Blueprint for a New Computing Infrastructure , 2000, Parallel Distributed Comput. Pract..

[4]  Cynthia Bailey Lee,et al.  Are User Runtime Estimates Inherently Inaccurate? , 2004, JSSPP.

[5]  Ruay-Shiung Chang,et al.  A multiple parallel download scheme with server throughput and client bandwidth considerations for data grids , 2008, Future Gener. Comput. Syst..

[6]  Larry Rudolph,et al.  Metrics and Benchmarking for Parallel Job Scheduling , 1998, JSSPP.

[7]  Emmanouel A. Varvarigos,et al.  Statistical Analysis and Modeling of Jobs in a Grid Environment , 2007, Journal of Grid Computing.

[8]  Larry Rudolph,et al.  Proceedings of the Workshop on Job Scheduling Strategies for Parallel Processing , 1995 .

[9]  Sudharshan S. Vazhkudai Enabling the co-allocation of grid data transfers , 2003, Proceedings. First Latin American Web Congress.

[10]  Chao-Tung Yang,et al.  Enhancement of Anticipative Recursively-Adjusting Mechanism for Redundant Parallel File Transfer in Data Grids , 2008, 2008 14th IEEE International Conference on Parallel and Distributed Systems.

[11]  Chao-Tung Yang,et al.  Implementation of a dynamic adjustment strategy for parallel file transfer in co-allocation data grids , 2009, The Journal of Supercomputing.

[12]  NetComm Limited LAN(Local area network) , 2010 .

[13]  Kavitha Ranganathan,et al.  Identifying Dynamic Replication Strategies for a High-Performance Data Grid , 2001, GRID.

[14]  Kavitha Ranganathan,et al.  Decoupling computation and data scheduling in distributed data-intensive applications , 2002, Proceedings 11th IEEE International Symposium on High Performance Distributed Computing.

[15]  Chien-Min Wang,et al.  Efficient multi-source data transfer in data grids , 2006, Sixth IEEE International Symposium on Cluster Computing and the Grid (CCGRID'06).

[16]  Francine Berman,et al.  When the Herd Is Smart: Aggregate Behavior in the Selection of Job Request , 2003, IEEE Trans. Parallel Distributed Syst..

[17]  Gerd Keiser,et al.  Local Area Networks , 1989 .

[18]  Ian T. Foster,et al.  The data grid: Towards an architecture for the distributed management and analysis of large scientific datasets , 2000, J. Netw. Comput. Appl..

[19]  Chao-Tung Yang,et al.  Improvements on dynamic adjustment mechanism in co-allocation data grid environments , 2007, The Journal of Supercomputing.

[20]  Satoshi Matsuoka,et al.  Performance analysis of scheduling and replication algorithms on Grid Datafarm architecture for high-energy physics applications , 2003, High Performance Distributed Computing, 2003. Proceedings. 12th IEEE International Symposium on.

[21]  Ruay-Shiung Chang,et al.  Accessing data from many servers simultaneously and adaptively in data grids , 2010, Future Gener. Comput. Syst..

[22]  Kavitha Ranganathan,et al.  Simulation Studies of Computation and Data Scheduling Algorithms for Data Grids , 2003, Journal of Grid Computing.

[23]  Chao-Tung Yang,et al.  Implementation of a Cyber Transformer for Parallel Download in Co-Allocation Data Grid Environments , 2008, 2008 Seventh International Conference on Grid and Cooperative Computing.

[24]  Floriano Zini,et al.  Analysis of Scheduling and Replica Optimisation Strategies for Data Grids Using OptorSim , 2004, Journal of Grid Computing.

[25]  Ming Tang,et al.  The impact of data replication on job scheduling performance in the Data Grid , 2006, Future Gener. Comput. Syst..

[26]  Sathish S. Vadhiyar,et al.  Efficient reuse of replicated parallel data segments in computational grids , 2008, Future Gener. Comput. Syst..

[27]  Ming Tang,et al.  Dynamic replication algorithms for the multi-tier Data Grid , 2005, Future Gener. Comput. Syst..

[28]  Ruay-Shiung Chang,et al.  Complete and fragmented replica selection and retrieval in Data Grids , 2007, Future Gener. Comput. Syst..

[29]  Chao-Tung Yang,et al.  Implementation of a dynamic adjustment mechanism with efficient replica selection in data grid environments , 2006, SAC '06.

[30]  Ruay-Shiung Chang,et al.  An efficient and bandwidth sensitive parallel download scheme in data grids , 2008, 2008 3rd International Conference on Communication Systems Software and Middleware and Workshops (COMSWARE '08).

[31]  William E. Johnston,et al.  The Computing and Data Grid Approach: Infrastructure for Distributed Science Applications , 2013, Comput. Artif. Intell..

[32]  R. F. Freund,et al.  Dynamic Mapping of a Class of Independent Tasks onto Heterogeneous Computing Systems , 1999, J. Parallel Distributed Comput..

[33]  Ian Foster,et al.  The Grid 2 - Blueprint for a New Computing Infrastructure, Second Edition , 1998, The Grid 2, 2nd Edition.

[34]  Warren Smith,et al.  Benchmarks and Standards for the Evaluation of Parallel Job Schedulers , 1999, JSSPP.