Achieving Guaranteed Service with Fault-Tolerant Resources in Grid

Grid computing, built from loosely coupled virtual resources and distinct from traditional high-performance computing, is widely used for computation-intensive problem solving in science and technology. Maintaining performance and balancing load across grid resources is challenging because resources are prone to failure. This paper aims to improve the efficiency of the Nearest Deadline First Scheduled (NDFS) algorithm by treating resource failure as a sudden occurrence in the grid. The algorithm introduces periodic runtime backup to another available resource so that the Quality of Service approved in the service quality agreement is retained. The paper presents multiple job execution cases, implemented with benchmark codes executed on a local grid test bed running Globus Toolkit middleware, with emphasis on resource failure. The experimental results establish the need for the proposed algorithm to reduce job deadline misses even when unexpected resource failures occur.
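The interaction between deadline-ordered scheduling and periodic runtime backup can be illustrated with a small simulation. This is a minimal sketch under stated assumptions, not the paper's implementation: it assumes one primary resource that fails once, a backup resource that resumes the interrupted job from its last saved state, and unit time steps. The names `Job`, `run`, `missed`, `fail_at`, and `backup_every` are hypothetical and chosen only for illustration.

```python
import heapq
from dataclasses import dataclass, field, replace


@dataclass(order=True)
class Job:
    deadline: int                                  # jobs are ordered by deadline only
    name: str = field(compare=False)
    work: int = field(compare=False)               # time units of work required
    done: int = field(default=0, compare=False)    # work completed so far
    saved: int = field(default=0, compare=False)   # progress held on the backup


def run(jobs, fail_at, backup_every):
    """Run jobs nearest-deadline-first; the resource fails once at `fail_at`.

    Assumption: after the failure, a backup resource continues the job from
    its last saved progress. `backup_every=0` disables backups, so the
    interrupted job restarts from scratch. Returns {job name: finish time}.
    """
    queue = [replace(j) for j in jobs]  # copy so the caller's jobs stay unchanged
    heapq.heapify(queue)
    clock, failed, finish = 0, False, {}
    while queue:
        job = heapq.heappop(queue)      # job with the nearest deadline
        while job.done < job.work:
            if not failed and clock == fail_at:
                failed = True
                job.done = job.saved    # unsaved progress is lost
            clock += 1
            job.done += 1
            if backup_every and job.done % backup_every == 0:
                job.saved = job.done    # periodic runtime backup
        finish[job.name] = clock
    return finish


def missed(jobs, finish):
    """Names of jobs that finished after their deadline."""
    return [j.name for j in jobs if finish[j.name] > j.deadline]


jobs = [Job(deadline=20, name="B", work=5), Job(deadline=10, name="A", work=6)]
with_backup = run(jobs, fail_at=5, backup_every=2)
without_backup = run(jobs, fail_at=5, backup_every=0)
print(missed(jobs, with_backup), missed(jobs, without_backup))
```

In this toy setting, job A loses only the work done since its last backup and still meets its deadline, whereas without backups it restarts from zero and misses it. The sketch ignores backup transfer cost and failure detection delay, both of which a real grid deployment would need to account for.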
