A resource selection strategy and check pointing to minimize computational time in case of grid resource failure

Grid provides us a huge amount of computational resources in a distributed manner, using which we can perform our tasks over these grid environments. The main bottleneck of the grid environment is resource failure. When a resource fails, the grid scheduler reschedules that job on the new resource from its last saved checkpoint information. In this paper, we propose the following algorithm: if a resource fails, then it tries to minimize the completion time of a job running on two different resources with different characteristics by comparing highest check point information between these two concurrent resources.

[1]  Francine Berman,et al.  Adaptive Computing on the Grid Using AppLeS , 2003, IEEE Trans. Parallel Distributed Syst..

[2]  Indranil Gupta,et al.  On scalable and efficient distributed failure detectors , 2001, PODC '01.

[3]  V. R. Uthariaraj,et al.  FAULT TOLERANT SCHEDULING STRATEGY FOR COMPUTATIONAL GRID ENVIRONMENT , 2010 .

[4]  Naohiro Hayashibara,et al.  Failure detectors for large-scale distributed systems , 2002, 21st IEEE Symposium on Reliable Distributed Systems, 2002. Proceedings..

[5]  Rajkumar Buyya,et al.  A Deadline and Budget Constrained Cost-Time Optimisation Algorithm for Scheduling Task Farming Applications on Global Grids , 2002, ArXiv.

[6]  Shun-Li Ding,et al.  An algorithm for agent-based task scheduling in grid environments , 2004, Proceedings of 2004 International Conference on Machine Learning and Cybernetics (IEEE Cat. No.04EX826).

[7]  Li Qian-mu,et al.  A Root-fault Detection System of Grid Based on Immunology , 2006, 2006 Fifth International Conference on Grid and Cooperative Computing (GCC'06).

[8]  Stephen Gilmore,et al.  2005 Ieee International Symposium on Cluster Computing and the Grid Enhancing the Effective Utilisation of Grid Clusters by Exploiting On-line Performability Analysis , 2022 .

[9]  N. Malarvizhi,et al.  A Broker-Based Approach to Resource Discovery and Selection in Grid Environments , 2008, 2008 International Conference on Computer and Electrical Engineering.