Using rough set based multi-checkpointing for fault-tolerance scheduling in economic grids

Fault-tolerant scheduling is of vital importance in Grid computing. Task replication and checkpointing are two popular methods for achieving fault-tolerant scheduling. Replication is not well suited to economic-based grid computing because it uses a large number of resources, and the consumer must pay for the time spent on every participating node. In this paper, we propose a fault-tolerant scheduling technique for economic-based grids, based on multi-checkpointing and rough set theory, that aims for minimum cost, high efficiency, and minimum latency. Our approach assumes that if one of the provider nodes fails, there is not enough time to restart the task from the beginning on a new node. The experimental results show that the method is promising, with lower computation cost and better fault tolerance within an acceptable completion time.
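The trade-off between restarting a failed task and resuming it from a checkpoint can be illustrated with a minimal sketch. This is not the scheduling technique proposed in the paper and does not use rough set theory. The function name `completion_time`, the single-failure model, and the assumption of zero checkpoint and migration overhead are all hypothetical simplifications chosen for illustration.

```python
def completion_time(task_length, failure_at, checkpoint_interval=None):
    """Total elapsed time for a task that suffers one node failure.

    Illustrative assumptions (not taken from the paper):
    - exactly one failure occurs, at time `failure_at`;
    - writing a checkpoint and migrating to a new node cost no time;
    - checkpoints are taken every `checkpoint_interval` time units.
    """
    if not 0 <= failure_at < task_length:
        raise ValueError("failure_at must lie within the task's run time")
    if checkpoint_interval is None:
        # No checkpointing: all work done before the failure is lost.
        saved = 0
    else:
        # Work completed up to the last checkpoint before the failure survives.
        saved = (failure_at // checkpoint_interval) * checkpoint_interval
    # Time already spent, plus the work that still has to be (re)done.
    return failure_at + (task_length - saved)


# Hypothetical numbers: a 100-unit task, a failure at t=70, a deadline of 120.
deadline = 120
restart = completion_time(100, 70)          # restart from the beginning
resumed = completion_time(100, 70, 20)      # resume from the checkpoint at t=60
print(restart, restart <= deadline)
print(resumed, resumed <= deadline)
```

Under these assumptions, restarting from scratch misses the deadline while resuming from the last checkpoint meets it. This is the situation the abstract describes when it says there is not enough time to start the task again on a new node.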
