A fault tolerent approach in scientific workflow systems based on cloud computing

Fault Tolerance is a configuration that prevents a computer or network device from failing in the event of unexpected problem or error such as hardware failure, link failure, unauthorized access, variations in the configuration of different systems and system running out of memory or disk space. The integration of fault tolerance measures with scheduling gains much importance. Scientific workflows use distributed heterogeneous resources in cloud interface are often hard to program. This paper explains task replication technique and simulation of cloud computing systems.

[1]  Christian Engelmann,et al.  A Proactive Fault Tolerance Framework for High-Performance Computing , 2010 .

[2]  Sameer Singh Chauhan,et al.  QoS Guided Heuristic Algorithms for Grid Task Scheduling , 2010 .

[3]  S. Karthik,et al.  Analyzing interaction between Denial of Service (DoS) attacks and threats , 2008, 2008 International Conference on Computing, Communication and Networking.

[4]  Jong Sou Park,et al.  Increasing System Fault Tolerance with Software Rejuvenation in E-government System , 2010 .

[5]  S. Kannan,et al.  Ant Colony Optimization for Routing in Mobile Ad-Hoc Networks , 2010 .

[6]  V. Rhymend Uthariaraj,et al.  A Minimum Time to Release Job Scheduling Algorithm in Computational Grid Environment , 2009, 2009 Fifth International Joint Conference on INC, IMS and IDC.

[7]  Carl Kesselman,et al.  Optimizing Grid-Based Workflow Execution , 2005, Journal of Grid Computing.

[8]  S. Thamarai Selvi,et al.  Scheduling in Virtualized Grid Environment Using Hybrid Approach , 2010, Grid 2010.

[9]  Radu Prodan,et al.  A New Fault Tolerance Heuristic for Scientific Workflows in Highly Distributed Environments Based on Resubmission Impact , 2009, 2009 Fifth IEEE International Conference on e-Science.

[10]  Wan-Jun Yu,et al.  Ant colony optimization for routing in mobile ad hoc networks , 2008, 2008 International Conference on Machine Learning and Cybernetics.

[11]  Valentin Cristea,et al.  Fault Tolerance and Recovery in Grid Workflow Management Systems , 2010, 2010 International Conference on Complex, Intelligent and Software Intensive Systems.

[12]  Bharadwaj Veeravalli,et al.  Fault-tolerant scheduling for differentiated classes of tasks with low replication cost in computational grids , 2007, HPDC '07.

[13]  Yang Zhang,et al.  Combined Fault Tolerance and Scheduling Techniques for Workflow Applications on Computational Grids , 2009, 2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid.

[14]  T. Maragatham,et al.  A Study of Attacks, Attack Detection and Prevention Methods in Proactive and Reactive Routing Protocols , 2011 .

[15]  Gregor von Laszewski,et al.  QoS guided Min-Min heuristic for grid task scheduling , 2003, Journal of Computer Science and Technology.

[16]  Ivan Rodero,et al.  Evaluation of Coordinated Grid Scheduling Strategies , 2009, 2009 11th IEEE International Conference on High Performance Computing and Communications.

[17]  Louise E. Moser,et al.  Fault Tolerance Middleware for Cloud Computing , 2010, 2010 IEEE 3rd International Conference on Cloud Computing.

[18]  Anne H. H. Ngu,et al.  Enabling ScientificWorkflow Reuse through Structured Composition of Dataflow and Control-Flow , 2006, 22nd International Conference on Data Engineering Workshops (ICDEW'06).

[19]  Rajkumar Buyya,et al.  A Taxonomy of Workflow Management Systems for Grid Computing , 2005, Proceedings of the 38th Annual Hawaii International Conference on System Sciences.

[20]  T. Ravichandran,et al.  Multi Directional Geographical Traceback with n Directions Generalization , 2008 .