Fault Tolerance in Cloud Computing

Cloud Computing is an important computational paradigm which provide on-demand services to users and in low cost. Fault tolerance and reliability are of great importance that provides correct result even in the presence of faulty components. Most of the systems are safety critical and highly reliable. So to achieve reliability in real time computing, the demand for fault tolerance is increased. In real time computing, the capabilities of intensive computing can be an advantage to execute real time tasks. In most of the applications of real time cloud, processing is done on remote cloud computing nodes. Therefore, due to loose control over the computing node, chances of errors increase. Fault tolerance techniques are used to predict these failures and take an appropriate action before failures actually occur. So to achieve reliability in real time computing, the requirement for fault tolerance increases. The reliability of virtual machines changes after every computing cycle i.e. it is adaptive in nature. In this paper, we introduce an unusual, system level aspect on the creation and management of fault tolerance in clouds. A complete high level approach is introduced to execute the fault tolerant techniques by

[1]  Vincenzo Piuri,et al.  Fault Tolerance Management in Cloud Computing: A System-Level Perspective , 2013, IEEE Systems Journal.

[2]  Randy H. Katz,et al.  Above the Clouds: A Berkeley View of Cloud Computing , 2009 .

[3]  Fabrice Huet,et al.  Adaptive Fault Tolerance in Real Time Cloud Computing , 2011, 2011 IEEE World Congress on Services.

[4]  Chuang Lin,et al.  Performance, Fault-Tolerance and Scalability Analysis of Virtual Infrastructure Management System , 2009, 2009 IEEE International Symposium on Parallel and Distributed Processing with Applications.

[5]  Rachid Guerraoui,et al.  Independent faults in the cloud , 2010, LADIS '10.

[6]  Louise E. Moser,et al.  Fault Tolerance Middleware for Cloud Computing , 2010, 2010 IEEE 3rd International Conference on Cloud Computing.

[7]  Guilherme Piegas Koslovski,et al.  Reliablility Support in Virtual Infrastructures , 2019 .

[8]  Matti A. Hiltunen,et al.  An approach to constructing modular fault-tolerant protocols , 1993, Proceedings of 1993 IEEE 12th Symposium on Reliable Distributed Systems.

[9]  Sheheryar Malik,et al.  Time Stamped Fault Tolerance in Distributed Real Time Systems , 2005 .

[10]  S. Sudha Lakshmi,et al.  Fault Tolerance in Cloud Computing , 2013 .

[11]  Christian Engelmann,et al.  A Framework for Proactive Fault Tolerance , 2008, 2008 Third International Conference on Availability, Reliability and Security.