An autonomic approach for fault tolerance using scaling, replication and monitoring in cloud computing

Cloud based systems are more popular in today's world but fault tolerance in cloud is a gigantic challenge, as it affects the reliability and availability for the end users. A number of tools have been deployed to minimize the impact of faults. A fault tolerable system ensures to perform continuous operation and produce correct results even after the failure of components up to some extent. More over huge amount of data in the cloud cannot monitor manually by the administrator. Automated tools, dynamic deploying of more servers are the basic requirements of the todays cloud system in order to handle unexpected traffic spikes in the network. This proposed work introduces an autonomic prospective on managing the fault tolerance which ensure scalability, reliability and availability. HAProxy has been used to provide scaling to the web servers for load balancing in proactive manner. It also monitors the web servers for fault prevention at the user level. Our framework works with autonomic mirroring and load balancing of data in database servers using MySQL master- master replication and Nginx respectively. Administrator keeps an eye on working of servers through Nagios tool 24×7 monitoring can't be done manually by the service provider. The proposed work has been implemented in the cloud virtualization environment. Experimental results show that our framework can deal with fault tolerance very effectively.

[1]  Louise E. Moser,et al.  Fault Tolerance Middleware for Cloud Computing , 2010, 2010 IEEE 3rd International Conference on Cloud Computing.

[2]  V. Piuri,et al.  A comprehensive conceptual system-level approach to fault tolerance in Cloud Computing , 2012, 2012 IEEE International Systems Conference SysCon 2012.

[3]  T. Ravichandran,et al.  Pre-emptive scheduling of on-line real time services with task migration for cloud computing , 2013, 2013 International Conference on Pattern Recognition, Informatics and Mobile Engineering.

[4]  Fabrice Huet,et al.  Adaptive Fault Tolerance in Real Time Cloud Computing , 2011, 2011 IEEE World Congress on Services.

[5]  Amal Ganesh,et al.  A study on fault tolerance methods in Cloud Computing , 2014, 2014 IEEE International Advance Computing Conference (IACC).

[6]  Yuan-Shun Dai,et al.  Self-healing and Hybrid Diagnosis in Cloud Computing , 2009, CloudCom.

[7]  Inderveer Chana,et al.  Fault Tolerance- Challenges, Techniques and Implementation in Cloud Computing , 2012 .

[8]  Laurent Broto,et al.  Approaches to cloud computing fault tolerance , 2012, 2012 International Conference on Computer, Information and Telecommunication Systems (CITS).

[9]  Pabitra Mohan Khilar,et al.  VFT: A virtualization and fault tolerance approach for cloud computing , 2013, 2013 IEEE CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGIES.

[10]  Antonio Puliafito,et al.  Workload-Based Software Rejuvenation in Cloud Systems , 2013, IEEE Transactions on Computers.