A Cost and Contention Conscious Scheduling for Recovery in Cloud Environment

Cloud Computing (CC) model plays an important role for the growth of contemporary IT industry where stability, availability and partition tolerance of computational resources mean a great deal. It is of utmost significance that not only cloud services are to be provided with satisfactory performance but also they are able to minimize and resiliently recover from potential damages when cloud infrastructures are subject to changes and/or disasters. This study discusses a method that results in potentially better resiliency and faster recovery from failures based on the well-known genetic algorithm. Moreover, we aim to achieve a globally optimized performance as well as a service solution that can remain financially and operationally balanced according to customer preferences. The proposed methodology has undergone various and intensive evaluations to be proclaimed of their effectiveness and efficiency, even when put under tight comparison with other existing work of relevant aspects.

[1]  Oliver Sinnen,et al.  Task Scheduling for Parallel Systems , 2007, Wiley series on parallel and distributed computing.

[2]  Leonel Sousa,et al.  Communication contention in task scheduling , 2005, IEEE Transactions on Parallel and Distributed Systems.

[3]  Kashi Venkatesh Vishwanath,et al.  Characterizing cloud computing hardware reliability , 2010, SoCC '10.

[4]  Naoki Shibata,et al.  Task Scheduling Algorithm for Multicore Processor System for Minimizing Recovery Time in Case of Single Node Fault , 2012, 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012).

[5]  Albert Y. Zomaya,et al.  A Novel State Transition Method for Metaheuristic-Based Scheduling in Heterogeneous Computing Systems , 2008, IEEE Transactions on Parallel and Distributed Systems.

[6]  Eui-nam Huh,et al.  A Thin-Thick Client Collaboration for Optimizing Task Scheduling in Mobile Cloud Computing , 2013, 2013 International Conference on IT Convergence and Security (ICITCS).

[7]  E. Bartholomy,et al.  The need to move toward virtualized and more resilient disaster-recovery architectures , 2013, IBM J. Res. Dev..

[8]  Kun-Lung Wu,et al.  SODA: An Optimizing Scheduler for Large-Scale Stream-Based Distributed Computer Systems , 2008, Middleware.

[9]  Fan Ye,et al.  A Hybrid Approach to High Availability in Stream Processing Systems , 2010, 2010 IEEE 30th International Conference on Distributed Computing Systems.

[10]  Benjamin J. C. Yuan,et al.  Key consideration factors of adopting cloud computing for science , 2012, 4th IEEE International Conference on Cloud Computing Technology and Science Proceedings.

[11]  Rajkumar Buyya,et al.  CloudSim: A Novel Framework for Modeling and Simulation of Cloud Computing Infrastructures and Services , 2009, ArXiv.

[12]  Oliver Sinnen,et al.  Task Scheduling for Parallel Systems (Wiley Series on Parallel and Distributed Computing) , 2007 .

[13]  R. Srikant,et al.  Stochastic models of load balancing and scheduling in cloud computing clusters , 2012, 2012 Proceedings IEEE INFOCOM.