High availability in clouds: systematic review and research challenges

Cloud Computing has been used by different types of clients because it has many advantages, including the minimization of infrastructure resources costs, and its elasticity property, which allows services to be scaled up or down according to the current demand. From the Cloud provider point-of-view, there are many challenges to be overcome in order to deliver Cloud services that meet all requirements defined in Service Level Agreements (SLAs). High availability has been one of the biggest challenges for providers, and many services can be used to improve the availability of a service, such as checkpointing, load balancing, and redundancy. Beyond services, we can also find infrastructure and middleware solutions. This systematic review has as its main goal to present and discuss high available (HA) solutions for Cloud Computing, and to introduce some research challenges in this area. We hope this work can be used as a starting point to understanding and coping with HA problems in Cloud.

[1]  Kyoungho An,et al.  A cloud middleware for assuring performance and high availability of soft real-time applications , 2014, J. Syst. Archit..

[2]  Paul Marshall,et al.  Elastic Site: Using Clouds to Elastically Extend Site Resources , 2010, 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing.

[3]  Parisa Heidari,et al.  Integrating Open SAF High Availability Solution with Open Stack , 2015, 2015 IEEE World Congress on Services.

[4]  Albert G. Greenberg,et al.  Ananta: cloud scale load balancing , 2013, SIGCOMM.

[5]  Maria Toeroe,et al.  Service Availability: Principles and Practice , 2012 .

[6]  Kazuhiko Kato,et al.  An Extensible Cloud Platform Inspired by Operating Systems , 2011, 2011 Fourth IEEE International Conference on Utility and Cloud Computing.

[7]  Neha Pawar,et al.  Managing application level elasticity and availability , 2014, 10th International Conference on Network and Service Management (CNSM) and Workshop.

[8]  Yves Lemieux,et al.  Achieving High Availability at the Application Level in the Cloud , 2013, 2013 IEEE Sixth International Conference on Cloud Computing.

[9]  Kishor S. Trivedi,et al.  Software aging in the eucalyptus cloud computing infrastructure , 2014, ACM J. Emerg. Technol. Comput. Syst..

[10]  Dutch T. Meyer,et al.  Remus: High Availability via Asynchronous Virtual Machine Replication. (Best Paper) , 2008, NSDI.

[11]  Boon-Yaik Ooi,et al.  Dynamic service placement and replication framework to enhance service availability using team formation algorithm , 2012, J. Syst. Softw..

[12]  M. Eltoweissy,et al.  Issues and challenges , 2019, Justice for Children in the Context of Counter-Terrorism.

[13]  Judith Kelner,et al.  Risk-based model for availability estimation of SAF redundancy models , 2016, 2016 IEEE Symposium on Computers and Communication (ISCC).

[14]  Asif Imran,et al.  Cloud-Niagara: A high availability and low overhead fault tolerance middleware for the cloud , 2014, 16th Int'l Conf. Computer and Information Technology.

[15]  Judith Kelner,et al.  Resource Allocation in Clouds: Concepts, Tools and Research Challenges , 2011 .

[16]  Marc Frîncu,et al.  Scheduling highly available applications on cloud environments , 2014, Future Gener. Comput. Syst..

[17]  B. P. S. Sahoo,et al.  Cloud Computing Features, Issues, and Challenges: A Big Picture , 2015, 2015 International Conference on Computational Intelligence and Networks.

[18]  Rami Bahsoon,et al.  Scalable service-oriented replication with flexible consistency guarantee in the cloud , 2014, Inf. Sci..

[19]  Maria Toeroe,et al.  Towards an Evaluation Framework for Availability Solutions in the Cloud , 2014, 2014 IEEE International Symposium on Software Reliability Engineering Workshops.

[20]  Djamel Fawzi Hadj Sadok,et al.  KVM virtualization impact on active round-trip time measurements , 2015, 2015 IFIP/IEEE International Symposium on Integrated Network Management (IM).

[21]  Emanuel Ferreira Coutinho,et al.  Elasticity in cloud computing: a survey , 2014, annals of telecommunications - annales des télécommunications.

[22]  Hoi Chan,et al.  An approach to high availability for cloud servers with snapshot mechanism , 2012, MIDDLEWARE '12.

[23]  Gang Huang,et al.  Model-based high availability configuration framework for cloud , 2013, MDS '13.

[24]  Aleksandar Dimov,et al.  Software availability in the cloud , 2013, CompSysTech '13.

[25]  Raouf Boutaba,et al.  Cloud Architectures, Networks, Services, and Management , 2015 .

[26]  Ricardo Jiménez-Peris,et al.  Elastic SI-Cache: consistent and scalable caching in multi-tier architectures , 2011, The VLDB Journal.

[27]  Johannes Behl,et al.  Providing fault-tolerant execution of web-service-based workflows within clouds , 2012, CloudCP '12.

[28]  Alysson Bessani,et al.  The TClouds platform: concept, architecture and instantiations , 2013, DISCCO '13.

[29]  Tharam S. Dillon,et al.  Cloud Computing: Issues and Challenges , 2010, 2010 24th IEEE International Conference on Advanced Information Networking and Applications.

[30]  Anees Shaikh,et al.  Are clouds ready for large distributed applications? , 2010, OPSR.

[31]  Franck Cappello,et al.  BlobCR: Virtual disk based checkpoint-restart for HPC applications on IaaS clouds , 2013, J. Parallel Distributed Comput..

[32]  J. Singh,et al.  High Availability of Clouds: Failover Strategies for Cloud Computing Using Integrated Checkpointing Algorithms , 2012, 2012 International Conference on Communication Systems and Network Technologies.

[33]  M. Anand,et al.  Always On: Architecture for High Availability Cloud Applications , 2012, 2012 IEEE International Conference on Cloud Computing in Emerging Markets (CCEM).