Disaster Recovery in Cloud Computing: A Survey

Disaster recovery is a persistent problem in IT platforms. It is even more critical in cloud computing, because Cloud Service Providers (CSPs) must keep delivering services to their customers even when a data center is down because of a disaster. In the past few years, researchers have shown interest in disaster recovery using cloud computing, and a considerable amount of literature has been published in this area. However, to the best of our knowledge, no survey yet offers a detailed analysis of cloud-based disaster recovery. To fill this gap, this paper provides an extensive survey of disaster recovery concepts and research in cloud environments. We present taxonomies of disaster recovery mechanisms, the main challenges, and proposed solutions. We also describe cloud-based disaster recovery platforms and identify open issues related to disaster recovery.
