Towards a Big Data system disaster recovery in a Private Cloud

Disaster recovery (DR) plays a vital role in restoring the organization's data in the case of emergency and hazardous accidents. While many papers in security focus on privacy and security technologies, few address the DR process, particularly for a Big Data system. However, all these studies that have investigated DR methods belong to the single-basket approach, which means there is only one destination from which to secure the restored data, and mostly use only one type of technology implementation. We propose a multi-purpose approach, which allows data to be restored to multiple sites with multiple methods to ensure the organization recovers a very high percentage of data close to 100%, with all sites in London, Southampton and Leeds data recovered. The traditional TCP/IP baseline, snapshot and replication are used with their system design and development explained. We compare performance between different approaches and multi-purpose approach stands out in the event of emergency. Data at all sites in London, Southampton and Leeds can be restored and updated simultaneously. Results show that optimize command can recover 1TB of data within 650 s and command for three sites can recover 1 TB of data within 1360 s. All data backup and recovery has failure rate of 1.6% and below. All the data centers should adopt multi-purpose approaches to ensure all the data in the Big Data system can be recovered and retrieved without experiencing a prolong downtime and complex recovery processes. We make recommendations for adopting multi-purpose approach for data centers, and demonstrate that 100% of data is fully recovered with low execution time at all sites during a hazardous event as described in the paper.

[1]  Victor Chang Cloud Bioinformatics in a Private Cloud Deployment , 2013 .

[2]  Alexander S. Szalay,et al.  GrayWulf: Scalable Clustered Architecture for Data Intensive Computing , 2009, 2009 42nd Hawaii International Conference on System Sciences.

[3]  Divyakant Agrawal,et al.  Big data and cloud computing: current state and future opportunities , 2011, EDBT/ICDT '11.

[4]  Balachandra Reddy Kandukuri,et al.  Cloud Security Issues , 2009, 2009 IEEE International Conference on Services Computing.

[5]  Victor Chang,et al.  A proposed model to analyse risk and return for Cloud adoption , 2014 .

[6]  C. L. Philip Chen,et al.  Data-intensive applications, challenges, techniques and technologies: A survey on Big Data , 2014, Inf. Sci..

[7]  Ian Sommerville,et al.  Cloud Migration: A Case Study of Migrating an Enterprise IT System to IaaS , 2010, 2010 IEEE 3rd International Conference on Cloud Computing.

[8]  Randy H. Katz,et al.  A view of cloud computing , 2010, CACM.

[9]  Victor I. Chang,et al.  Cloud Storage and Bioinformatics in a Private Cloud Deployment: Lessons for Data Intensive Research , 2012, CLOSER.

[10]  Prashant J. Shenoy,et al.  PipeCloud: using causality to overcome speed-of-light delays in cloud-based disaster recovery , 2011, SOCC '11.

[11]  K. M. Annervaz,et al.  Multi-site data distribution for disaster recovery - A planning framework , 2014, Future Gener. Comput. Syst..

[12]  Arun Venkataramani,et al.  Disaster Recovery as a Cloud Service: Economic Benefits & Deployment Challenges , 2010, HotCloud.

[13]  Charlotte J. Hiatt A Primer for Disaster Recovery Planning in an IT Environment , 1999 .

[14]  Susan Snedaker,et al.  Business Continuity and Disaster Recovery Planning for IT Professionals , 2007 .

[15]  Jon G. Elerath Hard Disk Drives: The Good, the Bad and the Ugly! , 2007, ACM Queue.

[16]  Veda C. Storey,et al.  Business Intelligence and Analytics: From Big Data to Big Impact , 2012, MIS Q..

[17]  Thomas Kirste,et al.  Design Challenges for an Integrated Disaster Management Communication and Information System , 2002 .

[18]  Victor I. Chang,et al.  The Business Intelligence as a Service in the Cloud , 2014, Future Gener. Comput. Syst..

[19]  Paul T. Jaeger,et al.  Cloud Computing and Information Policy: Computing in a Policy Cloud? , 2008 .

[20]  V. Kavitha,et al.  A survey on security issues in service delivery models of cloud computing , 2011, J. Netw. Comput. Appl..

[21]  Joel J. P. C. Rodrigues Advancing Medical Practice through Technology: Applications for Healthcare Delivery, Management, and Quality , 2013 .

[22]  M. Anusha,et al.  Big Data-Survey , 2016 .