Integrated Resiliency Planning in Storage Clouds

Storage clouds use economies of scale to host data for diverse enterprises. However, enterprises differ in the requirements for their data. In this work, we investigate the problem of resiliency or disaster recovery (DR) planning in a storage cloud. The resiliency requirements vary greatly between different enterprises and also between different datasets for the same enterprise. We present in this paper Resilient Storage Cloud Map (RSCMap), a generic cost-minimizing optimization framework for disaster recovery planning, where the cost function may be tailored to meet diverse objectives. We present fast algorithms that come up with a minimum cost DR plan, while meeting all the DR requirements associated with all the datasets hosted on the storage cloud. Our algorithms have strong theoretical properties: 2 factor approximation for bandwidth minimization and fixed parameter constant approximation for the general cost minimization problem. We perform a comprehensive experimental evaluation of RSCMap using models for a wide variety of replication solutions and show that RSCMap outperforms existing resiliency planning approaches.

[1]  Arun Venkataramani,et al.  Disaster Recovery as a Cloud Service: Economic Benefits & Deployment Challenges , 2010, HotCloud.

[2]  Prashant J. Shenoy,et al.  PipeCloud: using causality to overcome speed-of-light delays in cloud-based disaster recovery , 2011, SOCC '11.

[3]  Akshat Verma,et al.  SWEEPER: An Efficient Disaster Recovery Point Identification Mechanism , 2008, FAST.

[4]  Steve R. Kleiman,et al.  SnapMirror: File-System-Based Asynchronous Mirroring for Disaster Recovery , 2002, FAST.

[5]  Akshat Verma,et al.  12MAP: Cloud Disaster Recovery Based on Image-Instance Mapping , 2013, Middleware.

[6]  G. Dantzig Discrete-Variable Extremum Problems , 1957 .

[7]  Dirk Beyer,et al.  On the road to recovery: restoring data after disasters , 2006, EuroSys '06.

[8]  Yuan Yu,et al.  Dryad: distributed data-parallel programs from sequential building blocks , 2007, EuroSys '07.

[9]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[10]  Jon Howell,et al.  Flat Datacenter Storage , 2012, OSDI.

[11]  Dutch T. Meyer,et al.  Remus: High Availability via Asynchronous Virtual Machine Replication. (Best Paper) , 2008, NSDI.

[12]  Akshat Verma,et al.  End-to-end disaster recovery planning: From art to science , 2010, 2010 IEEE Network Operations and Management Symposium - NOMS 2010.

[13]  William H. Sanders,et al.  Designing dependable storage solutions for shared application environments , 2006, International Conference on Dependable Systems and Networks (DSN'06).

[14]  Kimberly Keeton,et al.  A framework for evaluating storage system dependability , 2004, International Conference on Dependable Systems and Networks, 2004.

[15]  Dirk Beyer,et al.  Designing for Disasters , 2004, FAST.

[16]  Lakshmi Ganesh,et al.  Smoke and Mirrors: Reflecting Files at a Geographically Remote Location Without Loss of Performance , 2009, FAST.

[17]  John Wilkes,et al.  Seneca: remote mirroring done write , 2003, USENIX Annual Technical Conference, General Track.

[18]  Alain Azagury Point-in-Time Copy: Yesterday, Today and Tomorrow , 2002 .