Cost-optimized redundant data storage in the cloud

The use of cloud-based storage systems for storing data is a popular alternative to local storage systems. Beside several benefits of cloud-based storages, there are also downsides like vendor lock-in or unavailability. Moreover, the selection of the best fitting storage solution can be a tedious and cumbersome task and the storage requirements may change over time. In this paper, we formulate a system model that uses multiple cloud-based services to realize a redundant and cost-efficient storage. Within this system model, we formulate a local and a global optimization problem that considers historical data access information and predefined quality of service requirements to select a cost-efficient storage solution. Furthermore, we present a heuristic optimization approach for the global optimization. Extensive evaluations show the benefits of our work in comparison with a baseline that follows a state-of-the-art approach. We show that our solutions save up to 30% of the cumulative cost in comparison with the baseline.

[1]  Chia-Wei Chang,et al.  Probability-Based Cloud Storage Providers Selection Algorithms with Maximum Availability , 2012, 2012 41st International Conference on Parallel Processing.

[2]  Randy H. Katz,et al.  Above the Clouds: A Berkeley View of Cloud Computing , 2009 .

[3]  Yafei Dai,et al.  CHARM: A Cost-Efficient Multi-Cloud Data Hosting Scheme with High Availability , 2015, IEEE Transactions on Cloud Computing.

[4]  Mulya,et al.  CHARM : A Cost-efficient Multi-cloud Data Hosting Scheme with High Availability , 2016 .

[5]  David Bermbach,et al.  Cloud Federation: Effects of Federated Compute Resources on Quality of Service and Cost* , 2013, 2013 IEEE International Conference on Cloud Engineering (IC2E).

[6]  Schahram Dustdar,et al.  Winds of Change: From Vendor Lock-In to the Meta Cloud , 2013, IEEE Internet Computing.

[7]  Loretta Mastroeni,et al.  Cloud storage pricing: a comparison of current practices , 2013, ICPE 2013.

[8]  Marko Vukolic,et al.  Dissecting UbuntuOne: Autopsy of a Global-scale Personal Cloud Back-end , 2015, Internet Measurement Conference.

[9]  Elizabeth Chang,et al.  Conceptual SLA framework for cloud computing , 2010, 4th IEEE International Conference on Digital Ecosystems and Technologies.

[10]  Philipp Waibel,et al.  Cost-Efficient Data Redundancy in the Cloud , 2016, 2016 IEEE 9th International Conference on Service-Oriented Computing and Applications (SOCA).

[11]  Carey L. Williamson,et al.  Temporal locality and its impact on Web proxy cache performance , 2000, Perform. Evaluation.

[12]  Rajkumar Buyya,et al.  Brokering Algorithms for Optimizing the Availability and Cost of Cloud Storage Services , 2013, 2013 IEEE 5th International Conference on Cloud Computing Technology and Science.

[13]  David Bermbach,et al.  MetaStorage: A Federated Cloud Storage System to Manage Consistency-Latency Tradeoffs , 2011, 2011 IEEE 4th International Conference on Cloud Computing.

[14]  Karl Aberer,et al.  Scalia: An adaptive scheme for efficient multi-cloud storage , 2012, 2012 International Conference for High Performance Computing, Networking, Storage and Analysis.

[15]  Ari Juels,et al.  HAIL: a high-availability and integrity layer for cloud storage , 2009, CCS.

[16]  Arumugam Seetharaman,et al.  The usage and adoption of cloud computing by small and medium businesses , 2013, Int. J. Inf. Manag..

[17]  Andreas Haeberlen,et al.  Efficient Replica Maintenance for Distributed Storage Systems , 2006, NSDI.

[18]  James S. Plank,et al.  Erasure Codes for Storage Systems: A Brief Primer , 2013, login Usenix Mag..

[19]  Massimo Villari,et al.  Data On-Boarding in Federated Storage Clouds , 2013, 2013 IEEE Sixth International Conference on Cloud Computing.

[20]  John Kubiatowicz,et al.  Erasure Coding Vs. Replication: A Quantitative Comparison , 2002, IPTPS.

[21]  Rodrigo Rodrigues,et al.  High Availability in DHTs: Erasure Coding vs. Replication , 2005, IPTPS.

[22]  Randy H. Katz,et al.  A view of cloud computing , 2010, CACM.

[23]  Yih-Farn Robin Chen,et al.  The Growing Pains of Cloud Storage , 2015, IEEE Internet Comput..

[24]  Hakim Weatherspoon,et al.  RACS: a case for cloud storage diversity , 2010, SoCC '10.

[25]  David R. Karger,et al.  Wide-area cooperative storage with CFS , 2001, SOSP.

[26]  Christoph Meinel,et al.  Applying Erasure Codes for Fault Tolerance in Cloud-RAID , 2013, 2013 IEEE 16th International Conference on Computational Science and Engineering.