To Transfer or Not: An Online Cost Optimization Algorithm for Using Two-Tier Storage-as-a-Service Clouds

Currently, Storage-as-a-Service (StaaS) clouds offer multiple data storage and access pricing options which usually consist of hot and cold tiers. The cold tier storage option offers a lower storage price while the hot tier storage option offers a lower access price. Cloud users need to choose an optimal tier to store their data objects economically based on the frequency of accesses to their data objects. Besides, StaaS cloud users can transfer data objects between these two tiers to save cost according to the varying frequency of accesses to their data objects. Therefore, in order to make optimal transferring decisions, future access curves are needed to be predicted. However, for cloud users, it is difficult to precisely predict future access frequencies for their data objects. In this paper, we propose an online algorithm to guide StaaS cloud users in making decisions on whether and when to transfer their data objects between cold and hot tiers for achieving cost optimizations, while users do not need to have any prior knowledge of future access frequencies. We prove theoretically that the proposed online algorithm can achieve guaranteed competitive ratios for data objects stored in a two-tier StaaS cloud. Finally, through extensive experiments, we validate the effectiveness of our proposed online algorithm and show that it can save costs significantly compared with always keeping data objects in one tier or always transferring data objects from one tier to the other when their access frequencies begin to vary.

[1]  Hakim Weatherspoon,et al.  RACS: a case for cloud storage diversity , 2010, SoCC '10.

[2]  Abdelkarim Erradi,et al.  Cost Optimization Algorithms for Hot and Cool Tiers Cloud Storage Services , 2018, 2018 IEEE 11th International Conference on Cloud Computing (CLOUD).

[3]  Nicola Blefari-Melazzi,et al.  An Approach to Balance Maintenance Costs and Electricity Consumption in Cloud Data Centers , 2018, IEEE Transactions on Sustainable Computing.

[4]  Rajkumar Buyya,et al.  SLA-based virtual machine management for heterogeneous workloads in a cloud datacenter , 2014, J. Netw. Comput. Appl..

[5]  M. Shamim Hossain,et al.  Edge-CoCaCo: Toward Joint Optimization of Computation, Caching, and Communication on Edge Cloud , 2018, IEEE Wireless Communications.

[6]  Jun Li,et al.  Optimizing Cost for Online Social Networks on Geo-Distributed Clouds , 2016, IEEE/ACM Transactions on Networking.

[7]  David A. Maltz,et al.  Cloudward bound: planning for beneficial migration of enterprise applications to the cloud , 2010, SIGCOMM 2010.

[8]  Ethan Katz-Bassett,et al.  SPANStore: cost-effective geo-replicated storage spanning multiple cloud services , 2013, SOSP.

[9]  Rajkumar Buyya,et al.  To move or not to move: Cost optimization in a dual cloud-based storage architecture , 2016, J. Netw. Comput. Appl..

[10]  Katherine Guo,et al.  Intra-cloud lightning: Building CDNs in the cloud , 2012, 2012 Proceedings IEEE INFOCOM.

[11]  Xiao Liu,et al.  A Revised Discrete Particle Swarm Optimization for Cloud Workflow Scheduling , 2010, 2010 International Conference on Computational Intelligence and Security.

[12]  Zahir Tari,et al.  MetaCDN: Harnessing 'Storage Clouds' for high performance content delivery , 2009, J. Netw. Comput. Appl..

[13]  Rajkumar Buyya,et al.  Data Storage Management in Cloud Environments , 2017, ACM Comput. Surv..

[14]  Anand Sivasubramaniam,et al.  To Move or Not to Move: The Economics of Cloud Computing , 2011, HotCloud.

[15]  Guangwen Yang,et al.  Understanding Data Characteristics and Access Patterns in a Cloud Storage System , 2013, 2013 13th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing.

[16]  Bu-Sung Lee,et al.  Optimization of Resource Provisioning Cost in Cloud Computing , 2012, IEEE Transactions on Services Computing.

[17]  Daniel J. Abadi,et al.  Data Management in the Cloud: Limitations and Opportunities , 2009, IEEE Data Eng. Bull..

[18]  Randy H. Katz,et al.  Above the Clouds: A Berkeley View of Cloud Computing , 2009 .

[19]  Arun Venkataramani,et al.  Disaster Recovery as a Cloud Service: Economic Benefits & Deployment Challenges , 2010, HotCloud.

[20]  Anna R. Karlin,et al.  Competitive randomized algorithms for non-uniform problems , 1990, SODA '90.

[21]  Symeon Papavassiliou,et al.  A Cloud-Oriented Content Delivery Network Paradigm: Modeling and Assessment , 2013, IEEE Transactions on Dependable and Secure Computing.

[22]  Rudolf Fleischer On The Bahncard Problem , 1998, COCOON.

[23]  Miguel Correia,et al.  DepSky: Dependable and Secure Storage in a Cloud-of-Clouds , 2013, TOS.