Online cost optimization algorithms for tiered cloud storage services

Abstract The new generation multi-tiered cloud storage services offer various tiers, such as hot and cool tiers, which are characterized by differentiated Quality of Service (QoS) (i.e., access latency, availability and throughput) and the corresponding storage and access costs. However, selecting among these storage tiers to efficiently manage data and improve performance at reduced cost is still a core and difficult problem. In this paper, we address this problem by developing and evaluating algorithms for automated data placement and movement between hot and cool storage tiers. We propose two practical online object placement algorithms that assume no knowledge of future data access. The first online cost optimization algorithm uses no replication (NR) and initially places the object in the hot tier. Then, based on read/write access pattern following a long tail distribution, it may decide to move the object to the cool tier to optimize the storage service cost. The second algorithm with replication (WR) initially places the object in the cool tier, and then replicates it in the hot tier upon receiving read/write requests to it. Additionally, we analytically demonstrate that the online algorithms incur less than twice the cost in comparison to the optimal offline algorithm that assumes the knowledge of exact future workload on the objects. The experimental results using a Twitter Workload and the CloudSim simulator confirm that the proposed algorithms yield significant cost savings (5%–55%) compared to the no-migration policy which permanently stores data in the hot tier.

[1]  Rajkumar Buyya,et al.  Cost Optimization for Dynamic Replication and Migration of Data in Cloud Data Centers , 2019, IEEE Transactions on Cloud Computing.

[2]  Rui Wang,et al.  Towards social user profiling: unified and discriminative influence model for inferring home locations , 2012, KDD.

[3]  Karl Aberer,et al.  Scalia: An adaptive scheme for efficient multi-cloud storage , 2012, 2012 International Conference for High Performance Computing, Networking, Storage and Analysis.

[4]  Song Jiang,et al.  Synergistic coupling of SSD and hard disk for QoS-aware virtual memory , 2013, 2013 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS).

[5]  Wei Wang,et al.  To Reserve or Not to Reserve: Optimal Online Multi-Instance Acquisition in IaaS Clouds , 2013, ICAC.

[6]  Junfeng Yang,et al.  Grandet: A Unified, Economical Object Store for Web Applications , 2016, SoCC.

[7]  Rajkumar Buyya,et al.  Data Storage Management in Cloud Environments , 2017, ACM Comput. Surv..

[8]  Philipp Waibel,et al.  Cost- and Latency-Efficient Redundant Data Storage in the Cloud , 2017, 2017 IEEE 10th Conference on Service-Oriented Computing and Applications (SOCA).

[9]  Anna R. Karlin,et al.  Competitive randomized algorithms for non-uniform problems , 1990, SODA '90.

[10]  Ronny Hans,et al.  Cost-optimized redundant data storage in the cloud , 2017, Service Oriented Computing and Applications.

[11]  Rajkumar Buyya,et al.  CloudSim: a toolkit for modeling and simulation of cloud computing environments and evaluation of resource provisioning algorithms , 2011, Softw. Pract. Exp..

[12]  Jun Li,et al.  Optimizing Cost for Online Social Networks on Geo-Distributed Clouds , 2016, IEEE/ACM Transactions on Networking.

[13]  Abdelkarim Erradi,et al.  Cost Optimization Algorithms for Hot and Cool Tiers Cloud Storage Services , 2018, 2018 IEEE 11th International Conference on Cloud Computing (CLOUD).

[14]  Ethan Katz-Bassett,et al.  SPANStore: cost-effective geo-replicated storage spanning multiple cloud services , 2013, SOSP.

[15]  Rajkumar Buyya,et al.  To move or not to move: Cost optimization in a dual cloud-based storage architecture , 2016, J. Netw. Comput. Appl..

[16]  Cory Hill,et al.  f4: Facebook's Warm BLOB Storage System , 2014, OSDI.

[17]  Anand Sivasubramaniam,et al.  HybridStore: A Cost-Efficient, High-Performance Storage System Combining SSDs and HDDs , 2011, 2011 IEEE 19th Annual International Symposium on Modelling, Analysis, and Simulation of Computer and Telecommunication Systems.

[18]  Qing Yang,et al.  I-CASH: Intelligently Coupled Array of SSD and HDD , 2011, 2011 IEEE 17th International Symposium on High Performance Computer Architecture.

[19]  Feng Chen,et al.  Hystor: making the best use of solid state drives in high performance storage systems , 2011, ICS '11.

[20]  Murali S. Kodialam,et al.  The constrained Ski-Rental problem and its application to online cloud cost optimization , 2013, 2013 Proceedings IEEE INFOCOM.

[21]  Allan Borodin,et al.  Online computation and competitive analysis , 1998 .

[22]  Murali S. Kodialam,et al.  Frugal storage for cloud file systems , 2012, EuroSys '12.

[23]  Sanjeev Kumar,et al.  Finding a Needle in Haystack: Facebook's Photo Storage , 2010, OSDI.

[24]  Haiying Shen,et al.  Harnessing the Power of Multiple Cloud Service Providers: An Economical and SLA-Guaranteed Cloud Storage Service , 2015, 2015 IEEE 35th International Conference on Distributed Computing Systems.