More IOPS for Less: Exploiting Burstable Storage in Public Clouds

Burstable storage is a public cloud feature that enhances cloud storage volumes with credits that can be used to boost performance temporarily. These credits can be exchanged for increased storage throughput, for a short period of time, and are replenished over time. We examine how burstable storage can be leveraged to reduce cost and/or improve performance for three use cases with different data-longevity requirements: traditional persistent storage, caching, and ephemeral storage. Although cloud storage volumes are typically priced by capacity, we find that each AWS gp2 volume starts with the same number of burst credits. Exploiting that fact, we find that aggressive interchanging of large numbers of small short-term volumes can increase IOPS by up to 100× at a cost increase of only 10–40%. Compared to an AWS io1 volume provisioned for the same performance, such interchanging reduces cost by 97.5%.

[1]  Bronis R. de Supinski,et al.  Design, Modeling, and Evaluation of a Scalable Multi-level Checkpointing System , 2010, 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis.

[2]  Yu-Ju Hong,et al.  Dynamic server provisioning to minimize cost in an IaaS cloud , 2011, PERV.

[3]  M. Balazinska,et al.  An analysis of Hadoop usage in scientific workloads , 2013 .

[4]  George Kesidis,et al.  Exploiting Spot and Burstable Instances for Improving the Cost-efficacy of In-Memory Caches on the Public Cloud , 2017, EuroSys.

[5]  Gregory R. Ganger,et al.  Tributary: spot-dancing for elastic services with latency SLOs , 2018, USENIX ATC.

[6]  Bin Fan,et al.  A Case for Packing and Indexing in Cloud File Systems , 2018, HotCloud.

[7]  Evgenia Smirni,et al.  CEDULE: A Scheduling Framework for Burstable Performance in Cloud Computing , 2018, 2018 IEEE International Conference on Autonomic Computing (ICAC).

[8]  Evgenia Smirni,et al.  It's not a Sprint, it's a Marathon: Stretching Multi-resource Burstable Performance in Public Clouds (industry track) , 2019, Middleware Industry.

[9]  Wei Wang,et al.  MArk: Exploiting Cloud Services for Cost-Effective, SLO-Aware Machine Learning Inference Serving , 2019, USENIX Annual Technical Conference.

[10]  Prateek Sharma,et al.  SpotWeb: Running Latency-sensitive Distributed Web Services on Transient Cloud Servers , 2019, HPDC.

[11]  Bhuvan Urgaonkar,et al.  BurScale: Using Burstable Instances for Cost-Effective Autoscaling in the Public Cloud , 2019, SoCC.

[12]  David Wentzlaff,et al.  Burstable Instances for Clouds: Performance Modeling, Equilibrium Analysis, and Revenue Maximization , 2019, IEEE INFOCOM 2019 - IEEE Conference on Computer Communications.

[13]  Ao Wang,et al.  InfiniCache: Exploiting Ephemeral Serverless Functions to Build a Cost-Effective Memory Cache , 2020, FAST.