Quality-Elasticity: Improved Resource Utilization, Throughput, and Response Times Via Adjusting Output Quality to Current Operating Conditions

This work addresses two related problems for on-line services, namely poor resource utilization during regular operating conditions, and low throughput, long response times, or poor performance under periods of high system load. To address these problems, we introduce our notion of quality-elasticity as a manner of dynamically adapting response qualities from software services along a fine-grained spectrum. When resources are abundant, response quality can be increased, and when resources are scarce, responses are delivered at a lower quality to prioritize throughput and response times. We present an example of how a complex online shopping site can be made quality-elastic. Experiments show that, compared to state of the art, improvements in throughput (57% more served queries), lowered response times (8 time reduction for 95th percentile responses), and an estimated 40% profitability increase can be made using our quality-elastic approach. When resources are abundant, our approach may achieve upwards of twice as high resource utilization as prior work in this field.

[1]  Ion Stoica,et al.  Occupy the cloud: distributed computing for the 99% , 2017, SoCC.

[2]  Christof Fetzer,et al.  ApproxJoin: Approximate Distributed Joins , 2018, SoCC.

[3]  Kartik Hosanagar,et al.  Recommender systems and their impact on sales diversity , 2007, EC '07.

[4]  Kartik Hosanagar,et al.  Blockbuster Culture's Next Rise or Fall: The Impact of Recommender Systems on Sales Diversity , 2007, Manag. Sci..

[5]  Karl-Erik Årzén,et al.  Cloud Application Predictability through Integrated Load-Balancing and Service Time Control , 2018, 2018 IEEE International Conference on Autonomic Computing (ICAC).

[6]  Rajkumar Buyya,et al.  Brownout Approach for Adaptive Management of Resources and Applications in Cloud Computing Systems , 2019, ACM Comput. Surv..

[7]  Indranil Gupta,et al.  Social Trove: A Self-Summarizing Storage Service for Social Sensing , 2015, 2015 IEEE International Conference on Autonomic Computing.

[8]  Richard E. Brown,et al.  United States Data Center Energy Usage Report , 2016 .

[9]  Philipp Leitner,et al.  Patterns in the Chaos—A Study of Performance Variation and Predictability in Public IaaS Clouds , 2014, ACM Trans. Internet Techn..

[10]  Karl-Erik Årzén,et al.  Towards Mission-Critical Control at the Edge and Over 5G , 2018, 2018 IEEE International Conference on Edge Computing (EDGE).

[11]  Wolf-Dietrich Weber,et al.  Power provisioning for a warehouse-sized computer , 2007, ISCA '07.

[12]  Erik Elmroth,et al.  Control-theoretical load-balancing for cloud applications with brownout , 2014, 53rd IEEE Conference on Decision and Control.

[13]  Rajkumar Buyya,et al.  Energy Efficient Scheduling of Cloud Application Components with Brownout , 2016, IEEE Transactions on Sustainable Computing.

[14]  Erik Elmroth,et al.  Control-Based Load-Balancing Techniques: Analysis and Performance Evaluation via a Randomized Optimization Approach , 2016 .

[15]  Erik Elmroth,et al.  Power-aware cloud brownout: Response time and power consumption control , 2017, 2017 IEEE 56th Annual Conference on Decision and Control (CDC).

[16]  Erik Elmroth,et al.  A virtual machine re-packing approach to the horizontal vs. vertical elasticity trade-off for cloud autoscaling , 2013, CAC.

[17]  Johan Tordsson,et al.  Resource Management Challenges for the Infinite Cloud , 2015 .

[18]  Adam Wierman,et al.  Open Versus Closed: A Cautionary Tale , 2006, NSDI.

[19]  Florin Ciucu,et al.  Distributed resource management across process boundaries , 2017, SoCC.

[20]  María S. Pérez-Hernández,et al.  Exploring Energy-Consistency Trade-Offs in Cassandra Cloud Storage System , 2015, 2015 27th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD).

[21]  Thu D. Nguyen,et al.  ApproxHadoop: Bringing Approximations to MapReduce Frameworks , 2015, ASPLOS.

[22]  Ann Mary Joy,et al.  Performance comparison between Linux containers and virtual machines , 2015, 2015 International Conference on Advances in Computer Engineering and Applications.

[23]  Fiona Fui-Hoon Nah,et al.  A study on tolerable waiting time: how long are Web users willing to wait? , 2004, AMCIS.

[24]  Xiaohui Gu,et al.  AGILE: Elastic Distributed Resource Scaling for Infrastructure-as-a-Service , 2013, ICAC.

[25]  Erik Elmroth,et al.  Improving Cloud Service Resilience Using Brownout-Aware Load-Balancing , 2014, 2014 IEEE 33rd International Symposium on Reliable Distributed Systems.

[26]  Christopher Stewart,et al.  Measuring and Managing Answer Quality for Online Data-Intensive Services , 2015, 2015 IEEE International Conference on Autonomic Computing.

[27]  Calton Pu,et al.  When average is not average: large response time fluctuations in n-tier systems , 2012, ICAC '12.

[28]  Luiz André Barroso,et al.  The Case for Energy-Proportional Computing , 2007, Computer.

[29]  Karl-Erik Årzén,et al.  Brownout: building more robust cloud applications , 2014, ICSE.

[30]  Charles Reiss,et al.  Towards understanding heterogeneous clouds at scale : Google trace analysis , 2012 .

[31]  Zhenyu Wen,et al.  ApproxIoT: Approximate Analytics for Edge Computing , 2018, 2018 IEEE 38th International Conference on Distributed Computing Systems (ICDCS).

[32]  Karl-Erik Årzén,et al.  Control strategies for predictable brownouts in cloud computing , 2014 .