Towards resource-efficient cloud systems: Avoiding over-provisioning in demand-prediction based resource provisioning

Demand-prediction based resource provisioning schemes help assure service level objectives (SLO) in cloud systems. We notice that if a provisioning scheme does not exclude bursts from historical resource demands in normal demand prediction or always uses a large padding to correct under-prediction, it will lead to resource over-provisioning and low resource utilization. To improve the previous schemes, in this paper, we present a Resource-efficient Predictive Resource Provisioning system in clouds (RPRP) that excludes bursts in demand prediction and has algorithms to specifically handle bursts to avoid resource over-provisioning. Rather than setting padding to a possibly high value, RPRP has a load-dependent padding algorithm that adaptively determines padding based on predicted demands. To handle bursts, RPRP embodies a responsive padding algorithm that adaptively adjusts padding to recover from both under-provisioning and over-provisioning. We implemented RPRP on top of Xen and conducted both trace-driven simulation and real-world testbed experiments. The experimental results show that RPRP achieves higher resource utilization, more accurate demand predictions, and fewer SLO violations than previous schemes.

[1]  A. Rowstron,et al.  Towards predictable datacenter networks , 2011, SIGCOMM.

[2]  Saikat Guha,et al.  Generalized resource allocation for the cloud , 2012, SoCC '12.

[3]  Ajay Gulati VMware distributed resource Management : design , Implementation , and lessons learned , 2022 .

[4]  Prashant J. Shenoy,et al.  Seagull: Intelligent Cloud Bursting for Enterprise Applications , 2012, USENIX Annual Technical Conference.

[5]  Haiying Shen,et al.  RIAL: Resource Intensity Aware Load balancing in clouds , 2014, IEEE INFOCOM 2014 - IEEE Conference on Computer Communications.

[6]  Haiying Shen,et al.  CORP: Cooperative Opportunistic Resource Provisioning for Short-Lived Jobs in Cloud Systems , 2016, 2016 IEEE International Conference on Cluster Computing (CLUSTER).

[7]  Rajkumar Buyya,et al.  CloudSim: a toolkit for modeling and simulation of cloud computing environments and evaluation of resource provisioning algorithms , 2011, Softw. Pract. Exp..

[8]  Haiying Shen,et al.  Probabilistic demand allocation for cloud service brokerage , 2016, IEEE INFOCOM 2016 - The 35th Annual IEEE International Conference on Computer Communications.

[9]  Yu-Ju Hong,et al.  Dynamic server provisioning to minimize cost in an IaaS cloud , 2011, PERV.

[10]  I. Stoica,et al.  FairCloud: sharing the network in cloud computing , 2011, CCRV.

[11]  Kang G. Shin,et al.  Adaptive control of virtualized resources in utility computing environments , 2007, EuroSys '07.

[12]  Jerome A. Rolia,et al.  Capacity Management and Demand Prediction for Next Generation Data Centers , 2007, IEEE International Conference on Web Services (ICWS 2007).

[13]  Aameek Singh,et al.  Server-storage virtualization: Integration and load balancing in data centers , 2008, 2008 SC - International Conference for High Performance Computing, Networking, Storage and Analysis.

[14]  Haiying Shen,et al.  Distributed Autonomous Virtual Resource Management in Datacenters Using Finite-Markov Decision Process , 2014, IEEE/ACM Transactions on Networking.

[15]  Xiaohui Gu,et al.  CloudScale: elastic resource scaling for multi-tenant cloud systems , 2011, SoCC.

[16]  Haiying Shen,et al.  A Decentralized Network with Fast and Lightweight Autonomous Channel Selection in Vehicle Platoons for Collision Avoidance , 2016, 2016 IEEE 13th International Conference on Mobile Ad Hoc and Sensor Systems (MASS).

[17]  Feng Zhao,et al.  Energy aware consolidation for cloud computing , 2008, CLUSTER 2008.

[18]  Haiying Shen,et al.  Profiling and Understanding Virtualization Overhead in Cloud , 2015, 2015 44th International Conference on Parallel Processing.

[19]  P. Young,et al.  Time series analysis, forecasting and control , 1972, IEEE Transactions on Automatic Control.

[20]  Andrew Hiles Service Level Agreements , 1994 .

[21]  Haiying Shen,et al.  EcoFlow: An Economical and Deadline-Driven Inter-datacenter Video Flow Scheduling System , 2015, 2015 IEEE 35th International Conference on Distributed Computing Systems.

[22]  Xiaohui Gu,et al.  AGILE: Elastic Distributed Resource Scaling for Infrastructure-as-a-Service , 2013, ICAC.

[23]  Haiying Shen,et al.  Discovering the Densest Subgraph in MapReduce for Assortative Big Natural Graphs , 2015, 2015 24th International Conference on Computer Communication and Networks (ICCCN).

[24]  Haiying Shen,et al.  Cache contention aware Virtual Machine placement and migration in cloud datacenters , 2016, 2016 IEEE 24th International Conference on Network Protocols (ICNP).

[25]  Christian Engelmann,et al.  Proactive fault tolerance for HPC with Xen virtualization , 2007, ICS '07.

[26]  Randy H. Katz,et al.  Heterogeneity and dynamicity of clouds at scale: Google trace analysis , 2012, SoCC '12.

[27]  Gwilym M. Jenkins,et al.  Time series analysis, forecasting and control , 1971 .

[28]  Andrzej Kochut,et al.  Dynamic Placement of Virtual Machines for Managing SLA Violations , 2007, 2007 10th IFIP/IEEE International Symposium on Integrated Network Management.

[29]  Haiying Shen,et al.  A time-efficient connected densest subgraph discovery algorithm for big data , 2015, 2015 IEEE International Conference on Networking, Architecture and Storage (NAS).

[30]  Arun Venkataramani,et al.  Black-box and Gray-box Strategies for Virtual Machine Migration , 2007, NSDI.

[31]  Andrew Warfield,et al.  Live migration of virtual machines , 2005, NSDI.

[32]  Brian Kocoloski,et al.  A case for dual stack virtualization: consolidating HPC and commodity applications in the cloud , 2012, SoCC '12.

[33]  Haiying Shen,et al.  TOP: Vehicle Trajectory Based Driving Speed Optimization Strategy for Travel Time Minimization and Road Congestion Avoidance , 2016, 2016 IEEE 13th International Conference on Mobile Ad Hoc and Sensor Systems (MASS).

[34]  Haiying Shen,et al.  Towards green cloud computing: Demand allocation and pricing policies for cloud service brokerage , 2015, IEEE BigData.

[35]  Haiying Shen,et al.  Consolidating complementary VMs with spatial/temporal-awareness in cloud datacenters , 2014, IEEE INFOCOM 2014 - IEEE Conference on Computer Communications.

[36]  Zhenhuan Gong,et al.  PRESS: PRedictive Elastic ReSource Scaling for cloud systems , 2010, 2010 International Conference on Network and Service Management.

[37]  Haiying Shen,et al.  Designing a Hybrid Scale-Up/Out Hadoop Architecture Based on Performance Measurements for High Application Performance , 2015, 2015 44th International Conference on Parallel Processing.

[38]  Prashant J. Shenoy,et al.  Dynamic resource allocation for shared data centers using online measurements , 2003, IWQoS'03.