论文信息 - Towards resource-efficient cloud systems: Avoiding over-provisioning in demand-prediction based resource provisioning

Towards resource-efficient cloud systems: Avoiding over-provisioning in demand-prediction based resource provisioning

Demand-prediction based resource provisioning schemes help assure service level objectives (SLO) in cloud systems. We notice that if a provisioning scheme does not exclude bursts from historical resource demands in normal demand prediction or always uses a large padding to correct under-prediction, it will lead to resource over-provisioning and low resource utilization. To improve the previous schemes, in this paper, we present a Resource-efficient Predictive Resource Provisioning system in clouds (RPRP) that excludes bursts in demand prediction and has algorithms to specifically handle bursts to avoid resource over-provisioning. Rather than setting padding to a possibly high value, RPRP has a load-dependent padding algorithm that adaptively determines padding based on predicted demands. To handle bursts, RPRP embodies a responsive padding algorithm that adaptively adjusts padding to recover from both under-provisioning and over-provisioning. We implemented RPRP on top of Xen and conducted both trace-driven simulation and real-world testbed experiments. The experimental results show that RPRP achieves higher resource utilization, more accurate demand predictions, and fewer SLO violations than previous schemes.

Haiying Shen | Liuhua Chen

[1] A. Rowstron,et al. Towards predictable datacenter networks , 2011, SIGCOMM.

[2] Saikat Guha,et al. Generalized resource allocation for the cloud , 2012, SoCC '12.

[3] Ajay Gulati. VMware distributed resource Management : design , Implementation , and lessons learned , 2022 .

[4] Prashant J. Shenoy,et al. Seagull: Intelligent Cloud Bursting for Enterprise Applications , 2012, USENIX Annual Technical Conference.

[5] Haiying Shen,et al. RIAL: Resource Intensity Aware Load balancing in clouds , 2014, IEEE INFOCOM 2014 - IEEE Conference on Computer Communications.

[6] Haiying Shen,et al. CORP: Cooperative Opportunistic Resource Provisioning for Short-Lived Jobs in Cloud Systems , 2016, 2016 IEEE International Conference on Cluster Computing (CLUSTER).

[7] Rajkumar Buyya,et al. CloudSim: a toolkit for modeling and simulation of cloud computing environments and evaluation of resource provisioning algorithms , 2011, Softw. Pract. Exp..

[8] Haiying Shen,et al. Probabilistic demand allocation for cloud service brokerage , 2016, IEEE INFOCOM 2016 - The 35th Annual IEEE International Conference on Computer Communications.

[9] Yu-Ju Hong,et al. Dynamic server provisioning to minimize cost in an IaaS cloud , 2011, PERV.

[10] I. Stoica,et al. FairCloud: sharing the network in cloud computing , 2011, CCRV.

[11] Kang G. Shin,et al. Adaptive control of virtualized resources in utility computing environments , 2007, EuroSys '07.

[12] Jerome A. Rolia,et al. Capacity Management and Demand Prediction for Next Generation Data Centers , 2007, IEEE International Conference on Web Services (ICWS 2007).

[13] Aameek Singh,et al. Server-storage virtualization: Integration and load balancing in data centers , 2008, 2008 SC - International Conference for High Performance Computing, Networking, Storage and Analysis.

[14] Haiying Shen,et al. Distributed Autonomous Virtual Resource Management in Datacenters Using Finite-Markov Decision Process , 2014, IEEE/ACM Transactions on Networking.

[15] Xiaohui Gu,et al. CloudScale: elastic resource scaling for multi-tenant cloud systems , 2011, SoCC.

[16] Haiying Shen,et al. A Decentralized Network with Fast and Lightweight Autonomous Channel Selection in Vehicle Platoons for Collision Avoidance , 2016, 2016 IEEE 13th International Conference on Mobile Ad Hoc and Sensor Systems (MASS).

[17] Feng Zhao,et al. Energy aware consolidation for cloud computing , 2008, CLUSTER 2008.

[18] Haiying Shen,et al. Profiling and Understanding Virtualization Overhead in Cloud , 2015, 2015 44th International Conference on Parallel Processing.

[19] P. Young,et al. Time series analysis, forecasting and control , 1972, IEEE Transactions on Automatic Control.

[20] Andrew Hiles. Service Level Agreements , 1994 .

[21] Haiying Shen,et al. EcoFlow: An Economical and Deadline-Driven Inter-datacenter Video Flow Scheduling System , 2015, 2015 IEEE 35th International Conference on Distributed Computing Systems.

[22] Xiaohui Gu,et al. AGILE: Elastic Distributed Resource Scaling for Infrastructure-as-a-Service , 2013, ICAC.

[23] Haiying Shen,et al. Discovering the Densest Subgraph in MapReduce for Assortative Big Natural Graphs , 2015, 2015 24th International Conference on Computer Communication and Networks (ICCCN).

[24] Haiying Shen,et al. Cache contention aware Virtual Machine placement and migration in cloud datacenters , 2016, 2016 IEEE 24th International Conference on Network Protocols (ICNP).

[25] Christian Engelmann,et al. Proactive fault tolerance for HPC with Xen virtualization , 2007, ICS '07.

[26] Randy H. Katz,et al. Heterogeneity and dynamicity of clouds at scale: Google trace analysis , 2012, SoCC '12.

[27] Gwilym M. Jenkins,et al. Time series analysis, forecasting and control , 1971 .

[28] Andrzej Kochut,et al. Dynamic Placement of Virtual Machines for Managing SLA Violations , 2007, 2007 10th IFIP/IEEE International Symposium on Integrated Network Management.

[29] Haiying Shen,et al. A time-efficient connected densest subgraph discovery algorithm for big data , 2015, 2015 IEEE International Conference on Networking, Architecture and Storage (NAS).

[30] Arun Venkataramani,et al. Black-box and Gray-box Strategies for Virtual Machine Migration , 2007, NSDI.

[31] Andrew Warfield,et al. Live migration of virtual machines , 2005, NSDI.

[32] Brian Kocoloski,et al. A case for dual stack virtualization: consolidating HPC and commodity applications in the cloud , 2012, SoCC '12.

[33] Haiying Shen,et al. TOP: Vehicle Trajectory Based Driving Speed Optimization Strategy for Travel Time Minimization and Road Congestion Avoidance , 2016, 2016 IEEE 13th International Conference on Mobile Ad Hoc and Sensor Systems (MASS).

[34] Haiying Shen,et al. Towards green cloud computing: Demand allocation and pricing policies for cloud service brokerage , 2015, IEEE BigData.

[35] Haiying Shen,et al. Consolidating complementary VMs with spatial/temporal-awareness in cloud datacenters , 2014, IEEE INFOCOM 2014 - IEEE Conference on Computer Communications.

[36] Zhenhuan Gong,et al. PRESS: PRedictive Elastic ReSource Scaling for cloud systems , 2010, 2010 International Conference on Network and Service Management.

[37] Haiying Shen,et al. Designing a Hybrid Scale-Up/Out Hadoop Architecture Based on Performance Measurements for High Application Performance , 2015, 2015 44th International Conference on Parallel Processing.

[38] Prashant J. Shenoy,et al. Dynamic resource allocation for shared data centers using online measurements , 2003, IWQoS'03.