Resource Demand Misalignment: An Important Factor to Consider for Reducing Resource Over-Provisioning in Cloud Datacenters

Previous resource provisioning strategies in cloud datacenters allocate physical resources to virtual machines (VMs) based on the predicted resource utilization pattern of VMs. The pattern for VMs of a job is usually derived from historical utilizations of multiple VMs of the job. We observed that these utilization curves are usually misaligned in time, which would lead to resource over-prediction and hence over-provisioning. Since this resource utilization misalignment problem has not been revealed and studied before, in this paper, we study the VM resource utilization from public datacenter traces and Hadoop benchmark jobs to verify the commonness of the utilization misalignments. Then, to reduce resource over-provisioning, we propose three VM resource utilization pattern refinement algorithms to improve the original generated pattern by lowering the cap of the pattern, reducing cap provision duration and varying the minimum value of the pattern. We then extend these algorithms to further improve the resource efficiency by considering periodical resource demand patterns that have multiple pulses in a pattern. These algorithms can be used in any resource provisioning strategy that considers predicted resource utilizations of VMs of a job. We then adopt these refinement algorithms in an initial VM allocation mechanism and test them in trace-driven experiments and real-world testbed experiments. The experimental results show that each improved mechanism can increase resource utilization, and reduce the number of PMs needed to satisfy tenant requests. Also, our extended refinement algorithms are effective in improving resource efficiency of the refinement algorithms.

[1]  Arun Venkataramani,et al.  Sandpiper: Black-box and gray-box resource management for virtual machines , 2009, Comput. Networks.

[2]  Haiying Shen,et al.  Performance Measurement on Scale-Up and Scale-Out Hadoop with Remote and Local File Systems , 2016, 2016 IEEE 9th International Conference on Cloud Computing (CLOUD).

[3]  Daniel Grosu,et al.  Truthful Greedy Mechanisms for Dynamic Virtual Machine Provisioning and Allocation in Clouds , 2015, IEEE Transactions on Parallel and Distributed Systems.

[4]  Hai Jin,et al.  Virtual Machine Power Accounting with Shapley Value , 2017, 2017 IEEE 37th International Conference on Distributed Computing Systems (ICDCS).

[5]  Jerome A. Rolia,et al.  Resource pool management: Reactive versus proactive or let's be friends , 2009, Comput. Networks.

[6]  Akshat Verma,et al.  pMapper: Power and Migration Cost Aware Application Placement in Virtualized Systems , 2008, Middleware.

[7]  J. B. G. Frenk,et al.  On the multidimensional vector bin packing , 1990, Acta Cybern..

[8]  C. K. Michael Tse,et al.  A Stable Matching-Based Virtual Machine Allocation Mechanism for Cloud Data Centers , 2016, 2016 IEEE World Congress on Services (SERVICES).

[9]  Guihai Chen,et al.  STAR: Strategy-Proof Double Auctions for Multi-Cloud, Multi-Tenant Bandwidth Reservation , 2015, IEEE Transactions on Computers.

[10]  Jin Li,et al.  Design and theoretical analysis of virtual machine placement algorithm based on peak workload characteristics , 2017, Soft Comput..

[11]  Hari Balakrishnan,et al.  Choreo: network-aware task placement for cloud applications , 2013, Internet Measurement Conference.

[12]  Tao Lu,et al.  Clique Migration: Affinity Grouping of Virtual Machines for Inter-cloud Live Migration , 2014, 2014 9th IEEE International Conference on Networking, Architecture, and Storage.

[13]  Dario Pompili,et al.  Energy-Aware Application-Centric VM Allocation for HPC Workloads , 2011, 2011 IEEE International Symposium on Parallel and Distributed Processing Workshops and Phd Forum.

[14]  Xiao Zhang,et al.  CPI2: CPU performance isolation for shared compute clusters , 2013, EuroSys '13.

[15]  Haiying Shen,et al.  Consolidating complementary VMs with spatial/temporal-awareness in cloud datacenters , 2014, IEEE INFOCOM 2014 - IEEE Conference on Computer Communications.

[16]  Haiying Shen,et al.  An Exploration of Designing a Hybrid Scale-Up/Out Hadoop Architecture Based on Performance Measurements , 2017, IEEE Transactions on Parallel and Distributed Systems.

[17]  Zhenhuan Gong,et al.  PRESS: PRedictive Elastic ReSource Scaling for cloud systems , 2010, 2010 International Conference on Network and Service Management.

[18]  Haiying Shen,et al.  Designing a Hybrid Scale-Up/Out Hadoop Architecture Based on Performance Measurements for High Application Performance , 2015, 2015 44th International Conference on Parallel Processing.

[19]  Ishai Menache,et al.  Network-Aware Scheduling for Data-Parallel Jobs: Plan When You Can , 2015, Conference on Applications, Technologies, Architectures, and Protocols for Computer Communication.

[20]  R. S. Shaji,et al.  A Cost Effective Load Balancing Scheme for Better Resource Utilization in Cloud Computing , 2014 .

[21]  Jie Wu,et al.  Burstiness-Aware Resource Reservation for Server Consolidation in Computing Clouds , 2016, IEEE Transactions on Parallel and Distributed Systems.

[22]  Saeed Sharifian,et al.  A new model for virtual machine migration in virtualized cluster server based on Fuzzy Decision Making , 2010, ArXiv.

[23]  Ling Liu,et al.  Cura: A Cost-Optimized Model for MapReduce in a Cloud , 2013, 2013 IEEE 27th International Symposium on Parallel and Distributed Processing.

[24]  Aameek Singh,et al.  Server-storage virtualization: Integration and load balancing in data centers , 2008, 2008 SC - International Conference for High Performance Computing, Networking, Storage and Analysis.

[25]  Jie Wu,et al.  Let's stay together: Towards traffic aware virtual machine placement in data centers , 2012, IEEE INFOCOM 2014 - IEEE Conference on Computer Communications.

[26]  Ion Stoica,et al.  FairCloud: sharing the network in cloud computing , 2011, SIGCOMM '12.

[27]  Ehsan Ahvar,et al.  CACEV: A Cost and Carbon Emission-Efficient Virtual Machine Placement Method for Green Distributed Clouds , 2016, 2016 IEEE International Conference on Services Computing (SCC).

[28]  Haiying Shen,et al.  New bandwidth sharing and pricing policies to achieve a win-win situation for cloud provider and tenants , 2014, IEEE INFOCOM 2014 - IEEE Conference on Computer Communications.

[29]  Scott Shenker,et al.  Choosy: max-min fair sharing for datacenter jobs with constraints , 2013, EuroSys '13.

[30]  Jing Xu,et al.  Multi-Objective Virtual Machine Placement in Virtualized Data Center Environments , 2010, 2010 IEEE/ACM Int'l Conference on Green Computing and Communications & Int'l Conference on Cyber, Physical and Social Computing.

[31]  Umesh Bellur,et al.  Optimal Placement Algorithms for Virtual Machines , 2010, ArXiv.

[32]  Zoltán Ádám Mann,et al.  Interplay of Virtual Machine Selection and Virtual Machine Placement , 2016, ESOCC.

[33]  Hai Jin,et al.  Heterogeneity and Interference-Aware Virtual Machine Provisioning for Predictable Performance in the Cloud , 2016, IEEE Transactions on Computers.

[34]  Chita R. Das,et al.  Migration, Assignment, and Scheduling of Jobs in Virtualized Environment , 2011, HotCloud.

[35]  David H. Bailey,et al.  The NAS parallel benchmarks summary and preliminary results , 1991, Proceedings of the 1991 ACM/IEEE Conference on Supercomputing (Supercomputing '91).

[36]  Xiaohui Gu,et al.  CloudScale: elastic resource scaling for multi-tenant cloud systems , 2011, SoCC.

[37]  Feng Zhao,et al.  Energy aware consolidation for cloud computing , 2008, CLUSTER 2008.

[38]  Athanasios V. Vasilakos,et al.  Managing Performance Overhead of Virtual Machines in Cloud Computing: A Survey, State of the Art, and Future Directions , 2014, Proceedings of the IEEE.

[39]  Xiaojun Ruan,et al.  Performance-to-Power Ratio Aware Virtual Machine (VM) Allocation in Energy-Efficient Clouds , 2015, 2015 IEEE International Conference on Cluster Computing.

[40]  Christoforos E. Kozyrakis,et al.  Reconciling high server utilization and sub-millisecond quality-of-service , 2014, EuroSys '14.

[41]  Husnu S. Narman,et al.  CCRP: Customized cooperative resource provisioning for high resource utilization in clouds , 2016, 2016 IEEE International Conference on Big Data (Big Data).

[42]  Hitesh Ballani,et al.  Towards predictable datacenter networks , 2011, SIGCOMM 2011.

[43]  Michael Abd-El-Malek,et al.  Omega: flexible, scalable schedulers for large compute clusters , 2013, EuroSys '13.

[44]  Rajkumar Buyya,et al.  Dynamic resource demand prediction and allocation in multi‐tenant service clouds , 2016, Concurr. Comput. Pract. Exp..

[45]  T. V. Lakshman,et al.  Online Allocation of Virtual Machines in a Distributed Cloud , 2017, IEEE/ACM Transactions on Networking.

[46]  Abhishek Verma,et al.  Large-scale cluster management at Google with Borg , 2015, EuroSys.

[47]  Charles L. Seitz,et al.  Myrinet: A Gigabit-per-Second Local Area Network , 1995, IEEE Micro.

[48]  David R. Kaeli,et al.  Quantifying load imbalance on virtualized enterprise servers , 2010, WOSP/SIPEW '10.

[49]  Ning Ding,et al.  The only constant is change: incorporating time-varying network reservations in data centers , 2012, SIGCOMM.

[50]  Gautam Kar,et al.  Application Performance Management in Virtualized Server Environments , 2006, 2006 IEEE/IFIP Network Operations and Management Symposium NOMS 2006.

[51]  Yuqing Zhu,et al.  BigDataBench: A big data benchmark suite from internet services , 2014, 2014 IEEE 20th International Symposium on High Performance Computer Architecture (HPCA).

[52]  Rajkumar Buyya,et al.  CloudSim: a toolkit for modeling and simulation of cloud computing environments and evaluation of resource provisioning algorithms , 2011, Softw. Pract. Exp..

[53]  Colas Schretter,et al.  Monte Carlo and Quasi-Monte Carlo Methods , 2016 .