Online Virtual Machine Allocation with Predictions

The cloud computing industry has grown rapidly over the last decade, and with this growth there is a significant increase in demand for compute resources. Demand is manifested in the form of Virtual Machine (VM) requests, which need to be assigned to physical machines in a way that minimizes resource fragmentation and efficiently utilizes the available machines. This problem can be modeled as a dynamic version of the bin packing problem with the objective of minimizing the total usage time of the bins (physical machines). Earlier works on dynamic bin packing assumed that no knowledge is available to the scheduler and later works studied models in which lifetime/duration of each "item" (VM in our context) is available to the scheduler. This extra information was shown to improve exponentially the achievable competitive ratio. Motivated by advances in Machine Learning that provide good estimates of workload characteristics, this paper studies the effect of having extra information regarding future (total) demand. In the cloud context, since demand is an aggregate over many VM requests, it can be predicted with high accuracy (e.g., using historical data). We show that the competitive factor can be dramatically improved by using this additional information; in some cases, we achieve constant competitiveness, or even a competitive factor that approaches 1. Along the way, we design new offline algorithms with improved approximation ratios for the dynamic bin-packing problem.

[1]  Samir Khuller,et al.  Brief Announcement: A Greedy 2 Approximation for the Active Time Problem , 2018, SPAA.

[2]  Yuxi Li,et al.  Deep Reinforcement Learning: An Overview , 2017, ArXiv.

[3]  Lachlan L. H. Andrew,et al.  Dynamic Right-Sizing for Power-Proportional Data Centers , 2011, IEEE/ACM Transactions on Networking.

[4]  Wentong Cai,et al.  On First Fit Bin Packing for Online Cloud Server Allocation , 2016, 2016 IEEE International Parallel and Distributed Processing Symposium (IPDPS).

[5]  D. T. Lee,et al.  A simple on-line bin-packing algorithm , 1985, JACM.

[6]  Wentong Cai,et al.  On dynamic bin packing for resource allocation in the cloud , 2014, SPAA.

[7]  Randeep Bhatia,et al.  Line System Design and a Generalized Coloring Problem , 2003, ESA.

[8]  Leah Epstein,et al.  A New Lower Bound for Classic Online Bin Packing , 2018, Algorithmica.

[9]  David S. Johnson,et al.  Near-optimal bin packing algorithms , 1973 .

[10]  Xueyan Tang,et al.  Clairvoyant Dynamic Bin Packing for Job Scheduling with Minimum Server Usage Time , 2016, SPAA.

[11]  Vijay Kumar,et al.  Approximation Algorithms for Wavelength Assignment , 2005, FSTTCS.

[12]  Wentong Cai,et al.  Play Request Dispatching for Efficient Virtual Machine Usage in Cloud Gaming , 2015, IEEE Transactions on Circuits and Systems for Video Technology.

[13]  Edward G. Coffman,et al.  Approximation algorithms for bin packing: a survey , 1996 .

[14]  György Dósa,et al.  Performance Guarantees for One-Dimensional Bin Packing , 2007, Handbook of Approximation Algorithms and Metaheuristics.

[15]  Silvio Lattanzi,et al.  Online Scheduling via Learned Weights , 2020, SODA.

[16]  Gianpiero Monaco,et al.  Minimizing total busy time in parallel scheduling with application to optical networks , 2009, 2009 IEEE International Symposium on Parallel & Distributed Processing.

[17]  Ricardo Bianchini,et al.  Resource Central: Understanding and Predicting Workloads for Improved Resource Management in Large Cloud Platforms , 2017, SOSP.

[18]  Navendu Jain,et al.  Managing cost, performance, and reliability tradeoffs for energy-aware server provisioning , 2011, 2011 Proceedings IEEE INFOCOM.

[19]  Peter Winkler,et al.  Wavelength assignment and generalized interval graph coloring , 2003, SODA '03.

[20]  Google,et al.  Improving Online Algorithms via ML Predictions , 2024, NeurIPS.

[21]  Sergei Vassilvitskii,et al.  Competitive caching with machine learned advice , 2018, ICML.

[22]  Wentong Cai,et al.  Dynamic Bin Packing for On-Demand Cloud Resource Allocation , 2016, IEEE Transactions on Parallel and Distributed Systems.

[23]  Jim Gao,et al.  Machine Learning Applications for Data Center Optimization , 2014 .

[24]  Yossi Azar,et al.  Tight Bounds for Clairvoyant Dynamic Bin Packing , 2017, SPAA.

[25]  Raouf Boutaba,et al.  A comprehensive survey on machine learning for networking: evolution, applications and research opportunities , 2018, Journal of Internet Services and Applications.

[26]  Edward G. Coffman,et al.  Dynamic Bin Packing , 1983, SIAM J. Comput..

[27]  Yossi Azar,et al.  Truthful Online Scheduling with Commitments , 2015, EC.

[28]  Alejandro López-Ortiz,et al.  Efficient Online Strategies for Renting Servers in the Cloud , 2015, SOFSEM.

[29]  Mihai Burcea,et al.  An 8/3 Lower Bound for Online Dynamic Bin Packing , 2012, ISAAC.

[30]  Ricardo Bianchini,et al.  History-Based Harvesting of Spare Cycles and Storage in Large-Scale Datacenters , 2016, OSDI.

[31]  Ishai Menache,et al.  Efficient online scheduling for deadline-sensitive jobs: extended abstract , 2013, SPAA.

[32]  Nikhil R. Devanur,et al.  Truth and Regret in Online Scheduling , 2017, EC.

[33]  János Csirik,et al.  Performance Guarantees for One-Dimensional Bin Packing , 2007, Handbook of Approximation Algorithms and Metaheuristics.

[34]  Jakub Marecek,et al.  Handbook of Approximation Algorithms and Metaheuristics , 2010, Comput. J..