论文信息 - Feedback control of server instances for right sizing in the cloud

Feedback control of server instances for right sizing in the cloud

We consider a computing system based on sum-moning server instances on the fly, possibly from a remote cloud service. A feedback rule must be designed to track the exogenous load with the right service capacity, taking into account the inherent lags in server creation and deletion. We use fluid and diffusion approximations of queueing models to analyze control schemes that manage the tradeoff between job queueing and idle capacity, in the large scale limit. In particular we propose a method in which the system can achieve negligible queueing while minimizing idle capacity. Theoretical results are supported by simulations.

Fernando Paganini | Andrés Ferragut | Diego Goldsztajn

[1] T. Kurtz. Strong approximation theorems for density dependent Markov chains , 1978 .

[2] S. Ethier,et al. Markov Processes: Characterization and Convergence , 2005 .

[3] T. Kurtz. Limit theorems for sequences of jump Markov processes approximating ordinary differential processes , 1971, Journal of Applied Probability.

[4] Sem C. Borst,et al. Scalable load balancing in networked systems: A survey of recent advances , 2018, SIAM Rev..

[5] James R. Larus,et al. Join-Idle-Queue: A novel load balancing algorithm for dynamically scalable web services , 2011, Perform. Evaluation.

[6] Ward Whitt,et al. Heavy-Traffic Limits for Queues with Many Exponential Servers , 1981, Oper. Res..

[7] Fernando Paganini,et al. Controlling the number of active instances in a cloud environment , 2018, PERV.

[8] Neil Walton,et al. Load Balancing in the Non-Degenerate Slowdown Regime , 2017 .

[9] Alexander L. Stolyar,et al. A Service System with Randomly Behaving On-demand Agents , 2016, SIGMETRICS.

[10] Varun Gupta,et al. Load Balancing in the Nondegenerate Slowdown Regime , 2019, Oper. Res..

[11] W. Marsden. I and J , 2012 .

[12] Alexander L. Stolyar,et al. Large-scale join-idle-queue system with general service times , 2017, J. Appl. Probab..

[13] John N. Tsitsiklis,et al. Delay, Memory, and Messaging Tradeoffs in Distributed Service Systems , 2018 .