Randomized Load Balancing in Finite Regimes

Randomized load balancing is a cost efficient policy for jobscheduling in parallel server queueing systems whereby, with everyincoming job, a central dispatcher randomly polls some servers andselects the one with the smallest queue. By exactly deriving thejobs' delay distribution in such systems, in explicit and closedform, Mitzenmacher [13] proved the so-called 'power-of-two' result, which states that the random polling of only two serversyields an exponential improvement in delay over randomly selectinga single server. Such a fundamental result, however, was obtainedin an asymptotic regime in the total number of servers, and doesdo not necessarily provide accurate estimates for practical finiteregimes with small or moderate number of servers. In this paper weobtain stochastic lower and upper bounds on the jobs' averagedelay in non-asymptotic/finite regimes, by extending ideas foranalyzing the particular case of the Join-the-Shortest-Queue (JSQ) policy. Numerical illustrations indicate not only that the (lower) bounds are remarkably accurate, but also that the asymptoticapproximation can be misleading in scenarios with a small numberof servers, and especially at very high utilizations.

[1]  Guy Latouche,et al.  A general class of Markov processes with explicit matrix-geometric solutions , 1986 .

[2]  R. L. Dobrushin,et al.  Queueing system with selection of the shortest of two queues: an assymptotic approach , 1996 .

[3]  Marcel F. Neuts,et al.  Matrix-geometric solutions in stochastic models - an algorithmic approach , 1982 .

[4]  Donald F. Towsley,et al.  Bounding the Mean Response Time of the Minimum Expected Delay Routing Policy: An Algorithmic Approach , 1995, IEEE Trans. Computers.

[5]  Tonghoon Suk,et al.  Randomized longest-queue-first scheduling for large-scale buffered systems , 2013, Advances in Applied Probability.

[6]  Miklós Telek,et al.  Introduction to Queueing Systems with Telecommunication Applications , 2012 .

[7]  Yiqiang Zhao,et al.  Queueing Analysis of a Jockeying Model , 1995, Oper. Res..

[8]  Fabrice Guillemin,et al.  The Power of Randomized Routing in Heterogeneous Loss Systems , 2015, 2015 27th International Teletraffic Congress.

[9]  Florin Ciucu,et al.  Stochastic bounds for randomized load balancing , 2012, PERV.

[10]  Murat Alanyali,et al.  Occupancy Distributions of Homogeneous Queueing Systems Under Opportunistic Scheduling , 2008, IEEE Transactions on Information Theory.

[11]  Colin McDiarmid,et al.  On the maximum queue length in the supermarket model , 2004 .

[12]  Ivo J. B. F. Adan,et al.  Upper and lower bounds for the waiting time in the symmetric shortest queue system , 1994, Ann. Oper. Res..

[13]  Vaidyanathan Ramaswami,et al.  A logarithmic reduction algorithm for quasi-birth-death processes , 1993, Journal of Applied Probability.

[14]  Yi Lu,et al.  Randomized load balancing with general service time distributions , 2010, SIGMETRICS '10.

[15]  Armand M. Makowski,et al.  Light traffic performance under the power of two load balancing strategy: the case of server heterogeneity , 2014, PERV.

[16]  Michael Mitzenmacher,et al.  The Power of Two Choices in Randomized Load Balancing , 2001, IEEE Trans. Parallel Distributed Syst..

[17]  Yi Lu,et al.  Asymptotic independence of queues under randomized load balancing , 2012, Queueing Syst. Theory Appl..