Join the Shortest Queue with Many Servers. The Heavy-Traffic Asymptotics

We consider queueing systems with n parallel queues under a Join the Shortest Queue (JSQ) policy in the Halfin-Whitt heavy-traffic regime. We use the martingale method to prove that a scaled process counting the numbers of idle servers and of queues of length exactly two converges weakly to a two-dimensional reflected Ornstein-Uhlenbeck process, while the processes counting longer queues converge to a deterministic system that decays to zero in constant time. This limiting system is comparable to that of the traditional Halfin-Whitt model, but the queueing behavior of the JSQ model differs in key ways. In particular, only a vanishing fraction of customers has to wait, but those who do wait incur a constant-order waiting time.
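To make the setting concrete, the following is a minimal simulation sketch, not the paper's method. It assumes a Markovian model that the abstract does not spell out: Poisson arrivals at rate n - beta*sqrt(n) (one common form of the Halfin-Whitt scaling), unit-rate exponential service, and ties among shortest queues broken arbitrarily. The function name `simulate_jsq` and the parameters `beta`, `horizon`, and `seed` are hypothetical. Because queues of equal length are exchangeable, the sketch tracks only how many queues hold exactly k customers.

```python
import math
import random


def simulate_jsq(n, beta, horizon, seed=0):
    """Simulate JSQ with n unit-rate exponential servers up to time `horizon`.

    Assumed arrival rate: n - beta * sqrt(n) (must be positive).
    Returns counts, where counts[k] is the number of queues holding
    exactly k customers (including the one in service) at the end.
    """
    rng = random.Random(seed)
    lam = n - beta * math.sqrt(n)
    counts = [n]  # all n queues start empty
    t = 0.0
    while True:
        busy = n - counts[0]          # nonempty queues, each served at rate 1
        total_rate = lam + busy
        t += rng.expovariate(total_rate)
        if t > horizon:
            break
        if rng.random() * total_rate < lam:
            # Arrival: join a queue of the smallest current length.
            k = next(i for i, c in enumerate(counts) if c > 0)
            counts[k] -= 1
            if k + 1 == len(counts):
                counts.append(0)
            counts[k + 1] += 1
        else:
            # Departure: a uniformly chosen busy server completes service.
            u = rng.randrange(busy)
            k = 1
            while u >= counts[k]:
                u -= counts[k]
                k += 1
            counts[k] -= 1
            counts[k - 1] += 1
    return counts


if __name__ == "__main__":
    n = 400
    counts = simulate_jsq(n, beta=1.0, horizon=20.0)
    idle = counts[0]
    length_two = counts[2] if len(counts) > 2 else 0
    # Quantities on the sqrt(n) scale, matching the scaled process in the abstract.
    print(idle / math.sqrt(n), length_two / math.sqrt(n))
```

Running the sketch for increasing n and inspecting `counts[3:]` gives an informal check of the claim that longer queues become negligible, though a single sample path is only suggestive.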
