Heavy-traffic Delay Optimality in Pull-based Load Balancing Systems: Necessary and Sufficient Conditions

In this paper, we consider a load balancing system under a general pull-based policy. In particular, each arrival is randomly dispatched to any server whose queue length is below a threshold; if no such server exists, then the arrival is randomly assigned to any server. We are interested in the fundamental relationship between the threshold and the delay performance of the system in heavy traffic. To this end, we first establish the following necessary condition to guarantee heavy-traffic delay optimality: the threshold needs to grow to infinity as the exogenous arrival rate approaches the boundary of the capacity region (i.e., the load intensity approaches one) but the growth rate should be slower than a polynomial function of the mean number of tasks in the system. As a special case of this result, we directly show that the delay performance of the popular pull-based policy Join-Idle-Queue (JIQ) is not heavy traffic optimal, but performs strictly better than random routing. We further show that a sufficient condition for heavy-traffic delay optimality is that the threshold grows logarithmically with the mean number of tasks in the system. This result directly resolves a generalized version of the conjecture by Kelly and Laws.

[1]  Amarjit Budhiraja,et al.  Stationary Distribution Convergence for Generalized Jackson Networks in Heavy Traffic , 2009, Math. Oper. Res..

[2]  James R. Larus,et al.  Join-Idle-Queue: A novel load balancing algorithm for dynamically scalable web services , 2011, Perform. Evaluation.

[3]  Maury Bramson,et al.  State space collapse with application to heavy traffic limits for multiclass queueing networks , 1998, Queueing Syst. Theory Appl..

[4]  J. Harrison Heavy traffic analysis of a system with parallel servers: asymptotic optimality of discrete-review policies , 1998 .

[5]  Ness B. Shroff,et al.  Heavy-traffic Delay Optimality in Pull-based Load Balancing Systems , 2018, Proc. ACM Meas. Anal. Comput. Syst..

[6]  B. Hajek Hitting-time and occupation-time bounds implied by drift analysis with applications , 1982, Advances in Applied Probability.

[7]  Yi Lu,et al.  Priority algorithm for near-data scheduling: Throughput and heavy-traffic optimality , 2015, 2015 IEEE Conference on Computer Communications (INFOCOM).

[8]  G. J. Foschini,et al.  A Basic Dynamic Routing Problem and Diffusion , 1978, IEEE Trans. Commun..

[9]  R. Srikant,et al.  Heavy-Traffic Delay Insensitivity in Connection-Level Models of Data Transfer with Proportionally Fair Bandwidth Sharing , 2018, PERV.

[10]  R. Srikant,et al.  Heavy traffic optimal resource allocation algorithms for cloud computing clusters , 2012, 2012 24th International Teletraffic Congress (ITC 24).

[11]  Amy R. Ward,et al.  Critical Thresholds for Dynamic Routing in Queueing Networks , 2002, Queueing Syst. Theory Appl..

[12]  Alexander L. Stolyar Pull-based load distribution in large-scale heterogeneous service systems , 2015, Queueing Syst. Theory Appl..

[13]  R. Srikant,et al.  Optimal heavy-traffic queue length scaling in an incompletely saturated switch , 2018, Queueing Syst. Theory Appl..

[14]  R. Srikant,et al.  Asymptotically tight steady-state queue length bounds implied by drift conditions , 2011, Queueing Syst. Theory Appl..

[15]  Hong Chen,et al.  Asymptotic Optimality of Balanced Routing , 2012, Oper. Res..

[16]  Ruth J. Williams,et al.  Diffusion approximations for open multiclass queueing networks: sufficient conditions involving state space collapse , 1998, Queueing Syst. Theory Appl..

[17]  Ward Whitt,et al.  Queue-and-Idleness-Ratio Controls in Many-Server Service Systems , 2009, Math. Oper. Res..

[18]  Ronald J. Williams,et al.  Dynamic scheduling of a system with two parallel servers in heavy traffic with resource pooling: asymptotic optimality of a threshold policy , 2001 .

[19]  Lei Ying,et al.  MapTask Scheduling in MapReduce With Data Locality: Throughput and Heavy-Traffic Optimality , 2013, IEEE/ACM Transactions on Networking.

[20]  Sem C. Borst,et al.  Universality of load balancing schemes on the diffusion scale , 2016, J. Appl. Probab..

[21]  R. Srikant,et al.  Queue Length Behavior in a Switch under the MaxWeight Algorithm , 2015 .

[22]  Wang Rongxin,et al.  Heavy traffic limit theorems for a queueing system in which customers join the shortest line , 1989, Advances in Applied Probability.

[23]  F. P. Kelly,et al.  Dynamic routing in open queueing networks: Brownian models, cut constraints and resource pooling , 1993, Queueing Syst. Theory Appl..

[24]  Yin Sun,et al.  Designing Low-Complexity Heavy-Traffic Delay-Optimal Load Balancing Schemes: Theory to Algorithms , 2019, PERV.

[25]  D. Gamarnik,et al.  Validity of heavy traffic steady-state approximations in generalized Jackson networks , 2004, math/0410066.

[26]  Ness B. Shroff,et al.  Flexible Load Balancing with Multi-dimensional State-space Collapse , 2018, Perform. Evaluation.

[27]  Yi Lu,et al.  Scheduling with multi-level data locality: Throughput and heavy-traffic optimality , 2016, IEEE INFOCOM 2016 - The 35th Annual IEEE International Conference on Computer Communications.

[28]  Ness B. Shroff,et al.  Designing Low-Complexity Heavy-Traffic Delay-Optimal Load Balancing Schemes , 2017, Proc. ACM Meas. Anal. Comput. Syst..

[29]  Ward Whitt,et al.  Heavy-Traffic Limits for Queues with Many Exponential Servers , 1981, Oper. Res..

[30]  Mor Armony,et al.  Dynamic Routing in Large-Scale Service Systems with Heterogeneous Servers , 2005, Queueing Syst. Theory Appl..

[31]  Martin I. Reiman,et al.  Some diffusion approximations with state space collapse , 1984 .

[32]  Tolga Tezcan,et al.  State Space Collapse in Many-Server Diffusion Limits of Parallel Server Systems , 2011, Math. Oper. Res..