On the Analysis of Randomized Load Balancing Schemes

Abstract. It is well known that simple randomized load balancing schemes can balance load effectively while incurring only a small overhead, making such schemes appealing for practical systems. In this paper we provide new analyses for several such dynamic randomized load balancing schemes. Our work extends a previous analysis of the supermarket model, a model that abstracts a simple, efficient load balancing scheme in the setting where jobs arrive at a large system of parallel processors. In this model, customers arrive at a system of n servers as a Poisson stream of rate λ n , λ < 1 , with service requirements exponentially distributed with mean 1. Each customer chooses d servers independently and uniformly at random from the n servers, and is served according to the First In First Out (FIFO) protocol at the choice with the fewest customers. For the supermarket model, it has been shown that using d=2 choices yields an exponential improvement in the expected time a customer spends in the system over d=1 choice (simple random selection) in equilibrium. Here we examine several variations, including constant service times and threshold models, where a customer makes up to d successive choices until finding one below a set threshold. Our approach involves studying limiting, deterministic models representing the behavior of these systems as the number of servers n goes to infinity. Results of our work include useful general theorems for showing that these deterministic systems are stable or converge exponentially to fixed points. We also demonstrate that allowing customers two choices instead of just one leads to exponential improvements in the expected time a customer spends in the system in several of the related models we study, reinforcing the concept that just two choices yields significant power in load balancing.

[1]  Michael Mitzenmacher,et al.  The Power of Two Choices in Randomized Load Balancing , 2001, IEEE Trans. Parallel Distributed Syst..

[2]  V. Lakshmikantham,et al.  Review: Anthony N. Michel and Richard K. Miller, Qualitative analysis of large scale dynamical systems , 1978 .

[3]  John N. Tsitsiklis,et al.  The efficiency of greedy routing in hypercubes and butterflies , 1994, IEEE Trans. Commun..

[4]  Mor Harchol-Balter,et al.  Bounding delays in packet-routing networks , 1995, STOC '95.

[5]  M. Vidyasagar,et al.  Qualitative Analysis of Large Scale Dynamical Systems , 2012, IEEE Transactions on Systems, Man, and Cybernetics.

[6]  Songnian Zhou A Trace-Driven Simulation Study of Dynamic Load Balancing , 1988, IEEE Trans. Software Eng..

[7]  Ivo J. B. F. Adan,et al.  Analysis of the symmetric shortest queue problem , 1990 .

[8]  Edward D. Lazowska,et al.  Adaptive load sharing in homogeneous distributed systems , 1986, IEEE Transactions on Software Engineering.

[9]  Anthony Unwin,et al.  Reversibility and Stochastic Networks , 1980 .

[10]  Michael Mitzenmacher,et al.  How Useful Is Old Information? , 2000, IEEE Trans. Parallel Distributed Syst..

[11]  R. L. Dobrushin,et al.  Queueing system with selection of the shortest of two queues: an assymptotic approach , 1996 .

[12]  T. Kurtz Solutions of ordinary differential equations as limits of pure jump markov processes , 1970, Journal of Applied Probability.

[13]  Michael Mitzenmacher,et al.  Bounds on the greedy routing algorithm for array networks , 1994, SPAA '94.

[14]  Azer Bestavros Load profiling: a methodology for scheduling real-time tasks in a distributed system , 1997, Proceedings of 17th International Conference on Distributed Computing Systems.

[15]  Eli Upfal,et al.  Balanced Allocations , 1999, SIAM J. Comput..

[16]  N. Wormald Differential Equations for Random Processes and Random Graphs , 1995 .

[17]  Jose Renato Santos,et al.  Design of the Rio (randomized I/o) Storage Server , 1997 .

[18]  R.M. Karp, M. Luby, F. Meyer auf der Hei Efficient PRAM simulation on a distributed memory machine , 1996 .

[19]  L EagerDerek,et al.  Adaptive load sharing in homogeneous distributed systems , 1986 .

[20]  Friedhelm Meyer auf der Heide,et al.  Efficient PRAM simulation on a distributed memory machine , 1992, STOC '92.

[21]  J. Shanthikumar,et al.  Extremal properties of the FIFO discipline in queueing networks , 1992, Journal of Applied Probability.

[22]  Azer Bestavros,et al.  Load Profiling in Distributed Real-Time Systems , 1997, Information Sciences.

[23]  T. Kurtz Approximation of Population Processes , 1987 .

[24]  Richard M. Karp,et al.  An optimal algorithm for on-line bipartite matching , 1990, STOC '90.

[25]  Bruce E. Hajek,et al.  Analysis of Simple Algorithms for Dynamic Load Balancing , 1997, Math. Oper. Res..

[26]  Thomas Kunz,et al.  The Influence of Different Workload Descriptions on a Heuristic Load Balancing Scheme , 1991, IEEE Trans. Software Eng..

[27]  Ivo J. B. F. Adan,et al.  Upper and lower bounds for the waiting time in the symmetric shortest queue system , 1994, Ann. Oper. Res..

[28]  B. Hajek Asymptotic analysis of an assignment problem arising in a distributed communications protocol , 1988, Proceedings of the 27th IEEE Conference on Decision and Control.

[29]  Richard M. Karp,et al.  Maximum Matchings in Sparse Random Graphs , 1981, FOCS 1981.

[30]  T. Kurtz Limit theorems for sequences of jump Markov processes approximating ordinary differential processes , 1971, Journal of Applied Probability.

[31]  Michael Mitzenmacher,et al.  Constant time per edge is optimal on rooted tree networks , 1996, SPAA '96.

[32]  S. Wittevrongel,et al.  Queueing Systems , 2019, Introduction to Stochastic Processes and Simulation.

[33]  Edward D. Lazowska,et al.  A Comparison of Receiver-Initiated and Sender-Initiated Adaptive Load Sharing , 1986, Perform. Evaluation.

[34]  Edward D. Lazowska,et al.  The limited performance benefits of migrating active processes for load sharing , 1988, SIGMETRICS '88.

[35]  John N. Tsitsiklis,et al.  The efficiency of greedy routing in hypercubes and butterflies , 1991, SPAA '91.

[36]  T. Kurtz Strong approximation theorems for density dependent Markov chains , 1978 .

[37]  Edward D. Lazowska,et al.  The limited performance benefits of migrating active processes for load sharing , 1988, SIGMETRICS 1988.

[38]  Leonard Kleinrock,et al.  Queueing Systems: Volume I-Theory , 1975 .