Optimal probabilistic routing in distributed parallel queues

We consider the fundamental problem of routing customers among distributed parallel queues to minimize an objective function based on equilibrium sojourn times under general assumptions for the arrival and service processes and under the assumption that customers are routed to the parallel queues in a probabilistic manner. More specifically, we derive explicit solutions for the asymptotically optimal vector of probabilities that control the routing of customers upon arrival among a set of heterogeneous general single-server queues through stochastic-process limits. Our assumption of probabilistic routing is consistent with previous theoretical studies of this optimization problem, and our solutions can be used for the parameter settings of other routing mechanisms found in practice. Stochastic-process limits are exploited in order to be able to handle general arrival and service processes and obtain explicit solutions to the scheduling optimization problems of interest.

[1]  G. J. Foschini,et al.  A Basic Dynamic Routing Problem and Diffusion , 1978, IEEE Trans. Commun..

[2]  Mark S. Squillante,et al.  MATRIX-ANALYTIC ANALYSIS OF A MAP/PH/1 QUEUE FITTED TO WEB SERVER DATA , 2002 .

[3]  Dimitri P. Bertsekas,et al.  Nonlinear Programming , 1997 .

[4]  Sem C. Borst Optimal probabilistic allocation of customer types to servers , 1995, SIGMETRICS '95/PERFORMANCE '95.

[5]  Mor Harchol-Balter,et al.  Analysis of SRPT scheduling: investigating unfairness , 2001, SIGMETRICS '01.

[6]  Sheldon M. Ross,et al.  Stochastic Processes , 2018, Gauge Integral Structures for Stochastic Calculus and Quantum Electrodynamics.

[7]  Avishai Mandelbaum,et al.  Strong Approximations for Time-Dependent Queues , 1995, Math. Oper. Res..

[8]  Mark S. Squillante,et al.  Optimal stochastic scheduling in multiclass parallel queues , 1999, SIGMETRICS '99.

[9]  Mark S. Squillante,et al.  IBM Research Report Optimal Probabilistic Routing in Distributed Parallel Queues , 2004 .

[10]  Susan H. Xu,et al.  Asymptotically Optimal Routing and Servive Rate Allocation in a Multiserver Queueing System , 1997, Oper. Res..

[11]  Tapani Lehtonen,et al.  On the optimality of the shortest line discipline , 1984 .

[12]  Mor Harchol-Balter,et al.  Exploiting process lifetime distributions for dynamic load balancing , 1995, SIGMETRICS.

[13]  Sanjay K. Bose,et al.  Fundamentals of Queueing Networks , 2002 .

[14]  Mor Harchol-Balter Task assignment with unknown duration , 2002, JACM.

[15]  Daniel M. Dias,et al.  A scalable and highly available web server , 1996, COMPCON '96. Technologies for the Information Superhighway Digest of Papers.

[16]  Edward D. Lazowska,et al.  Adaptive load sharing in homogeneous distributed systems , 1986, IEEE Transactions on Software Engineering.

[17]  Guerney D. H. Hunt,et al.  Network Dispatcher: A Connection Router for Scalable Internet Services , 1998, Comput. Networks.

[18]  P. Sparaggis,et al.  Minimizing response times and queue lengths in systems of parallel queues , 1999 .

[19]  Hui Wang,et al.  First passage times of a jump diffusion process , 2003, Advances in Applied Probability.

[20]  Anurag Kumar,et al.  Adaptive Optimal Load Balancing in a Nonhomogeneous Multiserver System with a Central Job Scheduler , 1990, IEEE Trans. Computers.

[21]  David D. Yao,et al.  Optimal load balancing and scheduling in a distributed computer system , 1991, JACM.

[22]  Ward Whitt,et al.  Deciding Which Queue to Join: Some Counterexamples , 1986, Oper. Res..

[23]  R. Weber On the optimal assignment of customers to parallel servers , 1978, Journal of Applied Probability.

[24]  Yung-Terng Wang,et al.  Load Sharing in Distributed Systems , 1985, IEEE Transactions on Computers.

[25]  Asser N. Tantawi,et al.  Optimal static load balancing in distributed computer systems , 1985, JACM.

[26]  Teunis J. Ott,et al.  Load-balancing heuristics and process behavior , 1986, SIGMETRICS '86/PERFORMANCE '86.

[27]  Sergei Grishechkin GI/G/1 processor sharing queue in heavy traffic , 1994, Advances in Applied Probability.