Round-robin routing policy: value functions and mean performance with job- and server-specific costs

We study Round-Robin (RR) routing to a system of parallel queues. The cost structure comprises two components, a service fee and a queueing-delay-related cost, both of which can be job- and queue-specific random variables. With Poisson arrivals, the inter-arrival times at each queue are Erlang distributed, which allows us to study the mean and transient behavior of the queues separately. The service fee is independent of the queueing, and we obtain the corresponding mean cost rate and value function in closed form. For the queueing delay, we first derive integral expressions that enable efficient computation of the corresponding value function. By decomposition, these also yield the value function for the whole system of m parallel queues fed by RR. Given the value function, one can carry out the first policy iteration step with arbitrary holding cost rates (e.g., delay or slowdown), yielding efficient size-, cost- and state-aware policies. As a by-product, we also obtain the mean waiting time in an M/G/m-RR system. The results are illustrated with numerical examples, in which we compute near-optimal task assignment policies for a sample system with two servers.
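The Erlang property mentioned above can be checked empirically. The following is a minimal simulation sketch, not taken from the paper: the function name `rr_interarrival_times` and the values of `lam`, `m`, `n_jobs` and `seed` are illustrative assumptions. It dispatches a Poisson stream round-robin to m queues and compares the gaps seen by one queue with the Erlang-m predictions (mean m/λ, squared coefficient of variation 1/m).

```python
import random
import statistics

def rr_interarrival_times(lam, m, n_jobs, seed=0):
    """Simulate Poisson(lam) arrivals dispatched round-robin to m queues
    and return the inter-arrival times observed at queue 0.
    (Hypothetical helper for illustration only.)"""
    rng = random.Random(seed)
    t = 0.0
    arrivals_q0 = []
    for j in range(n_jobs):
        t += rng.expovariate(lam)   # exponential gap of the Poisson stream
        if j % m == 0:              # RR: every m-th job goes to queue 0
            arrivals_q0.append(t)
    # Differences between consecutive arrival epochs at queue 0
    return [b - a for a, b in zip(arrivals_q0, arrivals_q0[1:])]

lam, m = 2.0, 4                     # assumed arrival rate and number of queues
gaps = rr_interarrival_times(lam, m, n_jobs=200_000)
mean_gap = statistics.fmean(gaps)
scv = statistics.pvariance(gaps) / mean_gap ** 2

print(f"mean gap  : {mean_gap:.3f} (Erlang-{m} predicts {m / lam:.3f})")
print(f"squared CV: {scv:.3f} (Erlang-{m} predicts {1 / m:.3f})")
```

Each gap at a given queue is the sum of m independent exponential gaps of the original stream, which is why the sample statistics should approach the Erlang-m values as the number of simulated jobs grows.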
