Fair Dynamic Routing in Large-Scale Heterogeneous-Server Systems

In a call center, there is a natural trade-off between minimizing customer wait time and fairly dividing the workload among agents of different skill levels. The relevant control is the routing policy, that is, the decision concerning which agent should handle an arriving call when more than one agent is available. We formulate an optimization problem for a call center with heterogeneous agent pools, in which each pool is distinguished by the speed at which agents in that pool handle calls. The objective is to minimize steady-state expected customer wait time subject to a “fairness” constraint on the workload division. We first solve the optimization problem by formulating it as a Markov decision process (MDP), and solving a related linear program. We note that this approach does not in general lead to an optimal policy that has a simple structure. Fortunately, the optimal policy does appear to have a simple structure as the system size grows large, in the Halfin-Whitt many-server heavy-traffic limit regime. Therefore, we solve the diffusion control problem that arises in this regime and interpret its solution as a policy for the original system. The resulting routing policy is a threshold policy that determines server pool priorities based on the total number of customers in the system. We prove that a continuous modification of our proposed threshold routing policy is asymptotically optimal in the Halfin-Whitt limit regime. We furthermore present simulation results to illustrate that our proposed threshold routing policy outperforms a common routing policy used in call centers (that routes to the agent that has been idle the longest).

[1]  H. Kushner Numerical Methods for Stochastic Control Problems in Continuous Time , 2000 .

[2]  Chin-Chi Wu,et al.  Improving Inter-server Fairness in Active Queue Management , 2007, IEEE Communications Letters.

[3]  Adam Wierman,et al.  Fairness and classifications , 2007, PERV.

[4]  Ger Koole,et al.  Monotonicity in Markov Reward and Decision Chains: Theory and Applications , 2007, Found. Trends Stoch. Syst..

[5]  Ward Whitt,et al.  Queue-and-Idleness-Ratio Controls in Many-Server Service Systems , 2009, Math. Oper. Res..

[6]  MandelbaumAvishai,et al.  Scheduling Flexible Servers with Convex Delay Costs , 2004 .

[7]  Yulia Tseytlin,et al.  Queueing Systems with Heterogeneous Servers : Improving Patients ’ Flow in Hospitals M . Sc . Research Proposal , 2007 .

[8]  P. Protter,et al.  Weak Limit Theorems for Stochastic Integrals and Stochastic Differential Equations , 1991 .

[9]  Benjamin Avi-Itzhak,et al.  QUANTIFYING FAIRNESS IN QUEUING SYSTEMS , 2008, Probability in the Engineering and Informational Sciences.

[10]  Ronald J. Williams,et al.  Dynamic scheduling of a system with two parallel servers in heavy traffic with resource pooling: asymptotic optimality of a threshold policy , 2001 .

[11]  Ashok K. Agrawala,et al.  Control of a Heterogeneous Two-Server Exponential Queueing System , 1983, IEEE Transactions on Software Engineering.

[12]  R. Atar Scheduling control for queueing systems with many servers: asymptotic optimality in heavy traffic , 2005, math/0602526.

[13]  S. L. Bell,et al.  Dynamic Scheduling of a Parallel Server System in Heavy Traffic with Complete Resource Pooling: Asymptotic Optimality of a Threshold Policy , 2005 .

[14]  D. Gamarnik,et al.  Validity of heavy traffic steady-state approximations in generalized Jackson networks , 2004, math/0410066.

[15]  Martin L. Puterman,et al.  Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .

[16]  J. Michael Harrison,et al.  Brownian Models of Queueing Networks with Heterogeneous Customer Populations , 1988 .

[17]  Rami Atar,et al.  Efficient Routing in Heavy Traffic Under Partial Sampling of Service Times , 2008, Math. Oper. Res..

[18]  Rami Atar,et al.  A blind policy for equalizing cumulative idleness , 2011, Queueing Syst. Theory Appl..

[19]  Francis de Véricourt,et al.  On the incomplete results for the heterogeneous server problem , 2006, Queueing Syst. Theory Appl..

[20]  W. Whitt,et al.  Transient behavior of regulated Brownian motion, I: Starting at the origin , 1987, Advances in Applied Probability.

[21]  Kevin D. Glazebrook,et al.  Parallel Scheduling of Multiclass M/M/m Queues: Approximate and Heavy-Traffic Optimization of Achievable Performance , 2001, Oper. Res..

[22]  M. Rubinovitch The Slow Server Problem , 1983 .

[23]  P. Billingsley,et al.  Convergence of Probability Measures , 1969 .

[24]  Noah Gans,et al.  A Call-Routing Problem with Service-Level Constraints , 2003, Oper. Res..

[25]  P. R. Kumar,et al.  Optimal control of a queueing system with two heterogeneous servers , 1984 .

[26]  Avishai Mandelbaum,et al.  Call Centers with Impatient Customers: Many-Server Asymptotics of the M/M/n + G Queue , 2005, Queueing Syst. Theory Appl..

[27]  Paul E. Spector,et al.  The Role of Justice in Organizations: A Meta-Analysis , 2001 .

[28]  Mor Armony,et al.  Dynamic Routing in Large-Scale Service Systems with Heterogeneous Servers , 2005, Queueing Syst. Theory Appl..

[29]  Avishai Mandelbaum,et al.  Staffing Many-Server Queues with Impatient Customers: Constraint Satisfaction in Call Centers , 2009, Oper. Res..

[30]  Ioannis Karatzas,et al.  Brownian Motion and Stochastic Calculus , 1987 .

[31]  Sem C. Borst,et al.  Dimensioning Large Call Centers , 2000, Oper. Res..

[32]  Ward Whitt,et al.  Service-Level Differentiation in Many-Server Service Systems via Queue-Ratio Routing , 2010, Oper. Res..

[33]  Eitan Altman,et al.  Sensitivity of constrained Markov decision processes , 1991, Ann. Oper. Res..

[34]  Alexander L. Stolyar,et al.  OPTIMAL ROUTING IN OUTPUT-QUEUED FLEXIBLE SERVER SYSTEMS , 2005, Probability in the Engineering and Informational Sciences.

[35]  Donald E. Conlon,et al.  Justice at the millennium: a meta-analytic review of 25 years of organizational justice research. , 2001, The Journal of applied psychology.

[36]  V. V. Rykov,et al.  On the slow server problem , 2009 .

[37]  Benjamin Avi-Itzhak,et al.  QUANTIFYING FAIRNESS IN QUEUEING SYSTEMS: PRINCIPLES AND APPLICATIONS , 2004 .

[38]  A. V. Balakrishnan,et al.  Stochastic differential systems , 1973 .

[39]  Avishai Mandelbaum,et al.  Routing and Staffing in Large-Scale Service Systems: The Case of Homogeneous Impatient Customers and Heterogeneous Servers , 2011, Oper. Res..

[40]  W. Whitt,et al.  PIECEWISE-LINEAR DIFFUSION PROCESSES , 1995 .

[41]  Sandjai Bhulai,et al.  A queueing model for call blending in call centers , 2003, IEEE Trans. Autom. Control..

[42]  Rami Atar,et al.  Scheduling a multi class queue with many exponential servers: asymptotic optimality in heavy traffic , 2004, math/0407058.

[43]  Ward Whitt,et al.  Some Useful Functions for Functional Limit Theorems , 1980, Math. Oper. Res..

[44]  M. Yor DIFFUSIONS, MARKOV PROCESSES AND MARTINGALES: Volume 2: Itô Calculus , 1989 .

[45]  J. Harrison Heavy traffic analysis of a system with parallel servers: asymptotic optimality of discrete-review policies , 1998 .

[46]  J. Michael Harrison,et al.  Dynamic Scheduling of a Multiclass Queue in the Halfin-Whitt Heavy Traffic Regime , 2004, Oper. Res..

[47]  Avishai Mandelbaum,et al.  Telephone Call Centers: Tutorial, Review, and Research Prospects , 2003, Manuf. Serv. Oper. Manag..

[48]  Ward Whitt,et al.  The Impact of Increased Employee Retention on Performance in a Customer Contact Center , 2006, Manuf. Serv. Oper. Manag..

[49]  Itay Gurvich,et al.  When Promotions Meet Operations: Cross-Selling and Its Effect on Call Center Performance , 2010, Manuf. Serv. Oper. Manag..

[50]  Tolga Tezcan,et al.  State Space Collapse in Many-Server Diffusion Limits of Parallel Server Systems , 2011, Math. Oper. Res..

[51]  R. Stockbridge A martingale approach to the slow server problem , 1991 .

[52]  Itay Gurvich,et al.  Service-Level Differentiation in Call Centers with Fully Flexible Servers , 2008, Manag. Sci..

[53]  Amy R. Ward,et al.  Critical Thresholds for Dynamic Routing in Queueing Networks , 2002, Queueing Syst. Theory Appl..

[54]  Fabricio Bandeira Cabral,et al.  Queues with heterogeneous servers and uninformed customers: who works the most? , 2007, 0706.0560.

[55]  E. Altman Constrained Markov Decision Processes , 1999 .

[56]  Alexander L. Stolyar,et al.  Scheduling Flexible Servers with Convex Delay Costs: Heavy-Traffic Optimality of the Generalized cµ-Rule , 2004, Oper. Res..

[57]  Fabricio Bandeira Cabral,et al.  The Slow Server Problem for Uninformed Customers , 2005, Queueing Syst. Theory Appl..

[58]  Ward Whitt,et al.  Heavy-Traffic Limits for Queues with Many Exponential Servers , 1981, Oper. Res..

[59]  Ward Whitt,et al.  Scheduling Flexible Servers with Convex Delay Costs in Many-Server Service Systems , 2009, Manuf. Serv. Oper. Manag..

[60]  W. Fleming,et al.  Controlled Markov processes and viscosity solutions , 1992 .

[61]  MandelbaumAvishai,et al.  Call Centers with Impatient Customers , 2005 .

[62]  Rami Atar Central limit theorem for a many-server queue with random service rates. , 2008 .