Achieving Rapid Recovery in an Overload Control for Large-Scale Service Systems

We consider an automatic overload control for two large service systems modeled as multiserver queues such as call centers. We assume that the two systems are designed to operate independently, but want to help each other respond to unexpected overloads. The proposed overload control automatically activates sharing sending some customers from one system to the other once a ratio of the queue lengths in the two systems crosses an activation threshold with ratio and activation threshold parameters for each direction. In this paper, we are primarily concerned with ensuring that the system recovers rapidly after the overload is over, either because i the two systems return to normal loading or ii the direction of the overload suddenly shifts in the opposite direction. To achieve rapid recovery, we introduce lower thresholds for the queue ratios, below which one-way sharing is released. As a basis for studying the complex dynamics, we develop a new six-dimensional fluid approximation for a system with time-varying arrival rates, extending a previous fluid approximation involving a stochastic averaging principle. We conduct simulations to confirm that the new algorithm is effective for predicting the system performance and choosing effective control parameters. The simulation and the algorithm show that the system can experience an inefficient nearly periodic behavior, corresponding to an oscillating equilibrium congestion collapse if the sharing is strongly inefficient and the control parameters are set inappropriately.

[1]  Natalia Yankovic,et al.  The Impact of Ambulance Diversion on Heart Attack Deaths , 2010, Inquiry : a journal of medical care organization, provision and financing.

[2]  Ward Whitt,et al.  Efficiently providing multiple grades of service with protection against overloads in shared resources , 1995, AT&T Technical Journal.

[3]  Allen J. Scott,et al.  Applications of Queueing Theory , 1972 .

[4]  Ulf Körner Overload control of SPC systems , 1991 .

[5]  Ward Whitt,et al.  A Fluid Approximation for Service Systems Responding to Unexpected Overloads , 2011, Oper. Res..

[6]  Eduardo D. Sontag,et al.  Mathematical Control Theory Second Edition , 1998 .

[7]  Ward Whitt,et al.  A Switching Fluid Limit of a Stochastic Network Under a State-Space-Collapse Inducing Control with Chattering , 2014, 1407.7053.

[8]  Ward Whitt,et al.  Large-time asymptotics for the Gt/Mt/st+GIt many-server fluid queue with abandonment , 2011, Queueing Syst. Theory Appl..

[9]  Henning Schulzrinne,et al.  Congestion control for real-time traffic in high-speed networks , 1990, Proceedings. IEEE INFOCOM '90: Ninth Annual Joint Conference of the IEEE Computer and Communications Societies@m_The Multiple Facets of Integration.

[10]  Ward Whitt,et al.  Scheduling Flexible Servers with Convex Delay Costs in Many-Server Service Systems , 2009, Manuf. Serv. Oper. Manag..

[11]  Arjan van der Schaft,et al.  An Introduction to Hybrid Dynamical Systems, Springer Lecture Notes in Control and Information Sciences 251 , 1999 .

[12]  Karl Aberer,et al.  Congestion Control for Distributed Hash Tables , 2006, Fifth IEEE International Symposium on Network Computing and Applications (NCA'06).

[13]  M. Reiman,et al.  Optimality of Randomized Trunk Reservation , 1994 .

[14]  Ward Whitt,et al.  Stabilizing Customer Abandonment in Many-Server Queues with Time-Varying Arrivals , 2012, Oper. Res..

[15]  Avishai Mandelbaum,et al.  Designing a Call Center with Impatient Customers , 2002, Manuf. Serv. Oper. Manag..

[16]  Ward Whitt,et al.  Queue-and-Idleness-Ratio Controls in Many-Server Service Systems , 2009, Math. Oper. Res..

[17]  W. A. Massey,et al.  DYNAMIC PRICING TO CONTROL LOSS SYSTEMS WITH QUALITY OF SERVICE TARGETS , 2009, Probability in the Engineering and Informational Sciences.

[18]  Ward Whitt,et al.  Service-Level Differentiation in Many-Server Service Systems via Queue-Ratio Routing , 2010, Oper. Res..

[19]  Bernard O. Koopman,et al.  Air-Terminal Queues under Time-Dependent Conditions , 1972, Oper. Res..

[20]  Zeynep Akşin,et al.  The Modern Call Center: A Multi‐Disciplinary Perspective on Operations Management Research , 2007 .

[21]  V. Baier,et al.  Highly sensitive thermopile heat power sensor for micro-fluid calorimetry of biochemical processes , 2005 .

[22]  Ward Whitt,et al.  Efficiency-Driven Heavy-Traffic Approximations for Many-Server Queues with Abandonments , 2004, Manag. Sci..

[23]  Carri W. Chan,et al.  When to use Speedup : An Examination of Intensive Care Units with Readmissions , 2011 .

[24]  Sally Floyd,et al.  Promoting the use of end-to-end congestion control in the Internet , 1999, TNET.

[25]  Aleksej F. Filippov,et al.  Differential Equations with Discontinuous Righthand Sides , 1988, Mathematics and Its Applications.

[26]  F. Burk Euler's Constant , 1985 .

[27]  Ward Whitt,et al.  An ODE for an Overloaded X Model Involving a Stochastic Averaging Principle , 2010, ArXiv.

[28]  Johannes Schumacher,et al.  An Introduction to Hybrid Dynamical Systems, Springer Lecture Notes in Control and Information Sciences 251 , 1999 .

[29]  Cheng Jin,et al.  FAST TCP: Motivation, Architecture, Algorithms, Performance , 2006, IEEE/ACM Transactions on Networking.

[30]  Daniel Liberzon,et al.  Switching in Systems and Control , 2003, Systems & Control: Foundations & Applications.

[31]  Ward Whitt,et al.  Chattering and congestion collapse in an overload switching control , 2016 .

[32]  Itay Gurvich,et al.  Overflow Networks: Approximations and Implications to Call Center Outsourcing , 2012, Oper. Res..

[33]  Ashok Erramilli,et al.  Oscillations and Chaos in a Flow Model of a Switching System , 1991, IEEE J. Sel. Areas Commun..

[34]  Ward Whitt,et al.  Fluid Models for Multiserver Queues with Abandonments , 2006, Oper. Res..

[35]  JEL Classification , 1998 .

[36]  Itay Gurvich,et al.  Centralized vs. Decentralized Ambulance Diversion: A Network Perspective , 2011, Manag. Sci..

[37]  Nicholas Bambos,et al.  Fairness in overloaded parallel queues , 2010, 1011.1237.

[38]  Ward Whitt,et al.  Diffusion approximation for an overloaded X model via a stochastic averaging principle , 2010, Queueing Syst. Theory Appl..

[39]  J. Dai On Positive Harris Recurrence of Multiclass Queueing Networks: A Unified Approach Via Fluid Limit Models , 1995 .

[40]  William A. Massey,et al.  A time-varying call center design via lagrangian mechanics , 2009 .

[41]  Ward Whitt,et al.  Responding to Unexpected Overloads in Large-Scale Service Systems , 2009, Manag. Sci..

[42]  Ward Whitt,et al.  A Fluid Limit for an Overloaded X Model Via an Averaging Principle , 2010, ArXiv.

[43]  Alexander L. Stolyar,et al.  Shadow-Routing Based Control of Flexible Multiserver Pools in Overload , 2011, Oper. Res..

[44]  R. I. Wilkinson Theories for toll traffic engineering in the U. S. A. , 1956 .

[45]  Paul Atkinson,et al.  Emergency Department Crowding: Time for Interventions and Policy Evaluations , 2012, Emergency medicine international.

[46]  Gilles Reinhardt,et al.  The relationship between inpatient discharge timing and emergency department boarding. , 2012, The Journal of emergency medicine.

[47]  Pengyi Shi,et al.  Hospital Inpatient Operations : Mathematical Models and Managerial Insights , 2012 .

[48]  W. Whitt,et al.  Martingale proofs of many-server heavy-traffic limits for Markovian queues ∗ , 2007, 0712.4211.

[49]  G. F. Newell,et al.  Applications of Queueing Theory. , 1983 .

[50]  Avishai Mandelbaum,et al.  Telephone Call Centers: Tutorial, Review, and Research Prospects , 2003, Manuf. Serv. Oper. Manag..

[51]  Ward Whitt,et al.  Nearly periodic behavior in the overloaded $G/D/s+GI$ queue , 2011 .

[52]  MandelbaumAvishai,et al.  Scheduling Flexible Servers with Convex Delay Costs , 2004 .

[53]  P. Olver Nonlinear Systems , 2013 .

[54]  Ward Whitt,et al.  Effective bandwidths with priorities , 1998, TNET.

[55]  Ward Whitt,et al.  The Gt/GI/st+GI many-server fluid queue , 2012, Queueing Syst. Theory Appl..

[56]  Ward Whitt,et al.  A Fluid Limit for an Overloaded X Model via a Stochastic Averaging Principle , 2010, Math. Oper. Res..

[57]  S. Zachary,et al.  Loss networks , 2009, 0903.0640.

[58]  Ward Whitt,et al.  A many-server fluid limit for the Gt/G//st+G/ queueing model experiencing periods of overloading , 2012, Oper. Res. Lett..

[59]  Ward Whitt,et al.  Staffing of Time-Varying Queues to Achieve Time-Stable Performance , 2008, Manag. Sci..

[60]  Avishai Mandelbaum,et al.  ON PATIENT FLOW IN HOSPITALS: A DATA-BASED QUEUEING-SCIENCE PERSPECTIVE , 2015 .

[61]  Harry Heffes,et al.  Overload Performance of Several Processor Queueing Disciplines for the M/M/1 Queue , 1986, IEEE Trans. Commun..

[62]  A. Juditsky,et al.  Solving variational inequalities with Stochastic Mirror-Prox algorithm , 2008, 0809.0815.

[63]  Devavrat Shah,et al.  Fluid models of congestion collapse in overloaded switched networks , 2011, Queueing Syst. Theory Appl..

[64]  Andrey V. Savkin,et al.  Qualitative Theory of Hybrid Dynamical Systems , 2012 .

[65]  R. Young,et al.  75.9 Euler’s Constant , 1991, The Mathematical Gazette.

[66]  J. H. Weber,et al.  A simulation study of routing and control in communications networks , 1964 .

[67]  Fernando Paganini,et al.  Internet congestion control , 2002 .