Dynamic overload balancing in server farms

We consider the problem of optimal load balancing in a server farm under overload conditions. A convex penalty minimization problem is studied to optimize queue overflow rates at the servers. We introduce a new class of α-fair penalty functions, and show that the cases of α = 0, 1, ∞ correspond to minimum sum penalty, penalty proportional fairness, and min-max fairness, respectively. These functions are useful to maximize the time to first buffer overflow and minimize the recovery time from temporary overload. In addition, we show that any policy that solves an overload minimization problem with strictly increasing penalty functions must be throughput optimal. A dynamic control policy is developed to solve the overload minimization problem in a stochastic setting. This policy generalizes the well-known join-the-shortest-queue (JSQ) policy and uses intelligent job tagging to optimize queue overflow rates without the knowledge of traffic arrival rates.

[1]  Ward Whitt,et al.  Fluid Models for Overloaded Multiclass Many-Server Queueing Systems with First-Come, First-Served Routing , 2008, Manag. Sci..

[2]  Jean C. Walrand,et al.  Fair end-to-end window-based congestion control , 2000, TNET.

[3]  Nicholas Bambos,et al.  Fairness in overloaded parallel queues , 2010, 1011.1237.

[4]  Leandros Tassiulas,et al.  Stability properties of constrained queueing systems and scheduling policies for maximum throughput in multihop radio networks , 1992 .

[5]  David E. Culler,et al.  USENIX Association Proceedings of USITS ’ 03 : 4 th USENIX Symposium on Internet Technologies and Systems , 2003 .

[6]  Devavrat Shah,et al.  Fluid models of congestion collapse in overloaded switched networks , 2011, Queueing Syst. Theory Appl..

[7]  Michael J. Neely,et al.  Delay and rate-optimal control in a multi-class priority queue with adjustable service rates , 2012, 2012 Proceedings IEEE INFOCOM.

[8]  Leandros Tassiulas,et al.  Dynamic CPU scheduling for QoS provisioning , 2013, 2013 IFIP/IEEE International Symposium on Integrated Network Management (IM 2013).

[9]  Sem C. Borst,et al.  Bandwidth-sharing in overloaded networks , 2008, 2008 42nd Annual Conference on Information Sciences and Systems.

[10]  Richard M. Wilson,et al.  A course in combinatorics , 1992 .

[11]  Dimitri P. Bertsekas,et al.  Data Networks , 1986 .

[12]  W. Whitt,et al.  Analysis of join-the-shortest-queue routing for web server farms , 2007, Perform. Evaluation.

[13]  Frank Kelly,et al.  Charging and rate control for elastic traffic , 1997, Eur. Trans. Telecommun..

[14]  Sigrún Andradóttir,et al.  Dynamic server allocation for unstable queueing networks with flexible servers , 2012, Queueing Syst. Theory Appl..

[15]  Ward Whitt,et al.  A Fluid Limit for an Overloaded X Model via a Stochastic Averaging Principle , 2010, Math. Oper. Res..

[16]  Jean-Yves Le Boudec,et al.  A unified framework for max-min and min-max fairness with applications , 2007, TNET.

[17]  Jean-Yves Le Boudec,et al.  A Unified Framework for Max-Min and Min-Max Fairness With Applications , 2007, IEEE/ACM Transactions on Networking.

[18]  Alexander L. Stolyar,et al.  Shadow-Routing Based Control of Flexible Multiserver Pools in Overload , 2011, Oper. Res..

[19]  Dimitri P. Bertsekas,et al.  Data networks (2nd ed.) , 1992 .

[20]  Leandros Tassiulas,et al.  Optimal overload response in sensor networks , 2006, IEEE Transactions on Information Theory.

[21]  Prasant Mohapatra,et al.  Session-based overload control in QoS-aware Web servers , 2002, Proceedings.Twenty-First Annual Joint Conference of the IEEE Computer and Communications Societies.