A c/μ-Rule for Service Resource Allocation in Group-Server Queues

In this paper, we study a dynamic on/off server scheduling problem in a queueing system with multi-class servers, where servers are heterogeneous and can be classified into $K$ groups. Servers in the same group are homogeneous. A scheduling policy determines the number of working servers (servers that are turned on) in each group at every state $n$ (number of customers in the system). Our goal is to find the optimal scheduling policy to minimize the long-run average cost, which consists of an increasing convex holding cost and a linear operating cost. We use the sensitivity-based optimization theory to characterize the optimal policy. A necessary and sufficient condition of the optimal policy is derived. We also prove that the optimal policy has monotone structures and a quasi bang-bang control is optimal. We find that the optimal policy is indexed by the value of $c - \mu G(n)$, where $c$ is the operating cost rate, $\mu$ is the service rate for a server, and $G(n)$ is a computable quantity called perturbation realization factor. Specifically, the group with smaller negative $c - \mu G(n)$ is more preferred to be turned on, while the group with positive $c - \mu G(n)$ should be turned off. However, the preference ranking of each group is affected by $G(n)$ and the preference order may change with the state $n$, the arrival rate, and the cost function. Under a reasonable condition of scale economies, we further prove that the optimal policy obeys a so-called $c$/$\mu$-rule. That is, the servers with smaller $c$/$\mu$ should be turned on with higher priority and the preference order of groups remains unchanged. This rule can be viewed as a sister version of the famous $c\mu$-rule for polling queues. With the monotone property of $G(n)$, we further prove that the optimal policy has a multi-threshold structure when the $c$/$\mu$-rule is applied.

[1]  Wayne E. Smith Various optimizers for single‐stage production , 1956 .

[2]  M. Yadin,et al.  Queueing Systems with a Removable Service Station , 1963 .

[3]  Daniel P. Heyman,et al.  Optimal Operating Policies for M/G/1 Queuing Systems , 1968, Oper. Res..

[4]  Matthew J. Sobel,et al.  Optimal Average-Cost Policy for a Queue with Start-Up and Shut-Down Costs , 1969, Oper. Res..

[5]  G. Klimov Time-Sharing Service Systems. I , 1975 .

[6]  Daniel P. Heyman,et al.  The T-Policy for the M/G/1 Queue , 1977 .

[7]  Colin E. Bell Optimal Operation of an M/M/2 Queue with Removable Servers , 1980, Oper. Res..

[8]  Zvi Rosberg,et al.  Optimal control of service in tandem queues , 1982 .

[9]  P. R. Kumar,et al.  Optimal control of a queueing system with two heterogeneous servers , 1984 .

[10]  Jean Walrand,et al.  A note on optimal control of a queuing system with two heterogeneous servers , 1984 .

[11]  J. Baras,et al.  Two competing queues with linear costs: the μc-rule is often optimal , 1984, Advances in Applied Probability.

[12]  Armand M. Makowski,et al.  K competing queues with geometric service requirements and linear costs: The μc-rule is always optimal☆ , 1985 .

[13]  Refael Hassin,et al.  ON THE OPTIMALITY OF FIRST COME LAST SERVED QUEUES , 1985 .

[14]  J. Walrand,et al.  The cμ rule revisited , 1985, Advances in Applied Probability.

[15]  R. Weber,et al.  Optimal control of service rates in networks of queues , 1987, Advances in Applied Probability.

[16]  David Yao,et al.  Decentralized control of service rates in a closed Jackson network , 1987, 26th IEEE Conference on Decision and Control.

[17]  P. Whittle Restless bandits: activity allocation in a changing world , 1988, Journal of Applied Probability.

[18]  M. Kijima,et al.  FURTHER RESULTS FOR DYNAMIC SCHEDULING OF MULTICLASS G/G/1 QUEUES , 1989 .

[19]  J. Bather,et al.  Multi‐Armed Bandit Allocation Indices , 1990 .

[20]  Xi-Ren Cao,et al.  Perturbation analysis of discrete event dynamic systems , 1991 .

[21]  Susan H. Xu,et al.  Optimal Expulsion Control - A Dual Approach to Admission Control of an Ordered-Entry System , 1993, Oper. Res..

[22]  Shaler Stidham,et al.  A survey of Markov decision models for control of networks of queues , 1993, Queueing Syst. Theory Appl..

[23]  R. Weber On a conjecture about assigning jobs to processors of differing speeds , 1993, IEEE Trans. Autom. Control..

[24]  Don Towsley,et al.  Optimal scheduling in a machine with stochastic varying processing rate , 1994, IEEE Trans. Autom. Control..

[25]  Xi-Ren Cao,et al.  Realization Probabilities: The Dynamics of Queuing Systems , 1994 .

[26]  Martin L. Puterman,et al.  Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .

[27]  Dye-Jyun Ma,et al.  A direct approach to decentralized control of service rates in a closed Jackson network , 1994 .

[28]  J. V. Mieghem Dynamic Scheduling with Convex Delay Costs: The Generalized CU Rule , 1995 .

[29]  Michael C. Fu,et al.  Monotone Optimal Policies for a Transient Queueing Staffing Problem , 2000, Oper. Res..

[30]  Panagiotis Kasteridis,et al.  On the d-policy for the m/g/1 queue , 2001 .

[31]  V. V. Rykov Monotone Control of Queueing Systems with Heterogeneous Servers , 2001, Queueing Syst. Theory Appl..

[32]  Yannis Viniotis,et al.  Threshold control policies for heterogeneous server systems , 2002, Math. Methods Oper. Res..

[33]  Refael Hassin,et al.  To Queue or Not to Queue: Equilibrium Behavior in Queueing Systems , 2002 .

[34]  Jan A. Van Mieghem Due-Date Scheduling: Asymptotic Optimality of Generalized Longest Queue and Generalized Largest Delay Rules , 2003, Oper. Res..

[35]  Quanlin Li,et al.  Two Types of RG-Factorizations of Quasi-birth-and-death Processes and Their Applications to Stochastic Integral Functionals , 2004 .

[36]  Alexander L. Stolyar,et al.  Scheduling Flexible Servers with Convex Delay Costs: Heavy-Traffic Optimality of the Generalized cµ-Rule , 2004, Oper. Res..

[37]  Asser N. Tantawi,et al.  An analytical model for multi-tier internet services and its applications , 2005, SIGMETRICS '05.

[38]  Naishuo Tian,et al.  Vacation Queueing Models Theory and Applications , 2006 .

[39]  Francis de Véricourt,et al.  On the incomplete results for the heterogeneous server problem , 2006, Queueing Syst. Theory Appl..

[40]  Zhe George Zhang,et al.  Performance Analysis of a Queue with Congestion-Based Staffing Policy , 2009, Manag. Sci..

[41]  R. B. Patel,et al.  EEHC: Energy efficient heterogeneous clustered scheme for wireless sensor networks , 2009, Comput. Commun..

[42]  Krishna Kant,et al.  Data center evolution: A tutorial on state of the art, issues, and challenges , 2009, Comput. Networks.

[43]  N. Shimkin,et al.  The c / Rule for Many-Server Queues with Abandonment , 2009 .

[44]  Xi-Ren Cao,et al.  Stochastic learning and optimization - A sensitivity-based approach , 2007, Annual Reviews in Control.

[45]  Mor Harchol-Balter,et al.  Optimality analysis of energy-performance trade-off for server farm management , 2010, Perform. Evaluation.

[46]  Gong Zhang,et al.  Adaptive Data Migration in Multi-tiered Storage Based Cloud Environment , 2010, 2010 IEEE 3rd International Conference on Cloud Computing.

[47]  Daniel Mossé,et al.  Power optimization for dynamic configuration in heterogeneous web server clusters , 2010, J. Syst. Softw..

[48]  Amy R. Ward,et al.  Fair Dynamic Routing in Large-Scale Heterogeneous-Server Systems , 2010, Oper. Res..

[49]  Christos G. Cassandras,et al.  Optimal scheduling of parallel queues using stochastic flow models , 2011, Discret. Event Dyn. Syst..

[50]  Amy R. Ward,et al.  Blind Fair Routing in Large-Scale Service Systems with Heterogeneous Customers and Servers , 2013, Oper. Res..

[51]  Li Xia,et al.  Max-Min Optimality of Service Rate Control in Closed Queueing Networks , 2013, IEEE Transactions on Automatic Control.

[52]  Douglas G. Down,et al.  Energy-Aware Scheduling on Heterogeneous Processors , 2014, IEEE Transactions on Automatic Control.

[53]  Peter J. Varman,et al.  Balancing fairness and efficiency in tiered storage systems with bottleneck-aware allocation , 2014, FAST.

[54]  Li Xia,et al.  Service rate control of closed Jackson networks from game theoretic perspective , 2014, Eur. J. Oper. Res..

[55]  Qing-Shan Jia,et al.  Parameterized Markov decision process and its application to service rate control , 2015, Autom..

[56]  Soroush Saghafian,et al.  A cµ Rule for Two-Tiered Parallel Servers , 2015 .

[57]  Refael Hassin,et al.  Optimal service‐capacity allocation in a loss system , 2015 .

[58]  William P. Millhiser,et al.  Optimality of the fastest available server policy , 2016, Queueing Syst. Theory Appl..

[59]  Zhongsheng Hua,et al.  Competition and Coordination in Two‐tier Public Service Systems Under Government Fiscal Policy , 2016 .

[60]  Moshe Zukerman,et al.  Asymptotically Optimal Job Assignment for Energy-Efficient Processor-Sharing Server Farms , 2016, IEEE Journal on Selected Areas in Communications.

[61]  Mark S. Squillante,et al.  Optimal Resource Capacity Management for Stochastic Networks , 2017, Oper. Res..

[62]  Li Xia,et al.  Group-Server Queues , 2017, QTNA.

[63]  John N. Tsitsiklis,et al.  Flexible Queueing Architectures , 2015, Oper. Res..