Stochastic and fluid index policies for resource allocation problems

We develop a unifying framework to obtain efficient index policies for restless multi-armed bandit problems with birth-and-death state evolution. This is a broad class of stochastic resource allocation problems whose objective is to determine efficient policies to share resources among competing projects. In a seminal work, Whittle developed a methodology to derive well-performing (Whittle's) index policies that are obtained by solving a relaxed version of the original problem. Our first main contribution is the derivation of a closed-form expression for Whittle's index as a function of the steady-state probabilities. It can be efficiently calculated, however, it requires several technical conditions to be verified, and in addition, it does not provide qualitative insights into Whittle's index. We therefore formulate a fluid version of the relaxed optimization problem and in our second main contribution we develop a fluid index policy. The latter does provide qualitative insights and is close to Whittle's index. The applicability of our approach is illustrated by two important problems: optimal class selection and optimal load balancing. Allowing state-dependent capacities we can model important phenomena: e.g. power-aware server-farms and opportunistic scheduling in wireless systems. Numerical simulations show that Whittle's index and our fluid index policy are both nearly optimal.

[1]  Kevin D. Glazebrook,et al.  Index Policies for the Admission Control and Routing of Impatient Customers to Heterogeneous Service Stations , 2009, Oper. Res..

[2]  Carlos F. G. Bispo The single-server scheduling problem with convex costs , 2013, Queueing Syst. Theory Appl..

[3]  Kevin D. Glazebrook,et al.  Whittle's index policy for a multi-class queueing system with convex holding costs , 2003, Math. Methods Oper. Res..

[4]  P. Whittle Restless Bandits: Activity Allocation in a Changing World , 1988 .

[5]  J. Michael Harrison,et al.  Dynamic Control of a Queue with Adjustable Service Rate , 2001, Oper. Res..

[6]  Kevin D. Glazebrook,et al.  Stochastic scheduling: A short history of index policies and new approaches to index generation for dynamic resource allocation , 2014, J. Sched..

[7]  Atilla Eryilmaz,et al.  Asymptotically optimal downlink scheduling over Markovian fading channels , 2012, 2012 Proceedings IEEE INFOCOM.

[8]  2015 IEEE Conference on Computer Communications, INFOCOM 2015, Kowloon, Hong Kong, April 26 - May 1, 2015 , 2015, IEEE Conference on Computer Communications.

[9]  Christian M. Ernst,et al.  Multi-armed Bandit Allocation Indices , 1989 .

[10]  I. M. Verloop Asymptotic optimal control of multi-class restless bandits , 2013 .

[11]  Linus Schrage,et al.  The Queue M/G/1 with the Shortest Remaining Processing Time Discipline , 1966, Oper. Res..

[12]  Urtzi Ayesta,et al.  Index policies for a multi-class queue with convex holding cost and abandonments , 2014, SIGMETRICS '14.

[13]  Elena Yudovina,et al.  Stochastic networks , 1995, Physics Subject Headings (PhySH).

[14]  Jean Walrand,et al.  The c# rule revisited , 1985 .

[15]  Nahum Shimkin,et al.  The cµ/theta Rule for Many-Server Queues with Abandonment , 2010, Oper. Res..

[16]  J. Bather,et al.  Multi‐Armed Bandit Allocation Indices , 1990 .

[17]  Lachlan L. H. Andrew,et al.  Power-Aware Speed Scaling in Processor Sharing Systems , 2009, IEEE INFOCOM 2009.

[18]  Martin L. Puterman,et al.  Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .

[19]  Urtzi Ayesta,et al.  Dynamic fluid-based scheduling in a multi-class abandonment queue , 2013, Perform. Evaluation.

[20]  J. V. Mieghem Dynamic Scheduling with Convex Delay Costs: The Generalized CU Rule , 1995 .

[21]  R. Weber,et al.  On an index policy for restless bandits , 1990, Journal of Applied Probability.

[22]  José Niño-Mora,et al.  Dynamic priority allocation via restless bandit marginal productivity indices , 2007, 2304.06115.

[23]  Sem C. Borst User-level performance of channel-aware scheduling algorithms in wireless data networks , 2005, IEEE/ACM Transactions on Networking.