Resource-sharing in a single server with time-varying capacity

We investigate the problem of sharing the resources of a single server with time-varying capacity with the objective of minimizing the mean delay. We formulate the resource allocation problem as a Markov Decision Process. The problem is not solvable analytically in full generality, and we thus set out to obtain an approximate solution. In our main contribution, we extend the framework of multi-armed bandits to develop a heuristic solution of index type. At every given time, the heuristic assigns an index to every user that depends solely on its current state, and serves the user with highest current index value. We show that in the case of constant capacity, the heuristic policy is equivalent to the so-called Gittins index rule, which is known to be optimal under the assumption of constant capacity.

[1]  P. Whittle Restless Bandits: Activity Allocation in a Changing World , 1988 .

[2]  Sem C. Borst User-level performance of channel-aware scheduling algorithms in wireless data networks , 2005, IEEE/ACM Transactions on Networking.

[3]  Urtzi Ayesta,et al.  A modeling framework for optimizing the flow-level scheduling with time-varying channels , 2010, Perform. Evaluation.

[4]  Donald R. Smith Technical Note - A New Proof of the Optimality of the Shortest Remaining Processing Time Discipline , 1978, Oper. Res..

[5]  Linus Schrage,et al.  Letter to the Editor - A Proof of the Optimality of the Shortest Remaining Processing Time Discipline , 1968, Oper. Res..

[6]  Kevin D. Glazebrook,et al.  Multi-Armed Bandit Allocation Indices: Gittins/Multi-Armed Bandit Allocation Indices , 2011 .

[7]  Martin L. Puterman,et al.  Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .

[8]  Leandros Tassiulas,et al.  Dynamic server allocation to parallel queues with randomly varying connectivity , 1993, IEEE Trans. Inf. Theory.

[9]  R. Weber On the Gittins Index for Multiarmed Bandits , 1992 .

[10]  J. Shanthikumar,et al.  On extremal service disciplines in single-stage queueing systems , 1990, Journal of Applied Probability.

[11]  S. F. Yashkov,et al.  Processor-sharing queues: Some progress in analysis , 1987, Queueing Syst. Theory Appl..

[12]  J. George Shanthikumar,et al.  Scheduling Multiclass Single Server Queueing Systems to Stochastically Maximize the Number of Successful Departures , 1989, Probability in the Engineering and Informational Sciences.

[13]  Christian M. Ernst,et al.  Multi-armed Bandit Allocation Indices , 1989 .

[14]  J. Bather,et al.  Multi‐Armed Bandit Allocation Indices , 1990 .

[15]  Lachlan L. H. Andrew,et al.  Power-Aware Speed Scaling in Processor Sharing Systems , 2009, IEEE INFOCOM 2009.

[16]  Urtzi Ayesta,et al.  On the Gittins index in the M/G/1 queue , 2009, Queueing Syst. Theory Appl..

[17]  Kee-Eung Kim,et al.  Solving Very Large Weakly Coupled Markov Decision Processes , 1998, AAAI/IAAI.