Paper information - Optimal index rules for single resource allocation to stochastic dynamic competitors


In this paper we present a generic Markov decision process model of optimal single resource allocation to a collection of stochastic dynamic competitors. The main goal is to identify sufficient conditions under which this problem is optimally solved by an index rule. The main focus is on the frozen-if-not-allocated assumption, which notably arises in problems including the multi-armed bandit problem, the tax problem, Klimov networks, job sequencing, and object search and detection. The problem is approached by a Lagrangian relaxation and decomposed into a collection of normalized parametric single-competitor subproblems, which are then optimally solved by the well-known Gittins index. We show that the problem is equivalent to solving a time sequence of its Lagrangian relaxations. We further show that our approach yields insight into sufficient conditions for optimality of index rules in restless problems (in which the frozen-if-not-allocated assumption is dropped) with a single resource; this paper is the first to prove such conditions.
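To make the notion of an index rule concrete, the following is a minimal Python sketch under stated assumptions, not the paper's own algorithm. It assumes each competitor is a finite-state Markov reward chain with a discount factor `beta`, and it computes Gittins indices with the restart-in-state formulation of Katehakis and Veinott, solved here by plain value iteration. The function names `gittins_indices` and `index_rule`, and the parameters `tol` and `max_iter`, are hypothetical and chosen only for illustration.

```python
import numpy as np


def gittins_indices(P, r, beta, tol=1e-10, max_iter=100_000):
    """Approximate the Gittins index of every state of one competitor.

    Assumptions (not taken from the paper): P is an n-by-n transition
    matrix, r is a length-n reward vector, and 0 < beta < 1 is the
    discount factor. For each state i we solve the "restart in i"
    problem by value iteration and scale its value by (1 - beta).
    """
    P = np.asarray(P, dtype=float)
    r = np.asarray(r, dtype=float)
    n = len(r)
    indices = np.empty(n)
    for i in range(n):
        V = np.zeros(n)
        for _ in range(max_iter):
            # Value of continuing from each state for one more step.
            cont = r + beta * (P @ V)
            # Restarting means behaving as if the chain were in state i.
            V_new = np.maximum(cont, cont[i])
            converged = np.max(np.abs(V_new - V)) < tol
            V = V_new
            if converged:
                break
        indices[i] = (1.0 - beta) * V[i]
    return indices


def index_rule(index_tables, states):
    """Return the position of the competitor with the highest current index.

    index_tables[k][s] is the index of competitor k in state s, and
    states[k] is the current state of competitor k.
    """
    return max(range(len(states)), key=lambda k: index_tables[k][states[k]])


if __name__ == "__main__":
    # Hypothetical two-state competitor: state 0 pays 1 and then moves to
    # the absorbing state 1, which pays 0.
    P = [[0.0, 1.0], [0.0, 1.0]]
    r = [1.0, 0.0]
    table = gittins_indices(P, r, beta=0.9)
    # Two identical competitors, the first in state 1, the second in state 0.
    print(table, index_rule([table, table], states=[1, 0]))
```

Under the frozen-if-not-allocated assumption, only the competitor that receives the resource changes state, so the index tables can be computed once per competitor and looked up at every decision epoch.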

Peter Jacko
