Adaptive Greedy Rules for Dynamic and Stochastic Resource Capacity Allocation Problems
暂无分享,去创建一个
[1] John J. McCall,et al. The Economics of Search , 2007 .
[2] P. Whittle. Arm-Acquiring Bandits , 1981 .
[3] P. Whittle. Restless bandits: activity allocation in a changing world , 1988, Journal of Applied Probability.
[4] Martin L. Puterman,et al. Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .
[5] José Niño-Mora. An index policy for multiarmed multimode restless bandits , 2008, VALUETOOLS.
[6] G. Dantzig. Discrete-Variable Extremum Problems , 1957 .
[7] Rangarajan K. Sundaram. Generalized Bandit Problems , 2005 .
[8] John N. Tsitsiklis,et al. The complexity of optimal queueing network control , 1994, Proceedings of IEEE 9th Annual Conference on Structure in Complexity Theory.
[9] José Niño-Mora,et al. Dynamic priority allocation via restless bandit marginal productivity indices , 2007, 2304.06115.
[10] Richard Weber,et al. Comments on: Dynamic priority allocation via restless bandit marginal productivity indices , 2007 .
[11] J. Bather,et al. Multi‐Armed Bandit Allocation Indices , 1990 .
[12] P. Jacko. Marginal productivity index policies for dynamic priority allocation in restless bandit models , 2011 .
[13] DE Economist. A SURVEY ON THE BANDIT PROBLEM WITH SWITCHING COSTS , 2004 .
[14] Keith D. Kastella,et al. Foundations and Applications of Sensor Management , 2010 .
[15] John N. Tsitsiklis,et al. The Complexity of Optimal Queuing Network Control , 1999, Math. Oper. Res..