Priority index heuristic for multi-armed bandit problems with set-up costs and/or set-up time delays
暂无分享,去创建一个
[1] R. Hartley,et al. Optimisation Over Time: Dynamic Programming and Stochastic Control: , 1983 .
[2] Max-Olivier Hongler,et al. Optimal hysteresis for a class of deterministic deteriorating two-armed Bandit problem with switching costs , 2003, Autom..
[3] J. Banks,et al. Switching Costs and the Gittins Index , 1994 .
[4] Albert Y. Ha. Optimal Dynamic Scheduling Policy for a Make-To-Stock Production System , 1997, Oper. Res..
[5] Demosthenis Teneketzis,et al. Multi-armed bandits with switching penalties , 1996, IEEE Trans. Autom. Control..
[6] D. Teneketzis,et al. Optimal stochastic scheduling of forest networks with switching penalties , 1994, Advances in Applied Probability.
[7] Fabrice Dusonchet. Dynamic scheduling for production systems operating in a random environment , 2003 .
[8] J. Ben Atkinson,et al. An Introduction to Queueing Networks , 1988 .