Multi-armed bandit problems with multiple plays and switching cost
暂无分享,去创建一个
R. Agrawal | D. Teneketzis | M. Hegde | A. R. | Hegde M | T. D.
[1] J. Walrand,et al. Asymptotically efficient allocation rules for the multiarmed bandit problem with multiple plays-Part II: Markovian rewards , 1987 .
[2] H. Robbins,et al. Asymptotically efficient adaptive allocation rules , 1985 .
[3] Sheldon M. Ross,et al. Stochastic Processes , 2018, Gauge Integral Structures for Stochastic Calculus and Quantum Electrodynamics.