Paper information - Multi-armed bandit problems with multiple plays and switching cost - 字舞流文

Multi-armed bandit problems with multiple plays and switching cost

R. Agrawal | D. Teneketzis | M. Hegde

[1] J. Walrand, et al. Asymptotically efficient allocation rules for the multiarmed bandit problem with multiple plays-Part II: Markovian rewards, 1987.

[2] H. Robbins, et al. Asymptotically efficient adaptive allocation rules, 1985.

[3] Sheldon M. Ross, et al. Stochastic Processes, 2018, Gauge Integral Structures for Stochastic Calculus and Quantum Electrodynamics.