Multi-armed bandit problem revisited
[1] A. Mandelbaum. Discrete multi-armed bandits and multi-parameter processes, 1986.
[2] P. Whittle. Multi-Armed Bandits and the Gittins Index, 1980.
[3] R. Weber. On the Gittins Index for Multiarmed Bandits, 1992.
[4] K. Glazebrook. Optimal strategies for families of alternative bandit processes, 1983.
[5] Gideon Weiss, et al. Turnpike Optimality of Smith's Rule in Parallel Machines Stochastic Scheduling, 1992, Math. Oper. Res.
[6] J. Bather, et al. Multi-Armed Bandit Allocation Indices, 1990.
[7] Jean Walrand, et al. Extensions of the multiarmed bandit problem: The discounted case, 1985.
[8] P. Whittle. Arm-Acquiring Bandits, 1981.
[9] A. Mandelbaum. Continuous multi-armed bandits and multiparameter processes, 1987.