Multiple Identifications in Multi-Armed Bandits
Sébastien Bubeck | Tengyao Wang | Nitin Viswanathan
[1] John N. Tsitsiklis, et al. The Sample Complexity of Exploration in the Multi-Armed Bandit Problem, 2004, J. Mach. Learn. Res.
[2] R. Bechhofer. A Single-Sample Multiple Decision Procedure for Ranking Means of Normal Populations with Known Variances, 1954.
[3] R. Munos, et al. Best Arm Identification in Multi-Armed Bandits, 2010, COLT.
[4] Rémi Munos, et al. Pure exploration in finitely-armed and continuous-armed bandits, 2011, Theor. Comput. Sci.
[5] Rémi Munos, et al. Pure Exploration in Multi-armed Bandits Problems, 2009, ALT.
[6] Ambuj Tewari, et al. PAC Subset Selection in Stochastic Multi-armed Bandits, 2012, ICML.
[7] Shie Mannor, et al. PAC Bounds for Multi-armed Bandit and Markov Decision Processes, 2002, COLT.
[8] Alessandro Lazaric, et al. Multi-Bandit Best Arm Identification, 2011, NIPS.