Online Optimization in X-Armed Bandits
Csaba Szepesvári | Rémi Munos | Sébastien Bubeck | Gilles Stoltz
[1] J. Doob. Stochastic Processes, 1953.
[2] R. Agrawal. The Continuum-Armed Bandit Problem, 1995.
[3] Robert D. Kleinberg. Nearly Tight Bounds for the Continuum-Armed Bandit Problem, 2004, NIPS.
[4] Olivier Teytaud, et al. Modification of UCT with Patterns in Monte-Carlo Go, 2006.
[5] Gábor Lugosi, et al. Prediction, Learning, and Games, 2006.
[6] Csaba Szepesvári, et al. Bandit Based Monte-Carlo Planning, 2006, ECML.
[7] Rémi Munos, et al. Bandit Algorithms for Tree Search, 2007, UAI.
[8] Peter Auer, et al. Improved Rates for the Stochastic Continuum-Armed Bandit Problem, 2007, COLT.
[9] Eli Upfal, et al. Multi-Armed Bandits in Metric Spaces, 2008.