Open Problem: Adversarial Multiarmed Bandits with Limited Advice
暂无分享,去创建一个
[1] Peter Auer,et al. The Nonstochastic Multiarmed Bandit Problem , 2002, SIAM J. Comput..
[2] Jean-Yves Audibert,et al. Regret Bounds and Minimax Policies under Partial Monitoring , 2010, J. Mach. Learn. Res..
[3] Gábor Lugosi,et al. Prediction, learning, and games , 2006 .
[4] John Langford,et al. Contextual Bandit Algorithms with Supervised Learning Guarantees , 2010, AISTATS.
[5] Sébastien Bubeck,et al. Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems , 2012, Found. Trends Mach. Learn..
[6] Ohad Shamir,et al. Efficient Learning with Partially Observed Attributes , 2010, ICML.
[7] Wei Chu,et al. A contextual-bandit approach to personalized news article recommendation , 2010, WWW '10.
[8] Koby Crammer,et al. Prediction with Limited Advice and Multiarmed Bandits with Paid Observations , 2014, ICML.