From Bandits to Experts: On the Value of Side-Observations
暂无分享,去创建一个
[1] V. Baston. Some cyclic inequalities , 1974 .
[2] R. Agrawal. The Continuum-Armed Bandit Problem , 1995 .
[3] Peter Auer,et al. The Nonstochastic Multiarmed Bandit Problem , 2002, SIAM J. Comput..
[4] Zhi Ning Chen,et al. Ultra Wideband Wireless Communication , 2005 .
[5] Gábor Lugosi,et al. Prediction, learning, and games , 2006 .
[6] David Zuckerman,et al. Electronic Colloquium on Computational Complexity, Report No. 100 (2005) Linear Degree Extractors and the Inapproximability of MAX CLIQUE and CHROMATIC NUMBER , 2005 .
[7] John Langford,et al. The Epoch-Greedy Algorithm for Multi-armed Bandits with Side Information , 2007, NIPS.
[8] Eli Upfal,et al. Multi-Armed Bandits in Metric Spaces ∗ , 2008 .
[9] Jean-Yves Audibert,et al. Minimax Policies for Adversarial and Stochastic Bandits. , 2009, COLT 2009.
[10] Nicolò Cesa-Bianchi,et al. Combinatorial Bandits , 2012, COLT.
[11] John N. Tsitsiklis,et al. Linearly Parameterized Bandits , 2008, Math. Oper. Res..
[12] Wei Chu,et al. A contextual-bandit approach to personalized news article recommendation , 2010, WWW '10.