Online combinatorial optimization with stochastic decision sets and adversarial losses
[1] Philip Wolfe,et al. Contributions to the theory of games , 1953 .
[2] James Hannan,et al. Approximation to Bayes Risk in Repeated Play , 1958 .
[3] Yoram Singer,et al. Using and combining predictors that specialize , 1997, STOC '97.
[4] G. Koshevoy. Choice functions and abstract convex geometries , 1999 .
[5] Peter Auer,et al. The Nonstochastic Multiarmed Bandit Problem , 2002, SIAM J. Comput.
[6] Marcus Hutter,et al. Prediction with Expert Advice by Following the Perturbed Leader for General Weights , 2004, ALT.
[7] Avrim Blum,et al. Online Geometric Optimization in the Bandit Setting Against an Adaptive Adversary , 2004, COLT.
[8] Santosh S. Vempala,et al. Efficient algorithms for online decision problems , 2005, Journal of Computer and System Sciences.
[9] Gábor Lugosi,et al. Prediction, learning, and games , 2006 .
[10] András György,et al. The On-Line Shortest Path Problem Under Partial Monitoring , 2007 .
[11] Thomas P. Hayes,et al. The Price of Bandit Information for Online Optimization , 2007, NIPS.
[12] Nicolò Cesa-Bianchi,et al. Combinatorial Bandits , 2012, COLT.
[13] Varun Kanade,et al. Sleeping Experts and Bandits with Stochastic Action Availability and Adversarial Rewards , 2009, AISTATS.
[14] Gerhard Friedrich,et al. Recommender Systems - An Introduction , 2010 .
[15] Robert D. Kleinberg,et al. Regret bounds for sleeping experts and bandits , 2010, Machine Learning.
[16] Sham M. Kakade,et al. Towards Minimax Policies for Online Linear Optimization with Bandit Feedback , 2012, COLT.
[17] Gergely Neu,et al. An Efficient Algorithm for Learning with Semi-bandit Feedback , 2013, ALT.
[18] Gábor Lugosi,et al. Regret in Online Combinatorial Optimization , 2012, Math. Oper. Res.
[19] Thomas Steinke,et al. Learning Hurdles for Sleeping Experts , 2014, ACM Trans. Comput. Theory.