From Bandits to Experts: A Tale of Domination and Independence
暂无分享,去创建一个
Noga Alon | Claudio Gentile | Nicolò Cesa-Bianchi | Yishay Mansour | Y. Mansour | N. Alon | Nicolò Cesa-Bianchi | N. Cesa-Bianchi | C. Gentile
[1] Vasek Chvátal,et al. A Greedy Heuristic for the Set-Covering Problem , 1979, Math. Oper. Res..
[2] Manfred K. Warmuth,et al. The weighted majority algorithm , 1989, 30th Annual Symposium on Foundations of Computer Science.
[3] Vladimir Vovk,et al. Aggregating strategies , 1990, COLT '90.
[4] Alan M. Frieze,et al. On the independence number of random graphs , 1990, Discret. Math..
[5] Noga Alon,et al. The Probabilistic Method , 2015, Fundamentals of Ramsey Theory.
[6] David Haussler,et al. How to use expert advice , 1993, STOC.
[7] Manfred K. Warmuth,et al. The Weighted Majority Algorithm , 1994, Inf. Comput..
[8] Yoav Freund,et al. A decision-theoretic generalization of on-line learning and an application to boosting , 1995, EuroCOLT.
[9] Claudio Gentile,et al. Adaptive and Self-Confident On-Line Learning Algorithms , 2000, J. Comput. Syst. Sci..
[10] Peter Auer,et al. The Nonstochastic Multiarmed Bandit Problem , 2002, SIAM J. Comput..
[11] Santosh S. Vempala,et al. Efficient algorithms for online decision problems , 2005, J. Comput. Syst. Sci..
[12] Gábor Lugosi,et al. Prediction, learning, and games , 2006 .
[13] Jean-Yves Audibert,et al. Minimax Policies for Adversarial and Stochastic Bandits. , 2009, COLT 2009.
[14] Nicolò Cesa-Bianchi,et al. Combinatorial Bandits , 2012, COLT.
[15] A. Said,et al. How social relationships affect user similarities , 2010 .
[16] Shie Mannor,et al. From Bandits to Experts: On the Value of Side-Observations , 2011, NIPS.