Combinatorial Partial Monitoring Game with Linear Feedback and Its Applications
Wei Chen | John C. S. Lui | Robert D. Kleinberg | Bruno D. Abrahao | Tian Lin
[1] Jeffrey D. Smith,et al. Design and Analysis of Algorithms , 2009, Lecture Notes in Computer Science.
[2] Vladimir Vovk,et al. Aggregating strategies , 1990, COLT '90.
[3] Manfred K. Warmuth,et al. The Weighted Majority Algorithm , 1994, Inf. Comput..
[4] Christian Schindelhauer,et al. Discrete Prediction Games with Arbitrary Feedback and Loss , 2001, COLT/EuroCOLT.
[5] Peter Auer,et al. Finite-time Analysis of the Multiarmed Bandit Problem , 2002, Machine Learning.
[6] Nicolò Cesa-Bianchi,et al. Regret Minimization Under Partial Monitoring , 2006, 2006 IEEE Information Theory Workshop - ITW '06 Punta del Este.
[7] Gábor Lugosi,et al. Prediction, learning, and games , 2006 .
[8] H. Robbins. Some aspects of the sequential design of experiments , 1952 .
[9] Jean-Yves Audibert,et al. Minimax Policies for Adversarial and Stochastic Bandits , 2009, COLT 2009.
[10] Nicolò Cesa-Bianchi,et al. Combinatorial Bandits , 2012, COLT.
[11] Csaba Szepesvári,et al. Toward a classification of finite partial-monitoring games , 2010, Theor. Comput. Sci..
[12] Csaba Szepesvári,et al. Minimax Regret of Finite Partial-Monitoring Games in Stochastic Environments , 2011, COLT.
[13] Sébastien Bubeck,et al. Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems , 2012, Found. Trends Mach. Learn..
[14] Bhaskar Krishnamachari,et al. Combinatorial Network Optimization With Unknown Variables: Multi-Armed Bandits With Linear Rewards and Individual Observations , 2010, IEEE/ACM Transactions on Networking.
[15] Csaba Szepesvári,et al. An adaptive algorithm for finite stochastic partial monitoring , 2012, ICML.
[17] Wei Chen,et al. Combinatorial Multi-Armed Bandit: General Framework and Applications , 2013, ICML.
[18] T. L. Lai and Herbert Robbins. Asymptotically Efficient Adaptive Allocation Rules , 1985 .