Adaptation to Easy Data in Prediction with Limited Advice
暂无分享,去创建一个
[1] Vladimir Vovk,et al. Aggregating strategies , 1990, COLT '90.
[2] Tor Lattimore,et al. Refined Lower Bounds for Adversarial Bandits , 2016, NIPS.
[3] Peter Auer,et al. The Nonstochastic Multiarmed Bandit Problem , 2002, SIAM J. Comput..
[4] Haipeng Luo,et al. Achieving All with No Parameters: AdaNormalHedge , 2015, COLT.
[5] Aleksandrs Slivkins,et al. One Practical Algorithm for Both Stochastic and Adversarial Bandits , 2014, ICML.
[6] Wouter M. Koolen,et al. Second-order Quantile Methods for Experts and Combinatorial Games , 2015, COLT.
[7] H. Robbins. Some aspects of the sequential design of experiments , 1952 .
[8] Gábor Lugosi,et al. Prediction, learning, and games , 2006 .
[9] Koby Crammer,et al. Open Problem: Adversarial Multiarmed Bandits with Limited Advice , 2013, COLT.
[10] Haipeng Luo,et al. More Adaptive Algorithms for Adversarial Bandits , 2018, COLT.
[11] Koby Crammer,et al. Prediction with Limited Advice and Multiarmed Bandits with Paid Observations , 2014, ICML.
[12] Satyen Kale,et al. Multiarmed Bandits With Limited Expert Advice , 2013, COLT.
[13] Ohad Shamir,et al. Bandit Regret Scaling with the Effective Loss Range , 2017, ALT.
[14] H. Robbins,et al. Asymptotically efficient adaptive allocation rules , 1985 .
[15] Gábor Lugosi,et al. An Improved Parametrization and Analysis of the EXP3++ Algorithm for Stochastic and Adversarial Bandits , 2017, COLT.
[16] Manfred K. Warmuth,et al. The weighted majority algorithm , 1989, 30th Annual Symposium on Foundations of Computer Science.
[17] Olivier Wintenberger,et al. Optimal learning with Bernstein online aggregation , 2014, Machine Learning.
[18] Gilles Stoltz,et al. A second-order bound with excess losses , 2014, COLT.
[19] Peter Auer,et al. Finite-time Analysis of the Multiarmed Bandit Problem , 2002, Machine Learning.
[20] Noga Alon,et al. Nonstochastic Multi-Armed Bandits with Graph-Structured Feedback , 2014, SIAM J. Comput..
[21] Peter Auer,et al. An algorithm with nearly optimal pseudo-regret for both stochastic and adversarial bandits , 2016, COLT.
[22] Sébastien Bubeck,et al. Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems , 2012, Found. Trends Mach. Learn..
[23] Yishay Mansour,et al. Improved second-order bounds for prediction with expert advice , 2006, Machine Learning.
[24] Aleksandrs Slivkins,et al. 25th Annual Conference on Learning Theory The Best of Both Worlds: Stochastic and Adversarial Bandits , 2022 .
[25] Julian Zimmert,et al. Tsallis-INF: An Optimal Algorithm for Stochastic and Adversarial Bandits , 2018, J. Mach. Learn. Res..