No-Regret Algorithms for Heavy-Tailed Linear Bandits
[1] Thomas P. Hayes, et al. Stochastic Linear Optimization under Bandit Feedback, 2008, COLT.
[2] Peter Auer, et al. The Nonstochastic Multiarmed Bandit Problem, 2002, SIAM J. Comput.
[3] J. Hull. Risk Management And Financial Institutions, 2006.
[4] Noga Alon, et al. The Space Complexity of Approximating the Frequency Moments, 1999.
[5] Daniel J. Hsu, et al. Heavy-tailed regression with a generalized median-of-means, 2014, ICML.
[6] Nicolò Cesa-Bianchi, et al. Bandits With Heavy Tail, 2012, IEEE Transactions on Information Theory.
[7] Tze Leung Lai, et al. Self-Normalized Processes, 2009.
[8] S. Rachev. Handbook of heavy tailed distributions in finance, 2003.
[9] W. Hoeffding. Probability Inequalities for sums of Bounded Random Variables, 1963.
[10] Nicolò Cesa-Bianchi, et al. Online Learning with Switching Costs and Other Adaptive Adversaries, 2013, NIPS.
[11] Wei Chu, et al. A contextual-bandit approach to personalized news article recommendation, 2010, WWW '10.
[12] Robert D. Kleinberg. Nearly Tight Bounds for the Continuum-Armed Bandit Problem, 2004, NIPS.
[13] Qing Zhao, et al. Adaptive shortest-path routing under unknown and stochastically varying link states, 2012, 10th International Symposium on Modeling and Optimization in Mobile, Ad Hoc and Wireless Networks (WiOpt).
[14] Csaba Szepesvári, et al. Improved Algorithms for Linear Stochastic Bandits, 2011, NIPS.
[15] H. Robbins. Some aspects of the sequential design of experiments, 1952.
[16] John Langford, et al. Taming the Monster: A Fast and Simple Algorithm for Contextual Bandits, 2014, ICML.
[17] John Langford, et al. Contextual Bandit Algorithms with Supervised Learning Guarantees, 2010, AISTATS.
[18] Sébastien Bubeck, et al. Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems, 2012, Found. Trends Mach. Learn.
[19] Peter Auer, et al. Using Confidence Bounds for Exploitation-Exploration Trade-offs, 2003, J. Mach. Learn. Res.
[20] John N. Tsitsiklis, et al. Linearly Parameterized Bandits, 2008, Math. Oper. Res.
[21] Wei Chu, et al. Contextual Bandits with Linear Payoff Functions, 2011, AISTATS.