[1] Tapio Elomaa,et al. On Following the Perturbed Leader in the Bandit Setting , 2005, ALT.
[2] Jan Poland,et al. FPL Analysis for Adaptive Bandits , 2005, SAGA.
[3] Santosh S. Vempala,et al. Efficient algorithms for online decision problems , 2005, J. Comput. Syst. Sci.
[4] H. Robbins. Some aspects of the sequential design of experiments , 1952 .
[5] Wojciech Kotlowski,et al. Follow the Leader with Dropout Perturbations , 2014, COLT.
[6] Luc Devroye,et al. Prediction by random-walk perturbation , 2013, COLT.
[7] M. Bagnoli,et al. Log-concave probability and its applications , 2004 .
[8] Árpád Baricz,et al. Mills' ratio: Monotonicity patterns and functional inequalities , 2008 .
[9] Gergely Neu,et al. An Efficient Algorithm for Learning with Semi-bandit Feedback , 2013, ALT.
[10] Ewart A. C. Thomas. Sufficient Conditions for Monotone Hazard Rate: An Application to Latency-Probability Curves , 2003 .
[11] Ambuj Tewari,et al. Online Linear Optimization via Smoothing , 2014, COLT.
[12] James Hannan,et al. Approximation to Bayes Risk in Repeated Play , 1958 .
[13] Sébastien Bubeck,et al. Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems , 2012, Found. Trends Mach. Learn..
[14] Ambuj Tewari,et al. Fighting Bandits with a New Kind of Smoothness , 2015, NIPS.
[15] Gábor Lugosi,et al. Minimax Policies for Combinatorial Prediction Games , 2011, COLT.
[16] S. Foss,et al. An Introduction to Heavy-Tailed and Subexponential Distributions , 2011 .
[17] D. Bertsekas. Stochastic optimization problems with nondifferentiable cost functionals , 1973 .
[18] Richard A. Chechile,et al. Mathematical tools for hazard function analysis , 2003 .
[19] Y. Freund,et al. The non-stochastic multi-armed bandit problem , 2001 .
[20] Richard A. Chechile. Corrigendum to: “Mathematical tools for hazard function analysis” [J. Math. Psychol. 47 (2003) 478–494] , 2009 .