暂无分享,去创建一个
Huan Liu | Ling Jian | Xijun Liang | Binbin Liu | Jundong Li | Yunquan Song | Huan Liu | Jundong Li | Yunquan Song | Xijun Liang | Ling Jian | Binbin Liu
[1] Adam Tauman Kalai,et al. Online convex optimization in the bandit setting: gradient descent without a gradient , 2004, SODA '05.
[2] Sanjeev Arora,et al. Efficient algorithms for online convex optimization and their applications , 2006 .
[3] Shai Shalev-Shwartz,et al. Online Learning and Online Convex Optimization , 2012, Found. Trends Mach. Learn..
[4] Philip S. Yu,et al. Mining concept-drifting data streams using ensemble classifiers , 2003, KDD '03.
[5] Patrick Thiran,et al. Stochastic Optimization with Bandit Sampling , 2017, ArXiv.
[6] Steven C. H. Hoi,et al. Online Learning: A Comprehensive Survey , 2018, Neurocomputing.
[7] Santosh S. Vempala,et al. Simulated annealing in convex bodies and an O*(n4) volume algorithm , 2006, J. Comput. Syst. Sci..
[8] Ohad Shamir,et al. On the Complexity of Bandit and Derivative-Free Stochastic Convex Optimization , 2012, COLT.
[9] Elad Hazan,et al. Bandit Convex Optimization: Towards Tight Bounds , 2014, NIPS.
[10] Mehryar Mohri,et al. Optimistic Bandit Convex Optimization , 2016, NIPS.
[11] Gergely Neu,et al. An Efficient Algorithm for Learning with Semi-bandit Feedback , 2013, ALT.
[12] Ryoji Kataoka,et al. Robust Online Learning to Rank via Selective Pairwise Approach Based on Evaluation Measures , 2013, Inf. Media Technol..
[13] Tjalling J. Ypma,et al. Historical Development of the Newton-Raphson Method , 1995, SIAM Rev..
[14] Santosh S. Vempala,et al. Simulated annealing in convex bodies and an O*(n/sup 4/) volume algorithm , 2003, 44th Annual IEEE Symposium on Foundations of Computer Science, 2003. Proceedings..
[15] Elad Hazan,et al. Interior-Point Methods for Full-Information and Bandit Online Learning , 2012, IEEE Transactions on Information Theory.
[16] Robert D. Kleinberg. Nearly Tight Bounds for the Continuum-Armed Bandit Problem , 2004, NIPS.
[17] Csaba Szepesvári,et al. Online Learning to Rank in Stochastic Click Models , 2017, ICML.
[18] Ambuj Tewari,et al. Improved Regret Guarantees for Online Smooth Convex Optimization with Bandit Feedback , 2011, AISTATS.
[19] Miklós Simonovits,et al. Random walks and an O*(n5) volume algorithm for convex bodies , 1997, Random Struct. Algorithms.
[20] Martin Zinkevich,et al. Online Convex Programming and Generalized Infinitesimal Gradient Ascent , 2003, ICML.
[21] Thomas P. Hayes,et al. The Price of Bandit Information for Online Optimization , 2007, NIPS.
[22] Mingyan Liu,et al. Online algorithms for the multi-armed bandit problem with Markovian rewards , 2010, 2010 48th Annual Allerton Conference on Communication, Control, and Computing (Allerton).
[23] Elad Hazan,et al. Logarithmic regret algorithms for online convex optimization , 2006, Machine Learning.
[24] Yi Ding,et al. Large Scale Kernel Methods for Online AUC Maximization , 2017, 2017 IEEE International Conference on Data Mining (ICDM).
[25] Shai Shalev-Shwartz,et al. On Graduated Optimization for Stochastic Non-Convex Problems , 2015, ICML.
[26] Elad Hazan,et al. Competing in the Dark: An Efficient Algorithm for Bandit Linear Optimization , 2008, COLT.
[27] V. Milman,et al. Isotropic position and inertia ellipsoids and zonoids of the unit ball of a normed n-dimensional space , 1989 .
[28] Steven C. H. Hoi,et al. LIBOL: a library for online learning algorithms , 2014, J. Mach. Learn. Res..
[29] Yin Tat Lee,et al. Kernel-based methods for bandit convex optimization , 2016, STOC.
[30] Lin Xiao,et al. Optimal Algorithms for Online Convex Optimization with Multi-Point Bandit Feedback. , 2010, COLT 2010.
[31] Yuanzhi Li,et al. An optimal algorithm for bandit convex optimization , 2016, ArXiv.
[32] Avrim Blum,et al. Online Geometric Optimization in the Bandit Setting Against an Adaptive Adversary , 2004, COLT.
[33] Geoffrey J. Gordon. Regret bounds for prediction problems , 1999, COLT '99.