论文信息 - Stochastic and Adversarial Online Learning without Hyperparameters

Stochastic and Adversarial Online Learning without Hyperparameters

Most online optimization algorithms focus on one of two things: performing well in adversarial settings by adapting to unknown data parameters (such as Lipschitz constants), typically achieving $O(\sqrt{T})$ regret, or performing well in stochastic settings where they can leverage some structure in the losses (such as strong convexity), typically achieving $O(\log(T))$ regret. Algorithms that focus on the former problem hitherto achieved $O(\sqrt{T})$ in the stochastic setting rather than $O(\log(T))$. Here we introduce an online optimization algorithm that achieves $O(\log^4(T))$ regret in a wide class of stochastic settings while gracefully degrading to the optimal $O(\sqrt{T})$ regret in adversarial settings (up to logarithmic factors). Our algorithm does not require any prior knowledge about the data or tuning of parameters to achieve superior performance.

Ashok Cutkosky | Kwabena A. Boahen | K. Boahen | Ashok Cutkosky

[1] Matthew J. Streeter,et al. Adaptive Bound Optimization for Online Convex Optimization , 2010, COLT 2010.

[2] Martin Zinkevich,et al. Online Convex Programming and Generalized Infinitesimal Gradient Ascent , 2003, ICML.

[3] P. Bartlett,et al. Optimal strategies and minimax lower bounds for online convex games [Technical Report No. UCB/EECS-2008-19] , 2008 .

[4] Wouter M. Koolen,et al. MetaGrad: Multiple Learning Rates in Online Learning , 2016, NIPS.

[5] Yoram Singer,et al. Adaptive Subgradient Methods for Online Learning and Stochastic Optimization , 2011, J. Mach. Learn. Res..

[6] Ashok Cutkosky,et al. Online Learning Without Prior Information , 2017, COLT.

[7] Matthew J. Streeter,et al. No-Regret Algorithms for Unconstrained Online Convex Optimization , 2012, NIPS.

[8] Peter L. Bartlett,et al. Adaptive Online Gradient Descent , 2007, NIPS.

[9] Elad Hazan,et al. Logarithmic regret algorithms for online convex optimization , 2006, Machine Learning.

[10] Francesco Orabona,et al. Dimension-Free Exponentiated Gradient , 2013, NIPS.

[11] Ashok Cutkosky,et al. Online Convex Optimization with Unconstrained Domains and Losses , 2017, NIPS.

[12] Wouter M. Koolen,et al. Combining Adversarial Guarantees and Stochastic Fast Rates in Online Learning , 2016, NIPS.

[13] Shai Shalev-Shwartz,et al. Online Learning and Online Convex Optimization , 2012, Found. Trends Mach. Learn..

[14] Alessandro Lazaric,et al. Exploiting easy data in online optimization , 2014, NIPS.

[15] Francesco Orabona,et al. Coin Betting and Parameter-Free Online Learning , 2016, NIPS.