Trading-Off Static and Dynamic Regret in Online Least-Squares and Beyond

Recursive least-squares algorithms often use forgetting factors as a heuristic to adapt to non-stationary data streams. The first contribution of this paper is a rigorous characterization of the effect of forgetting factors for a class of online Newton algorithms. For exp-concave and strongly convex objectives, the algorithms achieve dynamic regret of $\max\{O(\log T), O(\sqrt{TV})\}$, where $V$ is a bound on the path length of the comparison sequence. In particular, we show how classic recursive least-squares with a forgetting factor achieves this dynamic regret bound. By varying $V$, we obtain a trade-off between static and dynamic regret. Our second contribution is a novel gradient descent step-size rule for strongly convex functions, which yields more computationally efficient algorithms while recovering the order-optimal dynamic regret bounds described above. For smooth problems, we can also obtain static regret of $O(T^{1-\beta})$ and dynamic regret of $O(T^{\beta} V^*)$, where $\beta \in (0,1)$ and $V^*$ is the path length of the sequence of minimizers. Varying $\beta$ again trades off static against dynamic regret.
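
To make the forgetting-factor mechanism concrete, the sketch below implements the textbook recursive least-squares (RLS) update with forgetting factor $\lambda \in (0,1]$, the classic algorithm the abstract refers to. The function name `rls_forgetting`, the regularization constant `delta`, and the default $\lambda = 0.95$ are illustrative choices, not from the paper; the paper's contribution concerns how $\lambda$ should be tuned (e.g., as a function of the horizon $T$ and the path-length budget $V$) to obtain the stated regret bounds.

```python
import numpy as np

def rls_forgetting(X, y, lam=0.95, delta=1.0):
    """Recursive least-squares with forgetting factor lam in (0, 1].

    X: (T, d) array of feature vectors, y: (T,) array of targets.
    Returns the (T, d) sequence of parameter estimates.
    """
    T, d = X.shape
    w = np.zeros(d)        # current parameter estimate
    P = np.eye(d) / delta  # inverse of the discounted, regularized covariance
    W = np.empty((T, d))
    for t in range(T):
        x = X[t]
        Px = P @ x
        k = Px / (lam + x @ Px)          # gain vector
        w = w + k * (y[t] - w @ x)       # correct by the prediction error
        P = (P - np.outer(k, Px)) / lam  # rank-one downdate, rescaled by 1/lam
        W[t] = w
    return W

if __name__ == "__main__":
    # Toy usage: track a slowly drifting linear model.
    rng = np.random.default_rng(0)
    T, d = 500, 3
    X = rng.standard_normal((T, d))
    w_true = np.cumsum(0.01 * rng.standard_normal((T, d)), axis=0)
    y = np.einsum('td,td->t', X, w_true) + 0.1 * rng.standard_normal(T)
    W = rls_forgetting(X, y, lam=0.95)
```

Smaller $\lambda$ discounts old data faster, which improves tracking of drifting parameters at the cost of higher variance; $\lambda = 1$ recovers ordinary RLS, whose static regret for least-squares losses is logarithmic in $T$.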
