On Online Optimization: Dynamic Regret Analysis of Strongly Convex and Smooth Problems

The regret bound of dynamic online learning algorithms is often expressed in terms of the variation of the function sequence ($V_T$) and/or the path-length of the minimizer sequence after $T$ rounds. For strongly convex and smooth functions, Zhang et al. (2017) establish the squared path-length of the minimizer sequence ($C^*_{2,T}$) as a lower bound on regret. They also show that online gradient descent (OGD) achieves this lower bound using multiple gradient queries per round. In this paper, we focus on unconstrained online optimization. We first show that a preconditioned variant of OGD achieves a regret of $O(\min\{C^*_T, C^*_{2,T}\})$ with one gradient query per round, where $C^*_T$ denotes the standard path-length. We then propose the online optimistic Newton (OON) method for the case where first- and second-order information about the function sequence is predictable. The regret bound of OON is captured via the quartic path-length of the minimizer sequence ($C^*_{4,T}$), which can be much smaller than $C^*_{2,T}$. Finally, we show that by using multiple gradient queries per round, OGD achieves an upper bound of $O(\min\{C^*_{2,T}, V_T\})$ on regret.
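To make the quantities above concrete, the following minimal sketch (not code from the paper) runs vanilla OGD on a drifting sequence of strongly convex quadratic losses and records the dynamic regret together with the path-length $C^*_T$ and the squared path-length $C^*_{2,T}$ of the minimizer sequence. The loss family $f_t(x) = \frac{1}{2}\|x - \theta_t\|^2$, the step size, and the number of gradient queries per round are illustrative assumptions, not choices made in the paper.

```python
# Minimal sketch (not the paper's algorithm): vanilla online gradient descent on a
# drifting sequence of strongly convex quadratic losses f_t(x) = 0.5 * ||x - theta_t||^2,
# tracking dynamic regret and the path-lengths of the minimizer sequence.
# The losses, step size eta, and query count K are illustrative assumptions.
import numpy as np

rng = np.random.default_rng(0)
T, d = 200, 5
eta = 0.5   # step size; f_t is 1-strongly convex and 1-smooth, so eta in (0, 1] contracts
K = 3       # gradient queries per round on the revealed loss (K = 1 is standard OGD)

# Slowly drifting minimizers theta_t, so the path-length of the comparator stays moderate.
theta = np.cumsum(0.05 * rng.standard_normal((T, d)), axis=0)

x = np.zeros(d)
dynamic_regret = 0.0
path_length = 0.0      # C*_T     = sum_t ||theta_t - theta_{t-1}||
sq_path_length = 0.0   # C*_{2,T} = sum_t ||theta_t - theta_{t-1}||^2

for t in range(T):
    # Dynamic regret adds f_t(x_t) - f_t(theta_t); here f_t(theta_t) = 0.
    dynamic_regret += 0.5 * np.sum((x - theta[t]) ** 2)
    if t > 0:
        drift = np.linalg.norm(theta[t] - theta[t - 1])
        path_length += drift
        sq_path_length += drift ** 2
    # After the loss is revealed, take K gradient steps on f_t before the next round.
    for _ in range(K):
        x = x - eta * (x - theta[t])

print(f"dynamic regret: {dynamic_regret:.3f}")
print(f"C*_T: {path_length:.3f},  C*_2,T: {sq_path_length:.3f}")
```

Setting K = 1 corresponds to a single gradient query per round; increasing K lets the iterate track the drifting minimizer more closely, which is the multiple-query regime discussed above.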

[1]  Rong Jin, et al. Dynamic Regret of Strongly Adaptive Methods, 2017, ICML.

[2]  Elad Hazan, et al. Logarithmic regret algorithms for online convex optimization, 2006, Machine Learning.

[3]  Karthik Sridharan, et al. Online Learning with Predictable Sequences, 2012, COLT.

[4]  Jianjun Yuan, et al. Trading-Off Static and Dynamic Regret in Online Least-Squares and Beyond, 2020, AAAI.

[5]  Omar Besbes, et al. Non-Stationary Stochastic Optimization, 2013, Oper. Res.

[6]  Amit Daniely, et al. Strongly Adaptive Online Learning, 2015, ICML.

[7]  Martin Zinkevich, et al. Online Convex Programming and Generalized Infinitesimal Gradient Ascent, 2003, ICML.

[8]  Jinfeng Yi, et al. Improved Dynamic Regret for Non-degenerate Functions, 2016, NIPS.

[9]  Aryan Mokhtari, et al. Optimization in Dynamic Environments: Improved Regret Rates for Strongly Convex Problems, 2016.

[10]  Karthik Sridharan, et al. Optimization, Learning, and Games with Predictable Sequences, 2013, NIPS.

[11]  Elad Hazan, et al. Introduction to Online Convex Optimization, 2016, Found. Trends Optim.

[12]  Vahid Tarokh, et al. Prediction in Online Convex Optimization for Parametrizable Objective Functions, 2019, IEEE 58th Conference on Decision and Control (CDC).

[13]  Lijun Zhang, et al. Adaptive Online Learning in Dynamic Environments, 2018, NeurIPS.

[14]  Rong Jin, et al. Online Optimization with Gradual Variations, 2012, COLT (25th Annual Conference on Learning Theory).

[15]  Rebecca Willett, et al. Online Convex Optimization in Dynamic Environments, 2015, IEEE Journal of Selected Topics in Signal Processing.

[16]  Tianbao Yang, et al. Minimizing Dynamic Regret and Adaptive Regret Simultaneously, 2020, AISTATS.

[17]  Lijun Zhang, et al. Adaptive Regret of Convex and Smooth Functions, 2019, ICML.

[18]  Emiliano Dall'Anese, et al. Inexact Online Proximal-gradient Method for Time-varying Convex Optimization, 2020, American Control Conference (ACC).

[19]  Shahin Shahrampour, et al. Online Optimization: Competing with Dynamic Comparators, 2015, AISTATS.

[20]  Antoine Lesage-Landry, et al. Second-Order Online Nonconvex Optimization, 2020, IEEE Transactions on Automatic Control.

[21]  Seshadhri Comandur, et al. Adaptive Algorithms for Online Decision Problems, 2007, Electronic Colloquium on Computational Complexity (ECCC), Report No. 88.