Paper Information - Minimizing Dynamic Regret and Adaptive Regret Simultaneously

Regret minimization is treated as the golden rule in the traditional study of online learning. However, regret minimization algorithms tend to converge to the static optimum, which makes them suboptimal for changing environments. To address this limitation, new performance measures, including dynamic regret and adaptive regret, have been proposed to guide the design of online algorithms. The former aims to minimize the global regret with respect to a sequence of changing comparators, while the latter attempts to minimize every local regret with respect to a fixed comparator. Existing algorithms for dynamic regret and adaptive regret have been developed independently, and each targets only one performance measure. In this paper, we bridge this gap by proposing novel online algorithms that minimize the dynamic regret and adaptive regret simultaneously. In fact, our theoretical guarantee is even stronger, in the sense that a single algorithm is able to minimize the dynamic regret over any interval.
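To make the two performance measures concrete, the following is a minimal sketch that evaluates them on a toy problem. It is not the paper's algorithm. It assumes a one-dimensional squared loss f_t(w) = (w - c_t)^2 and an unconstrained decision set, and it uses the definitions common in the online learning literature: dynamic regret compares the learner with a sequence of comparators u_1, ..., u_T, while adaptive regret takes the worst regret against the best fixed comparator over every interval of a given length. All function and variable names are hypothetical.

```python
from typing import Sequence


def loss(w: float, c: float) -> float:
    """Assumed toy loss f_t(w) = (w - c_t)^2."""
    return (w - c) ** 2


def dynamic_regret(decisions: Sequence[float],
                   targets: Sequence[float],
                   comparators: Sequence[float]) -> float:
    """Global regret against a changing comparator sequence u_1, ..., u_T."""
    return sum(loss(w, c) - loss(u, c)
               for w, c, u in zip(decisions, targets, comparators))


def interval_regret(decisions: Sequence[float],
                    targets: Sequence[float],
                    start: int, end: int) -> float:
    """Regret on rounds [start, end) against the best fixed comparator.

    For the squared loss without constraints, the best fixed comparator
    on an interval is the mean of the targets in that interval.
    """
    window = targets[start:end]
    best = sum(window) / len(window)
    learner_loss = sum(loss(w, c) for w, c in zip(decisions[start:end], window))
    best_loss = sum(loss(best, c) for c in window)
    return learner_loss - best_loss


def adaptive_regret(decisions: Sequence[float],
                    targets: Sequence[float],
                    length: int) -> float:
    """Worst local regret over all intervals of the given length."""
    return max(interval_regret(decisions, targets, s, s + length)
               for s in range(len(targets) - length + 1))


# The environment switches from 0 to 1 halfway through the horizon.
targets = [0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 1.0]
# A learner that sits at the static optimum 0.5 in every round.
decisions = [0.5] * len(targets)

static = interval_regret(decisions, targets, 0, len(targets))
adaptive = adaptive_regret(decisions, targets, length=4)
dynamic = dynamic_regret(decisions, targets, comparators=targets)

print(static)    # 0.0
print(adaptive)  # 1.0
print(dynamic)   # 2.0
```

In this toy run the learner has zero static regret, yet it incurs positive adaptive and dynamic regret because it never tracks the switch. This is the sense in which converging to the static optimum is suboptimal in a changing environment.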
