论文信息 - Shifting Regret, Mirror Descent, and Matrices

Shifting Regret, Mirror Descent, and Matrices

We consider the problem of online prediction in changing environments. In this framework the performance of a predictor is evaluated as the loss relative to an arbitrarily changing predictor, whose individual components come from a base class of predictors. Typical results in the literature consider different base classes (experts, linear predictors on the simplex, etc.) separately. Introducing an arbitrary mapping inside the mirror decent algorithm, we provide a framework that unifies and extends existing results. As an example, we prove new shifting regret bounds for matrix prediction problems.

András György | Csaba Szepesvári | Csaba Szepesvari | A. György

[1] John Darzentas,et al. Problem Complexity and Method Efficiency in Optimization , 1983 .

[2] Mark Herbster,et al. Tracking the Best Expert , 1995, Machine-mediated learning.

[3] Frans M. J. Willems,et al. Coding for a binary independent piecewise-identically-distributed source , 1996, IEEE Trans. Inf. Theory.

[4] F. Willems,et al. Live-and-die coding for binary piecewise i.i.d. sources , 1997, Proceedings of IEEE International Symposium on Information Theory.

[5] Manfred K. Warmuth,et al. Tracking a Small Set of Experts by Mixing Past Posteriors , 2003, J. Mach. Learn. Res..

[6] Mark Herbster,et al. Tracking the Best Linear Predictor , 2001, J. Mach. Learn. Res..

[7] Marc Teboulle,et al. Mirror descent and nonlinear projected subgradient methods for convex optimization , 2003, Oper. Res. Lett..

[8] Martin Zinkevich,et al. Online Convex Programming and Generalized Infinitesimal Gradient Ascent , 2003, ICML.

[9] Gunnar Rätsch,et al. Matrix Exponentiated Gradient Updates for On-line Learning and Bregman Projection , 2004, J. Mach. Learn. Res..

[10] Sanjeev Arora,et al. A combinatorial, primal-dual approach to semidefinite programs , 2007, STOC '07.

[11] Seshadhri Comandur,et al. Efficient learning algorithms for changing environments , 2009, ICML '09.

[12] Alexander Shapiro,et al. Stochastic Approximation approach to Stochastic Programming , 2013 .

[13] Shai Shalev-Shwartz,et al. Near-Optimal Algorithms for Online Matrix Prediction , 2012, COLT.

[14] Tamás Linder,et al. Efficient Tracking of Large Classes of Experts , 2012, IEEE Trans. Inf. Theory.

[15] Wouter M. Koolen,et al. A Closer Look at Adaptive Regret , 2012, J. Mach. Learn. Res..

[16] Nicolò Cesa-Bianchi,et al. Mirror Descent Meets Fixed Share (and feels no regret) , 2012, NIPS.

[17] Rebecca Willett,et al. Dynamical Models and tracking regret in online convex programming , 2013, ICML.

[18] Amit Daniely,et al. Strongly Adaptive Online Learning , 2015, ICML.