Efficient tracking of a growing number of experts

We consider a variation on the problem of prediction with expert advice, in which new forecasters, unknown until then, may appear at each round. As is often the case in prediction with expert advice, designing an algorithm that achieves near-optimal regret guarantees is straightforward via aggregation of experts. However, when the comparison class is sufficiently rich, for instance when the best expert and the set of experts itself change over time, such strategies naively require maintaining a prohibitive number of weights (typically exponential in the time horizon). By contrast, designing strategies that both achieve near-optimal regret and maintain a reasonable number of weights is highly non-trivial. We consider three increasingly challenging objectives (simple regret, shifting regret and sparse shifting regret) that extend existing notions defined for a fixed expert ensemble; in each case, we design strategies that achieve tight regret bounds, adaptive to the parameters of the comparison class, while being computationally inexpensive. Moreover, our algorithms are anytime, agnostic to the number of incoming experts and completely parameter-free. Such remarkable results are made possible by two simple but highly effective recipes: first, the "abstention trick", which comes from the specialist framework and handles the least challenging notions of regret, but is limited when addressing more sophisticated objectives; second, the "muting trick", which we introduce to provide more flexibility. We show how to combine these two tricks in order to handle the most challenging class of comparison strategies.
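To make the setting concrete, below is a minimal sketch of exponentially weighted aggregation over a growing expert set, using the specialist-style "abstention trick": an expert that has not yet entered is treated as if it had followed the aggregate so far, so a newcomer starts with the current average weight. The class name, the learning-rate choice and the entry rule are illustrative assumptions, not the paper's exact algorithms.

```python
import math

class GrowingHedge:
    """Exponentially weighted averaging over a growing expert set (sketch).

    Abstention trick (illustrative form): an absent expert is treated as
    having predicted the aggregate, so its relative weight is unchanged;
    equivalently, a newcomer enters with the mean of the current weights.
    """

    def __init__(self, eta=1.0):
        self.eta = eta      # fixed learning rate (the paper's methods are parameter-free)
        self.weights = []   # one weight per expert active so far

    def add_expert(self):
        # Newcomer starts at the mean of current weights (1.0 if it is first),
        # mimicking an expert that abstained, i.e. followed the aggregate, until now.
        w0 = sum(self.weights) / len(self.weights) if self.weights else 1.0
        self.weights.append(w0)

    def predict(self, expert_preds):
        # Aggregate prediction: weighted average of active experts' predictions.
        total = sum(self.weights)
        return sum(w * p for w, p in zip(self.weights, expert_preds)) / total

    def update(self, expert_losses):
        # Multiplicative-weights update on the observed losses.
        self.weights = [w * math.exp(-self.eta * l)
                        for w, l in zip(self.weights, expert_losses)]
```

After a few rounds the aggregate concentrates on low-loss experts, while each round costs only one pass over the currently active weights, which is the kind of computational budget the paper's strategies aim for.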
