Dynamic Local Regret for Non-convex Online Forecasting

We consider online forecasting problems for non-convex machine learning models. Forecasting introduces several challenges such as (i) frequent updates are necessary to deal with concept drift issues since the dynamics of the environment change over time, and (ii) the state of the art models are non-convex models. We address these challenges with a novel regret framework. Standard regret measures commonly do not consider both dynamic environment and non-convex models. We introduce a local regret for non-convex models in a dynamic environment. We present an update rule incurring a cost, according to our proposed local regret, which is sublinear in time T. Our update uses time-smoothed gradients. Using a real-world dataset we show that our time-smoothed approach yields several benefits when compared with state-of-the-art competitors: results are more stable against new data; training is more robust to hyperparameter selection; and our approach is more computationally efficient than the alternatives.

[1]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[2]  Lijun Zhang,et al.  Minimizing Adaptive Regret with One Gradient per Iteration , 2018, IJCAI.

[3]  Matthias W. Seeger,et al.  Deep State Space Models for Time Series Forecasting , 2018, NeurIPS.

[4]  Geoffrey E. Hinton,et al.  On the importance of initialization and momentum in deep learning , 2013, ICML.

[5]  Martin Zinkevich,et al.  Online Convex Programming and Generalized Infinitesimal Gradient Ascent , 2003, ICML.

[6]  H. Robbins A Stochastic Approximation Method , 1951 .

[7]  Travis E. Oliphant,et al.  Python for Scientific Computing , 2007, Computing in Science & Engineering.

[8]  Wouter M. Koolen,et al.  Minimax Time Series Prediction , 2015, NIPS.

[9]  Yi Zhang,et al.  Spectral Filtering for General Linear Dynamical Systems , 2018, NeurIPS.

[10]  Valentin Flunkert,et al.  DeepAR: Probabilistic Forecasting with Autoregressive Recurrent Networks , 2017, International Journal of Forecasting.

[11]  Jean-François Toubeau,et al.  Deep Learning-Based Multivariate Probabilistic Forecasting for Short-Term Scheduling in Power Markets , 2019, IEEE Transactions on Power Systems.

[12]  Seshadhri Comandur,et al.  Efficient learning algorithms for changing environments , 2009, ICML '09.

[13]  Tamás Henk,et al.  GEFCOM 2014 - Probabilistic Electricity Price Forecasting , 2015, KES-IDT.

[14]  Shie Mannor,et al.  Online Learning for Time Series Prediction , 2013, COLT.

[15]  Karan Singh,et al.  Efficient Regret Minimization in Non-Convex Games , 2017, ICML.

[16]  San Gultekin,et al.  Online Forecasting Matrix Factorization , 2017, IEEE Transactions on Signal Processing.

[17]  Amit Daniely,et al.  Strongly Adaptive Online Learning , 2015, ICML.

[18]  Rong Jin,et al.  Dynamic Regret of Strongly Adaptive Methods , 2017, ICML.

[19]  Mykola Pechenizkiy,et al.  An Overview of Concept Drift Applications , 2016 .

[20]  Steven C. H. Hoi,et al.  Online ARIMA Algorithms for Time Series Prediction , 2016, AAAI.

[21]  Dean Alderucci A SPECTRAL ALGORITHM FOR LEARNING HIDDEN MARKOV MODELS THAT HAVE SILENT STATES , 2015 .

[22]  Mehryar Mohri,et al.  Time series prediction and online learning , 2016, COLT.