Data Augmentation for Short-Term Time Series Prediction with Deep Learning

In this paper, a hybrid data augmentation technique for short-term time series prediction is proposed in order to overcome the underfitting problem in deep learning models based on recurrent neural networks such as Long Short-Term Memory (LSTM) and Gated Recurrent Unit (GRU). The proposal hybrid technique consists of the combination of two basic data augmentation techniques that are generally used for time series classification, these are: time-warping and jittering. Time-warping allows the generation of synthetic data between each pair of values in the time series, extending its length, while jittering allows the synthetic data generated to be non-linear. To evaluate the proposal technique, it’s experimented with three non-seasonal short-term time series of Peru: CO2 emissions per capita, renewable energy consumption and Covid-19 positive cases, it is considered that predicting non-seasonal time series is more difficult than seasonal ones. The results show that the regression models based on recurrent neural networks using the selected time series with data augmentation improve results between 16.318% and 42.1426%.

[1]  David Banjerdpongchai,et al.  Short-Term Load Forecast for Energy Management Systems Using Time Series Analysis and Neural Network Method with Average True Range , 2019, 2019 First International Symposium on Instrumentation, Control, Artificial Intelligence, and Robotics (ICA-SYMP).

[2]  Nitesh V. Chawla,et al.  SMOTE: Synthetic Minority Over-sampling Technique , 2002, J. Artif. Intell. Res..

[3]  Seiichi Uchida,et al.  Time Series Data Augmentation for Neural Networks by Time Warping with a Discriminative Teacher , 2020, 2020 25th International Conference on Pattern Recognition (ICPR).

[4]  Khandakar M. Rashid,et al.  Window-Warping: A Time Series Data Augmentation of IMU Data for Construction Equipment Activity Identification , 2019 .

[5]  Khandakar M. Rashid,et al.  Times-series data augmentation and deep learning for construction equipment activity recognition , 2019, Adv. Eng. Informatics.

[6]  Pascal Vincent,et al.  Representation Learning: A Review and New Perspectives , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  Valentin Flunkert,et al.  DeepAR: Probabilistic Forecasting with Autoregressive Recurrent Networks , 2017, International Journal of Forecasting.

[8]  Guido Pasquini,et al.  Testing the Performance of an Innovative Markerless Technique for Quantitative and Qualitative Gait Analysis , 2020, Sensors.

[9]  K. Torkkola,et al.  A Multi-Horizon Quantile Recurrent Forecaster , 2017, 1711.11053.

[10]  Elin Klages,et al.  Very Short Term Time-Series Forecasting of Solar Irradiance Without Exogenous Inputs , 2018, 1810.07066.

[11]  Julián A. Pucheta,et al.  Time-series prediction with BEMCA approach: Application to short rainfall series , 2017, 2017 IEEE Latin American Conference on Computational Intelligence (LA-CCI).

[12]  Taghi M. Khoshgoftaar,et al.  A survey on Image Data Augmentation for Deep Learning , 2019, Journal of Big Data.

[13]  Romain Tavenard,et al.  Data Augmentation for Time Series Classification using Convolutional Neural Networks , 2016 .

[14]  Anibal Flores,et al.  Comparison of Hybrid Recurrent Neural Networks for Univariate Time Series Forecasting , 2020, IntelliSys.

[15]  Anibal Flores,et al.  Recurrent Neural Networks for Meteorological Time Series Imputation , 2020 .

[16]  Dominic Thewlis,et al.  Simulating Time-Series Data for Improved Deep Neural Network Performance , 2019, IEEE Access.

[17]  Mariusz Oszust,et al.  Data Augmentation with Suboptimal Warping for Time-Series Classification , 2019, Sensors.