Implementing transfer learning across different datasets for time series forecasting

Abstract Due to the extensive practical value of time series prediction, many excellent algorithms have been proposed. Most of these methods are developed assuming that massive labeled training data are available. However, this assumption might be invalid in some actual situations. To address this limitation, a transfer learning framework with deep architectures is proposed. Since convolutional neural network (CNN) owns favorable feature extraction capability and can implement parallelization more easily, we propose a deep transfer learning method resorting to the architecture of CNN, termed as DTr-CNN for short. It can effectively alleviate the available labeled data absence and leverage useful knowledge to the current prediction. Notably, in our method, transfer learning process is implemented across different datasets. For a given target domain, in real-world scenarios, relativity of truly available potential source datasets may not be obvious, which is challenging and rarely referred to in most existing time series prediction methods. Aiming at this problem, the incorporation of Dynamic Time Warping (DTW) and Jensen-Shannon (JS) divergence is adopted for the selection of the appropriate source domain. Effectiveness of the proposed method is empirically underpinned by the experiments conducted on one group of synthetic and two groups of practical datasets. Besides, an additional experiment on NN5 dataset is conducted.

[1]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[2]  Michael Bohlke-Schneider,et al.  Criteria for Classifying Forecasting Methods , 2020, International Journal of Forecasting.

[3]  Nikolay Laptev,et al.  Reconstruction and Regression Loss for Time-Series Transfer Learning , 2018 .

[4]  Shih-Fu Chang,et al.  Cross-domain learning methods for high-level visual concept classification , 2008, 2008 15th IEEE International Conference on Image Processing.

[5]  C. Bergmeir,et al.  Recurrent Neural Networks for Time Series Forecasting: Current Status and Future Directions , 2019, International Journal of Forecasting.

[6]  Christoph Bergmeir,et al.  Forecasting across time series databases using recurrent neural networks on groups of similar series: A clustering approach , 2017, Expert Syst. Appl..

[7]  Q. Henry Wu,et al.  Local prediction of non-linear time series using support vector regression , 2008, Pattern Recognit..

[8]  Sander Bohte,et al.  Conditional Time Series Forecasting with Convolutional Neural Networks , 2017, 1703.04691.

[9]  Erik Cambria,et al.  Fuzzy commonsense reasoning for multimodal sentiment analysis , 2019, Pattern Recognit. Lett..

[10]  Yoshua Bengio,et al.  Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.

[11]  Michael I. Jordan,et al.  Learning Transferable Features with Deep Adaptation Networks , 2015, ICML.

[12]  J. Ord,et al.  A New Look at Models For Exponential Smoothing , 2001 .

[13]  Thomas Jansen,et al.  Optimization with randomized search heuristics - the (A)NFL theorem, realistic scenarios, and difficult functions , 2002, Theor. Comput. Sci..

[14]  Michael I. Jordan,et al.  Transferable Adversarial Training: A General Approach to Adapting Deep Classifiers , 2019, ICML.

[15]  Slawek Smyl,et al.  A hybrid method of exponential smoothing and recurrent neural networks for time series forecasting , 2020, International Journal of Forecasting.

[16]  Kate Saenko,et al.  Deep CORAL: Correlation Alignment for Deep Domain Adaptation , 2016, ECCV Workshops.

[17]  Peter Stagge,et al.  Recurrent neural networks for time series classification , 2003, Neurocomputing.

[18]  Tingfa Xu,et al.  Transfer learning-based discriminative correlation filter for visual tracking , 2020, Pattern Recognit..

[19]  Identification of Optimal Autoregressive Integrated Moving Average Model on Temperature Data , 2011 .

[20]  Qun Dai,et al.  A novel transfer learning framework for time series forecasting , 2018, Knowl. Based Syst..

[21]  Guokun Lai,et al.  Modeling Long- and Short-Term Temporal Patterns with Deep Neural Networks , 2017, SIGIR.

[22]  Alaa El. Sagheer,et al.  Time series forecasting of petroleum production using deep LSTM recurrent networks , 2019, Neurocomputing.

[23]  Germain Forestier,et al.  Transfer learning for time series classification , 2018, 2018 IEEE International Conference on Big Data (Big Data).

[24]  Rohitash Chandra,et al.  Competition and Collaboration in Cooperative Coevolution of Elman Recurrent Neural Networks for Time-Series Prediction , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[25]  Jianxin Wu,et al.  Improving CNN linear layers with power mean non-linearity , 2019, Pattern Recognit..

[26]  Rob J Hyndman,et al.  Another look at measures of forecast accuracy , 2006 .

[27]  Edwin Lughofer,et al.  Robust Unsupervised Domain Adaptation for Neural Networks via Moment Alignment , 2017, Inf. Sci..

[28]  Saeed Zolfaghari,et al.  Chaotic time series prediction with residual analysis method using hybrid Elman-NARX neural networks , 2010, Neurocomputing.

[29]  Eamonn J. Keogh,et al.  Exact indexing of dynamic time warping , 2002, Knowledge and Information Systems.