Large Multivariate Time Series Forecasting

Research on the analysis of time series has gained momentum in recent years, as knowledge derived from time series analysis can improve the decision-making process for industrial and scientific fields. Furthermore, time series analysis is often an essential part of business intelligence systems. With the growing interest in this topic, a novel set of challenges emerges. Utilizing forecasting models that can handle a large number of predictors is a popular approach that can improve results compared to univariate models. However, issues arise for high dimensional data. Not all variables will have direct impact on the target variable and adding unrelated variables may make the forecasts less accurate. Thus, the authors explore methods that can effectively deal with time series with many predictors. The authors discuss state-of-the-art methods for optimizing the selection, dimension reduction, and shrinkage of predictors. While similar research exists, it exclusively targets small and medium datasets, and thus, the research aims to fill the knowledge gap in the context of big data applications.

[1]  Johan A. K. Suykens,et al.  Fast and scalable Lasso via stochastic Frank–Wolfe methods with a convergence guarantee , 2015, Machine Learning.

[2]  P. Whittle The Analysis of Multiple Stationary Time Series , 1953 .

[3]  Mircea Andrecut,et al.  Parallel GPU Implementation of Iterative PCA Algorithms , 2008, J. Comput. Biol..

[4]  Xindong Wu,et al.  Data mining with big data , 2014, IEEE Transactions on Knowledge and Data Engineering.

[5]  Christophe Croux,et al.  Least angle regression for time series forecasting with many predictors , 2008 .

[6]  Francisco Herrera,et al.  Big data preprocessing: methods and prospects , 2016 .

[7]  Lotfi Lakhal,et al.  Improving Multivariate Time Series Forecasting with Random Walks with Restarts on Causality Graphs , 2017, 2017 IEEE International Conference on Data Mining Workshops (ICDMW).

[8]  G. Kapetanios,et al.  Forecasting Large Datasets with Bayesian Reduced Rank Multivariate Models , 2009 .

[9]  Mark W. Watson,et al.  Generalized Shrinkage Methods for Forecasting Using Many Predictors , 2012 .

[10]  Xiao Zhong,et al.  Forecasting daily stock market return using dimensionality reduction , 2017, Expert Syst. Appl..

[11]  James H. Stock,et al.  Dynamic Factor Models , 2011 .

[12]  Xinsheng Zhang,et al.  Two-step adaptive model selection for vector autoregressive processes , 2013, J. Multivar. Anal..

[13]  Michael E. Tipping,et al.  Probabilistic Principal Component Analysis , 1999 .

[14]  Tomasz Wiktor Wlodarczyk Overview of Time Series Storage and Processing in a Cloud Environment , 2012, 4th IEEE International Conference on Cloud Computing Technology and Science Proceedings.

[15]  Galit Shmueli,et al.  To Explain or To Predict? , 2010 .

[16]  J. Bai,et al.  Forecasting economic time series using targeted predictors , 2008 .

[17]  Albert Y. Zomaya,et al.  H-PARAFAC: Hierarchical Parallel Factor Analysis of Multidimensional Big Data , 2017, IEEE Transactions on Parallel and Distributed Systems.

[18]  D. Peña Dimension reduction in time series and the dynamic factor model , 2009 .

[19]  Tarek Elgamal,et al.  sPCA: Scalable Principal Component Analysis for Big Data on Distributed Platforms , 2015, SIGMOD Conference.

[20]  Stephen P. Boyd,et al.  Network Lasso: Clustering and Optimization in Large Graphs , 2015, KDD.

[21]  Lotfi Lakhal,et al.  A Causality Based Feature Selection Approach for Multivariate Time Series Forecasting , 2017, DBKDA 2017.

[22]  Guoqiang Peter Zhang,et al.  Time series forecasting using a hybrid ARIMA and neural network model , 2003, Neurocomputing.

[23]  Han Liu,et al.  Challenges of Big Data Analysis. , 2013, National science review.

[24]  S. Johansen Estimation and Hypothesis Testing of Cointegration Vectors in Gaussian Vector Autoregressive Models , 1991 .

[25]  Morten Mørup,et al.  Scalable group level probabilistic sparse factor analysis , 2016, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[26]  J. Stock,et al.  Forecasting Using Principal Components From a Large Number of Predictors , 2002 .

[27]  Bernhard Schölkopf,et al.  Randomized Nonlinear Component Analysis , 2014, ICML.

[28]  Melinda F. Thielbar Neural Networks for Time Series Forecasting: Practical Implications of Theoretical Results. , 2011 .

[29]  Lizhe Wang,et al.  Fast and Scalable Multi-Way Analysis of Massive Neural Data , 2015, IEEE Transactions on Computers.

[30]  Markku Hauta-Kasari,et al.  Real-time PCA calculation for spectral imaging (using SIMD and GP-GPU) , 2010, Journal of Real-Time Image Processing.

[31]  Shingo Mabu,et al.  Forecasting Real Time Series Data using Deep Belief Net and Reinforcement Learning , 2017, J. Robotics Netw. Artif. Life.

[32]  Ajith Abraham,et al.  Hybrid Intelligent Systems for Stock Market Analysis , 2001, International Conference on Computational Science.

[33]  G. Box Box and Jenkins: Time Series Analysis, Forecasting and Control , 2013 .

[34]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[35]  Jiuyong Li,et al.  Using causal discovery for feature selection in multivariate numerical time series , 2015, Machine Learning.

[36]  Matteo Barigozzi,et al.  Non-Stationary Dynamic Factor Models for Large Datasets , 2016, 1602.02398.

[37]  Irena Koprinska,et al.  Correlation and instance based feature selection for electricity load forecasting , 2015, Knowl. Based Syst..

[38]  Shen Furao,et al.  Forecasting exchange rate using deep belief networks and conjugate gradient method , 2015, Neurocomputing.

[39]  R. Tibshirani,et al.  Least angle regression , 2004, math/0406456.

[40]  Dimitris Korobilis,et al.  Hierarchical Shrinkage Priors for Dynamic Regressions with Many Predictors , 2011 .

[41]  Fei-Yue Wang,et al.  Travel time prediction with LSTM neural network , 2016, 2016 IEEE 19th International Conference on Intelligent Transportation Systems (ITSC).

[42]  Farshid Vahid,et al.  Macroeconomic forecasting for Australia using a large number of predictors , 2019, International Journal of Forecasting.

[43]  Mario Vasak,et al.  Deep neural networks for ultra-short-term wind forecasting , 2015, 2015 IEEE International Conference on Industrial Technology (ICIT).

[44]  Takashi Matsubara,et al.  Deep learning for stock prediction using numerical and textual information , 2016, 2016 IEEE/ACIS 15th International Conference on Computer and Information Science (ICIS).

[45]  C. Granger Testing for causality: a personal viewpoint , 1980 .

[46]  Milton S. Boyd,et al.  Designing a neural network for forecasting financial and economic time series , 1996, Neurocomputing.

[47]  Christian Schumacher,et al.  POOLING VERSUS MODEL SELECTION FOR NOWCASTING GDP WITH MANY PREDICTORS: EMPIRICAL EVIDENCE FOR SIX INDUSTRIALIZED COUNTRIES , 2013 .

[48]  Yongxin Zhu,et al.  Distributed Discord Discovery: Spark Based Anomaly Detection in Time Series , 2015, 2015 IEEE 17th International Conference on High Performance Computing and Communications, 2015 IEEE 7th International Symposium on Cyberspace Safety and Security, and 2015 IEEE 12th International Conference on Embedded Software and Systems.

[49]  Cyrus Shahabi,et al.  Feature Subset Selection on Multivariate Time Series with Extremely Large Spatial Features , 2006, Sixth IEEE International Conference on Data Mining - Workshops (ICDMW'06).

[50]  Juyang Weng,et al.  Candid Covariance-Free Incremental Principal Component Analysis , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[51]  Jiahan Li,et al.  Forecasting Macroeconomic Time Series: LASSO-Based Approaches and Their Forecast Combinations with Dynamic Factor Models , 2014 .

[52]  A. E. Hoerl,et al.  Ridge regression: biased estimation for nonorthogonal problems , 2000 .