Partitioning and Multi-core Parallelization of Multi-equation Forecast Models

Forecasting is an important analysis technique used in many application domains such as electricity management, sales and retail and, traffic predictions. The employed statistical models already provide very accurate predictions, but recent developments in these domains pose new requirements on the calculation speed of the forecast models. Especially, the often used multi-equation models tend to be very complex and their estimation is very time consuming. To still allow the use of these highly accurate forecast models, it is necessary to improve the data processing capabilities of the involved data management systems. For this purpose, we introduce a partitioning approach for multi-equation forecast models that considers the specific data access pattern of these models to optimize the data storage and memory access. With the help of our approach we avoid the redundant reading of unnecessary values and improve the utilization of the CPU cache. Furthermore, we utilize the capabilities of modern multi-core hardware and parallelize the model estimation. Our experimental results on real-world data show speedups of up to 73x for the initial model estimation. Thus, our partitioning and parallelization approach significantly increases the efficiency of multi-equation models.

[1]  Pradeep Dubey,et al.  FAST: fast architecture sensitive tree search on modern CPUs and GPUs , 2010, SIGMOD Conference.

[2]  Peter R. Winters,et al.  Forecasting Sales by Exponentially Weighted Moving Averages , 1960 .

[3]  John A. Nelder,et al.  A Simplex Method for Function Minimization , 1965, Comput. J..

[4]  R. Ramanathan,et al.  Short-run forecasts of electricity loads and peaks , 1997 .

[5]  James W. Taylor,et al.  Triple seasonal methods for short-term electricity demand forecasting , 2010, Eur. J. Oper. Res..

[6]  E. M. Anagnostakis,et al.  Short-term load forecasting based on artificial neural networks parallel implementation , 2002 .

[7]  Stanley B. Zdonik,et al.  A skip-list approach for efficiently processing forecasting queries , 2008, Proc. VLDB Endow..

[8]  Linda See,et al.  Data preprocessing for river flow forecasting using neural networks: Wavelet transforms and data partitioning , 2006 .

[9]  Datong Chen,et al.  Forecasting high-dimensional data , 2010, SIGMOD Conference.

[10]  Sally A. McKee,et al.  Hitting the memory wall: implications of the obvious , 1995, CARN.

[11]  Remy Cottet,et al.  Bayesian Modeling and Forecasting of Intraday Electricity Load , 2003 .

[12]  Pradeep Dubey,et al.  Platform 2015: Intel ® Processor and Platform Evolution for the Next Decade , 2005 .

[13]  Satoshi Matsuoka,et al.  An 80-Fold Speedup, 15.0 TFlops Full GPU Acceleration of Non-Hydrostatic Weather Model ASUCA Production Code , 2010, 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis.

[14]  P. Young,et al.  Time series analysis, forecasting and control , 1972, IEEE Transactions on Automatic Control.

[15]  M. Medeiros,et al.  Modeling and forecasting short-term electricity load: A comparison of methods with an application to Brazilian data , 2008 .

[16]  Chusak Limsakul,et al.  A Computing Model of Artificial Intelligent Approaches to Mid-term Load Forecasting: a state-of-the-art- survey for the researcher , 2010 .

[17]  P. McSharry,et al.  A comparison of univariate methods for forecasting electricity demand up to a day ahead , 2006 .