Intelligent Crude Oil Price Forecaster

We propose two ensemble regression algorithms for forecasting the daily price of crude oil from features extracted from the U.S. Energy Administration and some international news agencies. An ensemble regression model consists of a group of homogeneous regressors with varying parameters, e.g. Linear regression models with different ridge regularization parameters. The first ensemble method called "recent leader" picks the individual regressor with least mean square error over recent data. The second model called "exponentially weighted ensemble" combines individual regressors in a linear fashion with weights of constituent models decaying exponentially with the mean square error over past predictions. These two methods were tested with linear regression, support vector regression, decision trees and Gaussian processes. Exponentially weighted ensemble with support vector regression had the best performance.

[1]  Karl Rihaczek,et al.  1. WHAT IS DATA MINING? , 2019, Data Mining for the Social Sciences.

[2]  Arthur E. Hoerl,et al.  Practical Use of Ridge Regression: A Challenge Met , 1985 .

[3]  Xiongpai Qin Making Use of the Big Data: Next Generation of Algorithm Trading , 2012, AICI.

[4]  Jeffrey D. Ullman,et al.  Mining of Massive Datasets: Data Mining , 2011 .

[5]  Lior Rokach,et al.  Top-down induction of decision trees classifiers - a survey , 2005, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[6]  Alexander J. Smola,et al.  Support Vector Regression Machines , 1996, NIPS.

[7]  Ian Witten,et al.  Data Mining , 2000 .

[8]  B. Noble,et al.  On certain integrals of Lipschitz-Hankel type involving products of bessel functions , 1955, Philosophical Transactions of the Royal Society of London. Series A, Mathematical and Physical Sciences.

[9]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.