The hypothesis in this paper is that a significant amount of intraday market data is either noise or redundant, and that if it is eliminated, then predictive models built using the remaining intraday data will be more accurate. To test this hypothesis, we use an evolutionary method (called Evolutionary Data Selection, EDS) to selectively remove out portions of training data that is to be made available to an intraday market predictor. After performing experiments in which data-selected and non-data-selected versions of the same predictive models are compared, it is shown that EDS is effective and does indeed boost predictor accuracy. It is also shown in the paper that building multiple models using EDS and placing them into an ensemble further increases performance. The datasets for evaluation are large intraday forex time series, specifically series from the EUR/USD, the USD/JPY and the EUR/JPY markets, and predictive models for two primary tasks per market are built: intraday return prediction and intraday volatility prediction.
[1]
Angelo Ranaldo,et al.
Intraday Patterns in FX Returns and Order Flow
,
2012
.
[3]
Francisco Herrera,et al.
Using evolutionary algorithms as instance selection for data reduction in KDD: an experimental study
,
2003,
IEEE Trans. Evol. Comput..
[4]
John C. Platt,et al.
Fast training of support vector machines using sequential minimal optimization, advances in kernel methods
,
1999
.
[5]
Conor Ryan,et al.
Modesty Is the Best Policy: Automatic Discovery of Viable Forecasting Goals in Financial Data
,
2010,
EvoApplications.
[6]
Leo Breiman,et al.
Random Forests
,
2001,
Machine Learning.
[7]
Ian H. Witten,et al.
The WEKA data mining software: an update
,
2009,
SKDD.