Multidimensional k-nearest neighbor model based on EEMD for financial time series forecasting

In this paper, we propose a new two-stage methodology that combines the ensemble empirical mode decomposition (EEMD) with multidimensional k-nearest neighbor model (MKNN) in order to forecast the closing price and high price of the stocks simultaneously. The modified algorithm of k-nearest neighbors (KNN) has an increasingly wide application in the prediction of all fields. Empirical mode decomposition (EMD) decomposes a nonlinear and non-stationary signal into a series of intrinsic mode functions (IMFs), however, it cannot reveal characteristic information of the signal with much accuracy as a result of mode mixing. So ensemble empirical mode decomposition (EEMD), an improved method of EMD, is presented to resolve the weaknesses of EMD by adding white noise to the original data. With EEMD, the components with true physical meaning can be extracted from the time series. Utilizing the advantage of EEMD and MKNN, the new proposed ensemble empirical mode decomposition combined with multidimensional k-nearest neighbor model (EEMD–MKNN) has high predictive precision for short-term forecasting. Moreover, we extend this methodology to the case of two-dimensions to forecast the closing price and high price of the four stocks (NAS, S&P500, DJI and STI stock indices) at the same time. The results indicate that the proposed EEMD–MKNN model has a higher forecast precision than EMD–KNN, KNN method and ARIMA.

[1]  T. Wang,et al.  Comparing the applications of EMD and EEMD on time-frequency analysis of seismic signal , 2012 .

[2]  Nigel Meade,et al.  A comparison of the accuracy of short term foreign exchange forecasting methods , 2002 .

[3]  P. Franses,et al.  Forecasting stock market volatility using (non‐linear) Garch models , 1996 .

[4]  N. Huang,et al.  A study of the characteristics of white noise using the empirical mode decomposition method , 2004, Proceedings of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences.

[5]  J. Stock,et al.  A Comparison of Linear and Nonlinear Univariate Models for Forecasting Macroeconomic Time Series , 1998 .

[6]  Ibrahim Türkoglu,et al.  A hybrid method based on artificial immune system and k-NN algorithm for better prediction of protein cellular localization sites , 2009, Appl. Soft Comput..

[7]  Norman R. Swanson,et al.  Forecasting economic time series using flexible versus fixed specification and linear versus nonlinear econometric models , 1997 .

[8]  Balaji Rajagopalan,et al.  Statistical downscaling using K‐nearest neighbors , 2005 .

[9]  H. Liang,et al.  Artifact reduction in electrogastrogram based on empirical mode decomposition method , 2006, Medical and Biological Engineering and Computing.

[10]  Gerrit Hoogenboom,et al.  Weather analogue: A tool for real-time prediction of daily weather data realizations based on a modified k-nearest neighbor approach , 2008, Environ. Model. Softw..

[11]  T. A. Buishand,et al.  Simulation of extreme precipitation in the Rhine basin by nearest-neighbour resampling , 1998 .

[12]  Gabriel Rilling,et al.  EMD Equivalent Filter Banks, from Interpretation to Applications , 2005 .

[13]  Guoqiang Peter Zhang,et al.  Time series forecasting using a hybrid ARIMA and neural network model , 2003, Neurocomputing.

[14]  Bradley Matthew Battista,et al.  Application of the Empirical Mode Decomposition and Hilbert-Huang Transform to Seismic Reflection Data , 2007 .

[15]  Yi-Fan Wang,et al.  Predicting stock price using fuzzy grey prediction system , 2002, Expert Syst. Appl..

[16]  Indranil Mukherjee,et al.  Empirical mode decomposition analysis of two different financial time series and their comparison , 2008 .

[17]  Norden E. Huang,et al.  Ensemble Empirical Mode Decomposition: a Noise-Assisted Data Analysis Method , 2009, Adv. Data Sci. Adapt. Anal..

[18]  Gabriel Rilling,et al.  Empirical mode decomposition as a filter bank , 2004, IEEE Signal Processing Letters.

[19]  Hong Wang,et al.  Predicting stock index increments by neural networks: The role of trading volume under different horizons , 2008, Expert Syst. Appl..

[20]  R. McRoberts,et al.  Remote sensing support for national forest inventories , 2007 .

[21]  An-Sing Chen,et al.  Application of Neural Networks to an Emerging Financial Market: Forecasting and Trading the Taiwan Stock Index , 2001, Comput. Oper. Res..

[22]  S. Magnussen,et al.  A model-assisted k-nearest neighbour approach to remove extrapolation bias , 2010 .

[23]  Pengjian Shang,et al.  APPLICATION OF EMPIRICAL MODE DECOMPOSITION COMBINED WITH k-NEAREST NEIGHBORS APPROACH IN FINANCIAL TIME SERIES FORECASTING , 2012 .

[24]  Ronald E. McRoberts,et al.  Diagnostic tools for nearest neighbors techniques when used with satellite imagery , 2009 .

[25]  Filippo Castiglione Forecasting Price increments using an Artificial Neural Network , 2001, Adv. Complex Syst..

[26]  Jiann-Shing Shieh,et al.  Human heart beat analysis using a modified algorithm of detrended fluctuation analysis based on empirical mode decomposition. , 2009, Medical engineering & physics.

[27]  S. J. Loutridis,et al.  Damage detection in gear systems using empirical mode decomposition , 2004 .

[28]  Chin-Hsiung Loh,et al.  Application of the Empirical Mode Decomposition-Hilbert Spectrum Method to Identify Near-Fault Ground-Motion Characteristics and Structural Responses , 2004 .

[29]  Francis Eng Hock Tay,et al.  Financial Forecasting Using Support Vector Machines , 2001, Neural Computing & Applications.

[30]  K. Lai,et al.  A new approach for crude oil price analysis based on Empirical Mode Decomposition , 2008 .

[31]  Gary A. Davis,et al.  Nonparametric Regression and Short‐Term Freeway Traffic Forecasting , 1991 .

[32]  G. Grudnitski,et al.  Forecasting S&P and gold futures prices: An application of neural networks , 1993 .

[33]  M. S. Woolfson,et al.  Application of empirical mode decomposition to heart rate variability analysis , 2001, Medical and Biological Engineering and Computing.

[34]  Guido Deboeck,et al.  Trading on the Edge: Neural, Genetic, and Fuzzy Systems for Chaotic Financial Markets , 1994 .

[35]  Julián Andrada-Félix,et al.  Exchange-rate forecasts with simultaneous nearest-neighbour methods: evidence from the EMS , 1999 .

[36]  Zhong-Ke Gao,et al.  Scaling analysis of phase fluctuations in experimental three-phase flows , 2011 .

[37]  P. N. Suganthan,et al.  Empirical Mode Decomposition-k Nearest Neighbor Models for Wind Speed Forecasting , 2014 .

[38]  R. Donaldson,et al.  An artificial neural network-GARCH model for international stock return volatility , 1997 .

[39]  Joachim Saborowski,et al.  Spatial prediction of forest stand variables , 2009, European Journal of Forest Research.

[40]  N. Huang,et al.  The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis , 1998, Proceedings of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences.

[41]  E P Souza Neto,et al.  Assessment of Cardiovascular Autonomic Control by the Empirical Mode Decomposition , 2004, Methods of Information in Medicine.

[42]  G. Zhang,et al.  A comparative study of linear and nonlinear models for aggregate retail sales forecasting , 2003 .

[43]  Ronald E. McRoberts,et al.  Estimating forest attribute parameters for small areas using nearest neighbors techniques , 2012 .

[44]  Ram Bilas Pachori,et al.  Analysis of normal and epileptic seizure EEG signals using empirical mode decomposition , 2011, Comput. Methods Programs Biomed..

[45]  K. Coughlin,et al.  11-Year solar cycle in the stratosphere extracted by the empirical mode decomposition method , 2004 .

[46]  R. Mehrotra,et al.  Conditional resampling of hydrologic time series using multiple predictor variables: A K-nearest neighbour approach , 2006 .

[47]  Peter E. Hart,et al.  Nearest neighbor pattern classification , 1967, IEEE Trans. Inf. Theory.

[48]  A. Hudak,et al.  Nearest neighbor imputation of species-level, plot-scale forest structure attributes from LiDAR data , 2008 .