Literature review: Machine learning techniques applied to financial market prediction

Abstract The search for models to predict the prices of financial markets is still a highly researched topic, despite major related challenges. The prices of financial assets are non-linear, dynamic, and chaotic; thus, they are financial time series that are difficult to predict. Among the latest techniques, machine learning models are some of the most researched, given their capabilities for recognizing complex patterns in various applications. With the high productivity in the machine learning area applied to the prediction of financial market prices, objective methods are required for a consistent analysis of the most relevant bibliography on the subject. This article proposes the use of bibliographic survey techniques that highlight the most important texts for an area of research. Specifically, these techniques are applied to the literature about machine learning for predicting financial market values, resulting in a bibliographical review of the most important studies about this topic. Fifty-seven texts were reviewed, and a classification was proposed for markets, assets, methods, and variables. Among the main results, of particular note is the greater number of studies that use data from the North American market. The most commonly used models for prediction involve support vector machines (SVMs) and neural networks. It was concluded that the research theme is still relevant and that the use of data from developing markets is a research opportunity.

[1]  Ping-Feng Pai,et al.  A hybrid ARIMA and support vector machines model in stock price forecasting , 2005 .

[2]  Marija Gorenc Novak,et al.  Prediction of stock price movement based on daily high prices , 2016 .

[3]  Ammar Belatreche,et al.  Evaluating machine learning classification for financial trading: An empirical approach , 2016, Expert Syst. Appl..

[4]  Cheng-Lung Huang,et al.  A hybrid SOFM-SVR with a filter-based feature selection for stock market forecasting , 2009, Expert Syst. Appl..

[5]  Jun Wang,et al.  Return scaling cross-correlation forecasting by stochastic time strength neural network in financial market dynamics , 2017, Soft Computing.

[6]  Muris Lage Junior,et al.  Variations of the kanban system: Literature review and classification , 2010 .

[7]  Song Wu,et al.  A Double-Layer Neural Network Framework for High-Frequency Forecasting , 2017, ACM Trans. Manag. Inf. Syst..

[8]  Michele Marchesi,et al.  A hybrid genetic-neural architecture for stock indexes forecasting , 2005, Inf. Sci..

[9]  Na Zhang,et al.  Bayesian regularisation neural network based on artificial intelligence optimisation , 2017, Int. J. Prod. Res..

[10]  Fred Collopy,et al.  How effective are neural networks at forecasting and prediction? A review and evaluation , 1998 .

[11]  Ömer Kaan Baykan,et al.  Predicting direction of stock price index movement using artificial neural networks and support vector machines: The sample of the Istanbul Stock Exchange , 2011, Expert Syst. Appl..

[12]  David Enke,et al.  The use of data mining and neural networks for forecasting stock market returns , 2005, Expert Syst. Appl..

[13]  Sahil Shah,et al.  Predicting stock and stock price index movement using Trend Deterministic Data Preparation and machine learning techniques , 2015, Expert Syst. Appl..

[14]  C. Granger,et al.  Efficient Market Hypothesis and Forecasting , 2002 .

[15]  David Enke,et al.  Neural Networks as a Decision Maker for Stock Trading: A Technical Analysis Approach , 2003 .

[16]  Mehmet Özçalici,et al.  Integrating metaheuristics and Artificial Neural Networks for improved stock price prediction , 2016, Expert Syst. Appl..

[17]  Pei-Chann Chang,et al.  A neural network with a case based dynamic window for stock trading prediction , 2009, Expert Syst. Appl..

[18]  Nicolas Huck,et al.  Deep neural networks, gradient-boosted trees, random forests: Statistical arbitrage on the S&P 500 , 2017, Eur. J. Oper. Res..

[19]  Stephen L. Chiu,et al.  Fuzzy Model Identification Based on Cluster Estimation , 1994, J. Intell. Fuzzy Syst..

[20]  Thomas G. Dietterich Approximate Statistical Tests for Comparing Supervised Classification Learning Algorithms , 1998, Neural Computation.

[21]  S. Sosvilla‐Rivero,et al.  On the profitability of technical trading rules based on artificial neural networks:: Evidence from the Madrid stock market , 2000 .

[22]  Ray Tsaih,et al.  Forecasting S&P 500 stock index futures with a hybrid AI system , 1998, Decis. Support Syst..

[23]  Deepak Kumar,et al.  Proximal support vector machine based hybrid prediction models for trend forecasting in financial markets , 2016, J. Comput. Sci..

[24]  E. Fama,et al.  Efficient Capital Markets : II , 2007 .

[25]  John S. Liu,et al.  Data envelopment analysis 1978-2010: A citation-based literature survey , 2013 .

[26]  Sheng-Tun Li,et al.  Knowledge discovery in financial investment for forecasting and trading strategy through wavelet-based SOM networks , 2008, Expert Syst. Appl..

[27]  Michel Ballings,et al.  Evaluating multiple classifiers for stock price direction prediction , 2015, Expert Syst. Appl..

[28]  Yi-Fan Wang,et al.  Predicting stock price using fuzzy grey prediction system , 2002, Expert Syst. Appl..

[29]  Russell L. Purvis,et al.  Forecasting the NYSE composite index with technical analysis, pattern recognizer, neural network, and genetic algorithm: a case study in romantic decision support , 2002, Decis. Support Syst..

[30]  Pengjian Shang,et al.  Multidimensional k-nearest neighbor model based on EEMD for financial time series forecasting , 2017 .

[31]  Stefan Lessmann,et al.  Bridging the divide in financial market forecasting: machine learners vs. financial economists , 2016, Expert Syst. Appl..

[32]  T. Bollerslev,et al.  Generalized autoregressive conditional heteroskedasticity , 1986 .

[33]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[34]  Jianzhou Wang,et al.  Stock index forecasting based on a hybrid model , 2012 .

[35]  Henry G. Small,et al.  Co-citation in the scientific literature: A new measure of the relationship between two documents , 1973, J. Am. Soc. Inf. Sci..

[36]  J. Campbell Stock Returns and the Term Structure , 1985 .

[37]  F. Tay,et al.  Application of support vector machines in financial time series forecasting , 2001 .

[38]  An-Sing Chen,et al.  Application of Neural Networks to an Emerging Financial Market: Forecasting and Trading the Taiwan Stock Index , 2001, Comput. Oper. Res..

[39]  Shouyang Wang,et al.  Forecasting stock market movement direction with support vector machine , 2005, Comput. Oper. Res..

[40]  Amiya Kumar Rath,et al.  A Naïve SVM-KNN based stock market trend reversal analysis for Indian benchmark indices , 2015, Appl. Soft Comput..

[41]  Herbert Kimura,et al.  Building direct citation networks , 2018, Scientometrics.

[42]  S. Lahmiri Entropy-Based Technical Analysis Indicators Selection for International Stock Markets Fluctuations Prediction Using Support Vector Machines , 2014 .

[43]  Adriano Lorena Inácio de Oliveira,et al.  Expert Systems With Applications , 2022 .

[44]  Tai-liang Chen,et al.  An intelligent pattern recognition model for supporting investment decisions in stock market , 2016, Inf. Sci..

[45]  Kurt Hornik,et al.  Multilayer feedforward networks are universal approximators , 1989, Neural Networks.

[46]  Ricardo A. S. Fernandes,et al.  Maximum and minimum stock price forecasting of Brazilian power distribution companies based on artificial neural networks , 2015, Appl. Soft Comput..

[47]  Pedro Correia S. Bezerra,et al.  Volatility forecasting via SVR–GARCH with mixture of Gaussian kernels , 2017, Comput. Manag. Sci..

[48]  Jeffrey L. Elman,et al.  Finding Structure in Time , 1990, Cogn. Sci..

[49]  Gwilym M. Jenkins,et al.  Time series analysis, forecasting and control , 1971 .

[50]  Pei-Chann Chang,et al.  A Hybrid System Integrating a Wavelet and TSK Fuzzy Rules for Stock Price Forecasting , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[52]  Mark J. Kamstra,et al.  Neural network forecast combining with interaction effects , 1999 .

[53]  Vinicius Amorim Sobreiro,et al.  Human development and data envelopment analysis: A structured literature review , 2015 .

[54]  M. Boukadoum,et al.  An Ensemble System Based on Hybrid EGARCH-ANN with Different Distributional Assumptions to Predict S&P 500 Intraday Volatility , 2015 .

[55]  Shouyang Wang,et al.  Ensemble ANNs-PSO-GA Approach for Day-ahead Stock E-exchange Prices Forecasting , 2014, Int. J. Comput. Intell. Syst..

[56]  Ingoo Han,et al.  Genetic algorithms approach to feature discretization in artificial neural networks for the prediction of stock price index , 2000 .

[57]  Michael Y. Hu,et al.  Forecasting with artificial neural networks: The state of the art , 1997 .

[58]  Sergio Ortobelli Lozza,et al.  Fusion of multiple diverse predictors in stock market , 2017, Inf. Fusion.

[59]  R. Engle Autoregressive conditional heteroscedasticity with estimates of the variance of United Kingdom inflation , 1982 .

[60]  Hsinchun Chen,et al.  Textual analysis of stock market prediction using breaking financial news: The AZFin text system , 2009, TOIS.

[61]  B. Malkiel The Efficient Market Hypothesis and Its Critics , 2003 .

[62]  Fadel M. Megahed,et al.  Stock market one-day ahead movement prediction using disparate data sources , 2017, Expert Syst. Appl..

[63]  M. Zweig,et al.  Receiver-operating characteristic (ROC) plots: a fundamental evaluation tool in clinical medicine. , 1993, Clinical chemistry.

[64]  Salim Lahmiri,et al.  IMPROVING FORECASTING ACCURACY OF THE S&P500 INTRA-DAY PRICE DIRECTION USING BOTH WAVELET LOW AND HIGH FREQUENCY COEFFICIENTS , 2014 .

[65]  Kimon P. Valavanis,et al.  Surveying stock market forecasting techniques - Part II: Soft computing methods , 2009, Expert Syst. Appl..

[66]  Kurt Hornik,et al.  Approximation capabilities of multilayer feedforward networks , 1991, Neural Networks.

[67]  Amir F. Atiya,et al.  Introduction to financial forecasting , 1996, Applied Intelligence.

[68]  David Enke,et al.  The adaptive selection of financial and economic variables for use with artificial neural networks , 2004, Neurocomputing.

[69]  Wen-Chyuan Chiang,et al.  An adaptive stock index trading decision support system , 2016, Expert Syst. Appl..

[70]  Hájek Petr,et al.  Forecasting stock prices using sentiment information in annual reports - A neural network and support vector regression approach , 2013 .

[71]  M. M. Kessler Bibliographic coupling between scientific papers , 1963 .

[72]  Youngohc Yoon,et al.  A Comparison of Discriminant Analysis versus Artificial Neural Networks , 1993 .

[73]  Xiao Zhong,et al.  Forecasting daily stock market return using dimensionality reduction , 2017, Expert Syst. Appl..

[74]  John S. Liu,et al.  An integrated approach for main path analysis: Development of the Hirsch index as an example , 2012, J. Assoc. Inf. Sci. Technol..

[75]  Kazuo Asakawa,et al.  Stock market prediction system with modular neural networks , 1990, 1990 IJCNN International Joint Conference on Neural Networks.

[76]  You-Shyang Chen,et al.  Modeling fitting-function-based fuzzy time series patterns for evolving stock index forecasting , 2014, Applied Intelligence.

[77]  L. Egghe,et al.  Theory and practise of the g-index , 2006, Scientometrics.

[78]  Yi-Fan Wang,et al.  Mining stock price using fuzzy rough set system , 2003, Expert Syst. Appl..

[79]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[80]  Kyoung-jae Kim,et al.  Financial time series forecasting using support vector machines , 2003, Neurocomputing.

[81]  Lijuan Cao,et al.  Support vector machines experts for time series forecasting , 2003, Neurocomputing.

[82]  E. Fama EFFICIENT CAPITAL MARKETS: A REVIEW OF THEORY AND EMPIRICAL WORK* , 1970 .

[83]  Norman P. Hummon,et al.  Connectivity in a citation network: The development of DNA theory☆ , 1989 .

[84]  Kai Keng Ang,et al.  Stock Trading Using RSPOP: A Novel Rough Set-Based Neuro-Fuzzy Approach , 2006, IEEE Transactions on Neural Networks.

[85]  Ricardo Colomo Palacios,et al.  CAST: Using neural networks to improve trading systems based on technical analysis by means of the RSI financial indicator , 2011, Expert Syst. Appl..

[86]  Chih-Fong Tsai,et al.  Combining multiple feature selection methods for stock prediction: Union, intersection, and multi-intersection approaches , 2010, Decis. Support Syst..

[87]  Manish Kumar,et al.  Forecasting stock index returns using ARIMA-SVM, ARIMA-ANN, and ARIMA-random forest hybrid models , 2014 .

[88]  Paulo Cortez,et al.  The impact of microblogging data for stock market prediction: Using Twitter to predict returns, volatility, trading volume and survey sentiment indices , 2017 .

[89]  Rajashree Dash,et al.  Performance analysis of an evolutionary recurrent Legendre Polynomial Neural Network in application to FOREX prediction , 2017, J. King Saud Univ. Comput. Inf. Sci..

[90]  Huanhuan Chen,et al.  Evolving Least Squares Support Vector Machines for Stock Market Trend Mining , 2009, IEEE Trans. Evol. Comput..

[91]  Nicole J. Saam,et al.  Lotka's law reconsidered: The evolution of publication and citation distributions in scientific fields , 1999, Scientometrics.

[92]  Zhi Xiao,et al.  A multiple support vector machine approach to stock index forecasting with mixed frequency sampling , 2017, Knowl. Based Syst..

[93]  Allan Tucker,et al.  Quantifying StockTwits semantic terms' trading behavior in financial markets: An effective application of decision tree algorithms , 2015, Expert Syst. Appl..

[94]  Luis F. Ortega,et al.  A neuro-wavelet model for the short-term forecasting of high-frequency time series of stock returns , 2013 .

[95]  Dong-jin Noh,et al.  Forecasting trends of high-frequency KOSPI200 index data using learning classifiers , 2012, Expert Syst. Appl..

[96]  Marc J. Schniederjans,et al.  A comparison between Fama and French's model and artificial neural networks in predicting the Chinese stock market , 2005, Comput. Oper. Res..

[97]  Stefan Seuring,et al.  A review of modeling approaches for sustainable supply chain management , 2013, Decis. Support Syst..

[98]  Jun Wang,et al.  Predicting agent-based financial time series model on lattice fractal with random Legendre neural network , 2017, Soft Comput..

[99]  Baikunth Nath,et al.  A fusion model of HMM, ANN and GA for stock market forecasting , 2007, Expert Syst. Appl..