Comparing the effectiveness of deep feedforward neural networks and shallow architectures for predicting stock price indices

Many existing learning algorithms suffer from limited architectural depth and the locality of estimators, making it difficult to generalize from the test set and providing inefficient and biased estimators. Deep architectures have been shown to appropriately learn correlation structures in time series data. This paper compares the effectiveness of a deep feedforward Neural Network (DNN) and shallow architectures (e.g., Support Vector Machine (SVM) and one-layer NN) when predicting a broad cross-section of stock price indices in both developed and emerging markets. An extensive evaluation is undertaken, using daily, hourly, minute and tick level data related to thirty-four financial indices from 32 countries across six years. Our evaluation results show a considerable advantage from training deep (cf. shallow) architectures, using a rectifier linear (RELU) activation function, across all thirty-four markets when ‘minute’ data is used. However, the predictive performance of DNN was not significantly better than that of shallower architectures when using tick level data. This result suggests that when training a DNN algorithm, the predictive accuracy peaks, regardless of training size. We also examine which activation function works best for stock price index data. Our results demonstrate that the RELU activation function performs better than TANH across all markets and time horizons when using DNN to predict stock price indices.

[1]  F. N. David,et al.  Principles and procedures of statistics. , 1961 .

[2]  Jason Weston,et al.  Large-scale kernel machines , 2007 .

[3]  Stavros Degiannakis,et al.  Volatility forecasting: intra-day versus inter-day models , 2008 .

[4]  Qi Xu,et al.  Fuzzy support vector machine for classification of EEG signals using wavelet-based features. , 2009, Medical engineering & physics.

[5]  T. Poggio,et al.  Deep vs. shallow networks : An approximation theory perspective , 2016, ArXiv.

[6]  Synho Do,et al.  How much data is needed to train a medical image deep learning system to achieve necessary high accuracy , 2015, 1511.06348.

[7]  Alexandra Gabriela Ţiţan The Efficient Market Hypothesis: Review of Specialized Literature and Empirical Research☆ , 2015 .

[8]  Camille Couprie,et al.  Learning Hierarchical Features for Scene Labeling , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Hsinchun Chen,et al.  Textual analysis of stock market prediction using breaking financial news: The AZFin text system , 2009, TOIS.

[10]  Campbell R. Harvey,et al.  PREDICTABLE RISK AND RETURNS IN EMERGING MARKETS , 1999 .

[11]  A. Lo Long-Term Memory in Stock Market Prices , 1989 .

[12]  Yoshua Bengio,et al.  Deep Sparse Rectifier Neural Networks , 2011, AISTATS.

[13]  Yudong Zhang,et al.  Stock market prediction of S&P 500 via combination of improved BCO approach and BP neural network , 2009, Expert Syst. Appl..

[14]  Chulwoo Han,et al.  Deep learning networks for stock market analysis and prediction: Methodology, data representations, and case studies , 2017, Expert Syst. Appl..

[15]  Luis E. Zárate,et al.  Applying Artificial Neural Networks to prediction of stock price and improvement of the directional prediction index - Case study of PETR4, Petrobras, Brazil , 2013, Expert Syst. Appl..

[16]  Yoshua Bengio,et al.  Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling , 2014, ArXiv.

[17]  Robert P. Sheridan,et al.  Deep Neural Nets as a Method for Quantitative Structure-Activity Relationships , 2015, J. Chem. Inf. Model..

[18]  C. Granger,et al.  Efficient Market Hypothesis and Forecasting , 2002 .

[19]  Nello Cristianini,et al.  Support Vector Machines and Kernel Methods: The New Generation of Learning Machines , 2002, AI Mag..

[20]  J. J. Yang,et al.  International diversification with frontier markets , 2011 .

[21]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[22]  Rongda Chen,et al.  A SVM Stock Selection Model within PCA , 2014, ITQM.

[23]  Xiang Zhang,et al.  OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks , 2013, ICLR.

[24]  Yoshua Bengio,et al.  Deep Learning of Representations: Looking Forward , 2013, SLSP.

[25]  Nicolas Huck,et al.  Deep neural networks, gradient-boosted trees, random forests: Statistical arbitrage on the S&P 500 , 2017, Eur. J. Oper. Res..

[26]  Khaled Rasheed,et al.  Stock market prediction with multiple classifiers , 2007, Applied Intelligence.

[27]  Jason Weston,et al.  Question Answering with Subgraph Embeddings , 2014, EMNLP.

[28]  E. Fama EFFICIENT CAPITAL MARKETS: A REVIEW OF THEORY AND EMPIRICAL WORK* , 1970 .

[29]  Shouyang Wang,et al.  Forecasting stock market movement direction with support vector machine , 2005, Comput. Oper. Res..

[30]  Brendan J. Frey,et al.  Deep learning of the tissue-regulated splicing code , 2014, Bioinform..

[31]  Yoshua. Bengio,et al.  Learning Deep Architectures for AI , 2007, Found. Trends Mach. Learn..

[32]  Evelita E. Celis,et al.  Political Cycle and Stock Market - The Case of Malaysia , 2015 .

[33]  David Haussler,et al.  Exploiting Generative Models in Discriminative Classifiers , 1998, NIPS.

[34]  Yevgeniy V. Bodyanskiy,et al.  Neural network approach to forecasting of quasiperiodic financial time series , 2006, Eur. J. Oper. Res..

[35]  Carl E. Rasmussen,et al.  In Advances in Neural Information Processing Systems , 2011 .

[36]  E. Fama,et al.  Efficient Capital Markets : II , 2007 .

[37]  Yann LeCun,et al.  What is the best multi-stage architecture for object recognition? , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[38]  Geoffrey E. Hinton,et al.  A Learning Algorithm for Boltzmann Machines , 1985, Cogn. Sci..

[39]  Geoffrey Zweig,et al.  Recent advances in deep learning for speech research at Microsoft , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[40]  Stefan Lessmann,et al.  Towards a methodology for measuring the true degree of efficiency in a speculative market , 2011, J. Oper. Res. Soc..

[41]  Rich Caruana,et al.  Do Deep Nets Really Need to be Deep? , 2013, NIPS.

[42]  Michel Ballings,et al.  Evaluating multiple classifiers for stock price direction prediction , 2015, Expert Syst. Appl..

[43]  P. Silvapulle,et al.  LONG-TERM MEMORY IN STOCK MARKET RETURNS: INTERNATIONAL EVIDENCE , 2001 .

[44]  John Yearwood,et al.  Predicting Australian Stock Market Index Using Neural Networks Exploiting Dynamical Swings and Intermarket Influences , 2003, J. Res. Pract. Inf. Technol..

[45]  Ömer Kaan Baykan,et al.  Predicting direction of stock price index movement using artificial neural networks and support vector machines: The sample of the Istanbul Stock Exchange , 2011, Expert Syst. Appl..

[46]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[47]  Chenn-Jung Huang,et al.  Application of wrapper approach and composite classifier to the stock trend prediction , 2008, Expert Syst. Appl..

[48]  Pei-Chann Chang,et al.  A neural network with a case based dynamic window for stock trading prediction , 2009, Expert Syst. Appl..

[49]  John M. Griffin,et al.  Do Market Efficiency Measures Yield Correct Inferences? A Comparison of Developed and Emerging Markets , 2010 .

[50]  Yann LeCun,et al.  Large-scale Learning with SVM and Convolutional for Generic Object Categorization , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[51]  Kamil Zbikowski,et al.  Using Volume Weighted Support Vector Machines with walk forward testing and feature selection for the purpose of creating stock trading strategy , 2015, Expert Syst. Appl..

[52]  Michael J. Schill,et al.  The Illusory Nature of Momentum Profits , 2004 .

[53]  Ken Perlin,et al.  Real-Time Continuous Pose Recovery of Human Hands Using Convolutional Networks , 2014, ACM Trans. Graph..

[54]  John A. Cole,et al.  Random Walks and Market Efficiency Tests: Evidence from Emerging Equity Markets , 1999 .

[55]  Diego Klabjan,et al.  Classification-Based Financial Markets Prediction Using Deep Neural Networks , 2016, Algorithmic Finance.

[56]  Jason Weston,et al.  A user's guide to support vector machines. , 2010, Methods in molecular biology.

[57]  Sahil Shah,et al.  Predicting stock and stock price index movement using Trend Deterministic Data Preparation and machine learning techniques , 2015, Expert Syst. Appl..

[58]  Geoffrey E. Hinton,et al.  Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.

[59]  Gbenga Ibikunle,et al.  How random are intraday stock prices? Evidence from deep learning , 2017 .

[60]  Campbell R. Harvey,et al.  Emerging Equity Market Volatility , 1995 .

[61]  Thomas Fischer,et al.  Deep learning with long short-term memory networks for financial market predictions , 2017, Eur. J. Oper. Res..

[62]  Geoffrey E. Hinton,et al.  Restricted Boltzmann machines for collaborative filtering , 2007, ICML '07.

[63]  Allan Timmermann,et al.  Is the Distribution of Stock Returns Predictable? , 2008 .

[64]  Bernhard E. Boser,et al.  A training algorithm for optimal margin classifiers , 1992, COLT '92.

[65]  Yee Whye Teh,et al.  A Fast Learning Algorithm for Deep Belief Nets , 2006, Neural Computation.

[66]  Marc-André Mittermayer,et al.  Forecasting Intraday stock price trends with text mining techniques , 2004, 37th Annual Hawaii International Conference on System Sciences, 2004. Proceedings of the.

[67]  Yan Wu,et al.  Convolutional deep belief networks for feature extraction of EEG signal , 2014, 2014 International Joint Conference on Neural Networks (IJCNN).

[68]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[69]  Yoshua Bengio,et al.  Scaling learning algorithms towards AI , 2007 .

[70]  Jason Weston,et al.  Natural Language Processing (Almost) from Scratch , 2011, J. Mach. Learn. Res..

[71]  P. Albuquerque,et al.  PORTFOLIO SELECTION WITH SUPPORT VECTOR MACHINES IN LOW ECONOMIC PERSPECTIVES IN EMERGING MARKETS , 2015 .

[72]  Colm Kearney,et al.  Emerging Markets Research: Trends, Issues and Future Directions , 2012 .

[73]  R. Brooks,et al.  Thai financial markets and political change , 2014 .

[74]  Gerald Penn,et al.  Applying Convolutional Neural Networks concepts to hybrid NN-HMM model for speech recognition , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[75]  Nello Cristianini,et al.  Kernel Methods for Pattern Analysis , 2003, ICTAI.

[76]  Kunihiko Fukushima,et al.  Neocognitron for handwritten digit recognition , 2003, Neurocomputing.

[77]  Stefan Lessmann,et al.  Bridging the divide in financial market forecasting: machine learners vs. financial economists , 2016, Expert Syst. Appl..

[78]  Dong Yu,et al.  Feature engineering in Context-Dependent Deep Neural Networks for conversational speech transcription , 2011, 2011 IEEE Workshop on Automatic Speech Recognition & Understanding.

[79]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[80]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[81]  Vilma Deltuvaitė Transmission of Shocks through Stock Markets Channel: The Case of the CEECs , 2016 .

[82]  Ammar Belatreche,et al.  Evaluating machine learning classification for financial trading: An empirical approach , 2016, Expert Syst. Appl..

[83]  Michael Y. Hu,et al.  Forecasting with artificial neural networks: The state of the art , 1997 .

[84]  Marc'Aurelio Ranzato,et al.  Building high-level features using large scale unsupervised learning , 2011, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.