Mid-price prediction based on machine learning methods with technical and quantitative indicators

Stock price prediction is a challenging task in which machine learning methods have recently been used successfully. In this paper, we extract over 270 hand-crafted features (factors) inspired by technical indicators and quantitative analysis, and we test their validity for short-term mid-price movement prediction on Nordic TotalView-ITCH stocks. The resulting feature list is one of the most extensive in the field of financial feature engineering. We focus on a wrapper feature selection method with three selection criteria: entropy, least-mean-squares, and linear discriminant analysis. We also introduce a novel quantitative feature based on adaptive logistic regression for online learning. This feature is consistently selected first among the large number of indicators used in this study. We further examine the best combinations of features using a high-frequency Nordic limit order book database. Our results suggest that, with suitable feature-sorting methods and classifiers, the best classification performance can be reached with a combination of only a few advanced hand-crafted features.
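As a rough illustration of what wrapper feature selection means here, the sketch below greedily adds the feature that most reduces the error of a least-squares fit. It is a minimal sketch under stated assumptions, not the procedure used in the paper: the function names `lms_error` and `forward_select`, the use of training error rather than a held-out score, and the ±1 label encoding are all hypothetical choices made for brevity.

```python
import numpy as np


def lms_error(X, y):
    """Mean squared residual of a least-squares linear fit of y on X plus a bias.

    Assumption: y holds numeric labels (e.g. +1 / -1 for up / down moves).
    """
    A = np.column_stack([X, np.ones(len(X))])  # append a bias column
    w, *_ = np.linalg.lstsq(A, y, rcond=None)  # ordinary least-squares solution
    residual = A @ w - y
    return float(np.mean(residual ** 2))


def forward_select(X, y, k):
    """Greedy forward wrapper selection of at most k feature indices.

    At each step, every remaining feature is tried together with the
    already selected ones, and the one giving the lowest error is kept.
    """
    selected = []
    remaining = list(range(X.shape[1]))
    while remaining and len(selected) < k:
        errors = [lms_error(X[:, selected + [j]], y) for j in remaining]
        best = remaining[int(np.argmin(errors))]
        selected.append(best)
        remaining.remove(best)
    return selected
```

Calling `forward_select(X, y, 5)` on a feature matrix `X` (one row per event, one column per hand-crafted feature) returns the indices of five features in the order they were picked. A practical wrapper would typically score candidates on held-out data, and the entropy or linear discriminant analysis criteria could be swapped in for `lms_error`.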
