Adaptive stock trading strategies with deep reinforcement learning methods

Abstract The increasing complexity and dynamical property in stock markets are key challenges of the financial industry, in which inflexible trading strategies designed by experienced financial practitioners fail to achieve satisfactory performance in all market conditions. To meet this challenge, adaptive stock trading strategies with deep reinforcement learning methods are proposed. For the time-series nature of stock market data, the Gated Recurrent Unit (GRU) is applied to extract informative financial features, which can represent the intrinsic characteristics of the stock market for adaptive trading decisions. Furthermore, with the tailored design of state and action spaces, two trading strategies with reinforcement learning methods are proposed as GDQN (Gated Deep Q-learning trading strategy) and GDPG (Gated Deterministic Policy Gradient trading strategy). To verify the robustness and effectiveness of GDQN and GDPG, they are tested both in the trending and in the volatile stock market from different countries. Experimental results show that the proposed GDQN and GDPG not only outperform the Turtle trading strategy but also achieve more stable returns than a state-of-the-art direct reinforcement learning method, DRL trading strategy, in the volatile stock market. As far as the GDQN and the GDPG are compared, experimental results demonstrate that the GDPG with an actor-critic framework is more stable than the GDQN with a critic-only framework in the ever-evolving stock market.

[1]  Demis Hassabis,et al.  A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play , 2018, Science.

[2]  Yi Wu,et al.  Robust Multi-Agent Reinforcement Learning via Minimax Deep Deterministic Policy Gradient , 2019, AAAI.

[3]  Thomas Fischer,et al.  Deep learning with long short-term memory networks for financial market predictions , 2017, Eur. J. Oper. Res..

[4]  Richard Socher,et al.  Ask Me Anything: Dynamic Memory Networks for Natural Language Processing , 2015, ICML.

[5]  Nuno Horta,et al.  Reinforcement learning applied to Forex trading , 2018, Appl. Soft Comput..

[6]  Jie Xu,et al.  Data-driven dynamic resource scheduling for network slicing: A Deep reinforcement learning approach , 2019, Inf. Sci..

[7]  Demis Hassabis,et al.  Mastering the game of Go without human knowledge , 2017, Nature.

[8]  Duc Duong,et al.  A New Model for Stock Price Movements Prediction Using Deep Neural Network , 2017, SoICT.

[9]  Matthew Saffell,et al.  Reinforcement Learning for Trading Systems and Portfolios , 1998, KDD.

[10]  Yan Liu,et al.  Recurrent Neural Networks for Multivariate Time Series with Missing Values , 2016, Scientific Reports.

[11]  Hui Li,et al.  Class-imbalanced dynamic financial distress prediction based on Adaboost-SVM ensemble combined with SMOTE and time weighting , 2020, Inf. Fusion.

[12]  Yu Song,et al.  Application of artificial neural network for the prediction of stock market returns: The case of the Japanese stock market , 2016 .

[13]  Xiao Zhong,et al.  Forecasting daily stock market return using dimensionality reduction , 2017, Expert Syst. Appl..

[14]  Youyong Kong,et al.  Deep Direct Reinforcement Learning for Financial Signal Representation and Trading , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[15]  Yulei Rao,et al.  A deep learning framework for financial time series using stacked autoencoders and long-short term memory , 2017, PloS one.

[16]  Tao Mei,et al.  Learning Multi-attention Convolutional Neural Network for Fine-Grained Image Recognition , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[17]  Jie Sun,et al.  Dynamic financial distress prediction with concept drift based on time weighting combined with Adaboost support vector machine ensemble , 2017, Knowl. Based Syst..

[18]  Chong Wang,et al.  Deep Speech 2 : End-to-End Speech Recognition in English and Mandarin , 2015, ICML.

[19]  Lirong Zheng,et al.  Automated trading systems statistical and machine learning methods and hardware implementation: a survey , 2018, Enterp. Inf. Syst..

[20]  Michael T. M. Emmerich,et al.  Application of portfolio optimization to drug discovery , 2019, Inf. Sci..

[21]  David Silver,et al.  Deep Reinforcement Learning with Double Q-Learning , 2015, AAAI.

[22]  Marco Pavone,et al.  Cellular Network Traffic Scheduling With Deep Reinforcement Learning , 2018, AAAI.

[23]  Xiaohui Xie,et al.  Co-Occurrence Feature Learning for Skeleton Based Action Recognition Using Regularized Deep LSTM Networks , 2016, AAAI.

[24]  Hamido Fujita,et al.  The autonomous navigation and obstacle avoidance for USVs with ANOA deep reinforcement learning method , 2020, Knowl. Based Syst..

[25]  Guillaume Perrin,et al.  Adaptive early classification of temporal sequences using deep reinforcement learning , 2020, Knowl. Based Syst..

[26]  Yingbin Liang,et al.  Two Time-scale Off-Policy TD Learning: Non-asymptotic Analysis over Markovian Samples , 2019, NeurIPS.

[27]  Wei Wu,et al.  A novel multi-step Q-learning method to improve data efficiency for deep reinforcement learning , 2019, Knowl. Based Syst..

[28]  Yike Guo,et al.  The assessment of small bowel motility with attentive deformable neural network , 2020, Inf. Sci..

[29]  Shui-Ling Yu,et al.  Forecasting Stock Price Index Volatility with LSTM Deep Neural Network , 2018 .

[30]  Danilo Sulino Silveira Pinto,et al.  Robot position control in pipes using Q Learning , 2016, 2016 IEEE International Conference on Systems, Man, and Cybernetics (SMC).

[31]  Huai-Ning Wu,et al.  Policy Gradient Adaptive Dynamic Programming for Data-Based Optimal Control , 2017, IEEE Transactions on Cybernetics.

[32]  Luigi Troiano,et al.  Replicating a Trading Strategy by Means of LSTM for Financial Industry Applications , 2018, IEEE Transactions on Industrial Informatics.

[33]  Yike Guo,et al.  Small bowel motility assessment based on fully convolutional networks and long short-term memory , 2017, Knowl. Based Syst..

[34]  Erik Cambria,et al.  Recent Trends in Deep Learning Based Natural Language Processing , 2017, IEEE Comput. Intell. Mag..

[35]  Yi Wu,et al.  Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments , 2017, NIPS.

[36]  Dimitrios Vezeris,et al.  AdTurtle: An Advanced Turtle Trading System , 2019, Journal of Risk and Financial Management.

[37]  W. Ongsakul,et al.  Sortino ratio based portfolio optimization considering EVs and renewable energy in microgrid power market , 2017, 2017 IEEE Power & Energy Society General Meeting.

[38]  Johann Marius Zöllner,et al.  Learning how to drive in a real world simulation with deep Q-Networks , 2017, 2017 IEEE Intelligent Vehicles Symposium (IV).

[39]  Angela Yao,et al.  Complex Gated Recurrent Neural Networks , 2018, NeurIPS.

[40]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[41]  Youyong Kong,et al.  Sparse Coding-Inspired Optimal Trading System for HFT Industry , 2015, IEEE Transactions on Industrial Informatics.