Deep Reinforcement Learning in Agent Based Financial Market Simulation

Deep learning models have recently achieved some success in predicting financial market data. However, historical financial data pose obstacles that can be prohibitive when searching for investment strategies with deep reinforcement learning: the state space is unknowable, observations are limited, and the impact of one's own actions on the market cannot be modeled. One way to overcome these limitations is to augment real market data with agent-based artificial market simulation. Artificial market simulations designed to reproduce realistic market features can be used to create unobserved market states, to model the impact of one's own investment actions on the market itself, and to train models with as much data as necessary. In this study we propose a framework for training deep reinforcement learning models in agent-based artificial price-order-book simulations that yield non-trivial policies under diverse conditions with market impact. Our simulations confirm that the proposed deep reinforcement learning model, with a unique task-specific reward function, was able to learn a robust investment strategy with an attractive risk-return profile.
