Modeling Stock Order Flows and Learning Market-Making from Data

Stock markets employ specialized traders, market-makers, designed to provide liquidity and volume to the market by constantly supplying both supply and demand. In this paper, we demonstrate a novel method for modeling the market as a dynamic system and a reinforcement learning algorithm that learns profitable market-making strategies when run on this model.The sequence of buys and sells for a particular stock, the order flow, we model as an Input-Output Hidden Markov Model fit to historical data. When combined with the dynamics of the order book, this creates a highly non-linear and difficult dynamic system. Our reinforcement learning algorithm, based on likelihood ratios, is run on this partially-observable environment. We demonstrate learning results for two separate real stocks.

[1]  Yoshua Bengio,et al.  Markovian Models for Sequential Data , 2004 .

[2]  Christian R. Shelton,et al.  Policy Improvement for POMDPs Using Normalized Importance Sampling , 2001, UAI.

[3]  Leonid Peshkin,et al.  Bounds on Sample Size for Policy Evaluation in Markov Environments , 2001, COLT/EuroCOLT.

[4]  Christian R. Shelton,et al.  An Electronic Market-maker , 2001 .

[5]  Christian R. Shelton,et al.  Importance sampling for reinforcement learning with multiple objectives , 2001 .

[6]  Tung Chan,et al.  Artificial markets and intelligent agents , 2001 .

[7]  M. Sewell Market Microstructure , 2007 .

[8]  David Easley,et al.  The information content of the trading process , 1997 .

[9]  George Sofianos,et al.  New York Stock Exchange Systems and Trading Procedures , 1996 .

[10]  T. Hesterberg,et al.  Weighted Average Importance Sampling and Defensive Mixture Distributions , 1995 .

[11]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[12]  Maureen O'Hara,et al.  The Microeconomics of Market Making , 1986, Journal of Financial and Quantitative Analysis.

[13]  Paul R. Milgrom,et al.  Bid, ask and transaction prices in a specialist market with heterogeneously informed traders , 1985 .

[14]  Reuven Y. Rubinstein,et al.  Simulation and the Monte Carlo method , 1981, Wiley series in probability and mathematical statistics.

[15]  T. Ho,et al.  Optimal dealer pricing under transactions and return uncertainty , 1981 .

[16]  Y. Amihud,et al.  Dealership market: Market-making with inventory , 1980 .