论文信息 - Deep Hedging: Hedging Derivatives Under Generic Market Frictions Using Reinforcement Learning

Deep Hedging: Hedging Derivatives Under Generic Market Frictions Using Reinforcement Learning

This article discusses a new application of reinforcement learning: to the problem of hedging a portfolio of “over-the-counter” derivatives under under market frictions such as trading costs and liquidity constraints. It is an extended version of our recent work https://www.ssrn.com/abstract=3120710, here using notation more common in the machine learning literature. The objective is to maximize a non-linear risk-adjusted return function by trading in liquid hedging instruments such as equities or listed options. The approach presented here is the first efficient and model-independent algorithm which can be used for such problems at scale.

[1] Lizhong Wu,et al. Optimization of trading systems and portfolios , 1997, Proceedings of the IEEE/IAFE 1997 Computational Intelligence for Financial Engineering (CIFEr).

[2] H. Föllmer,et al. Stochastic Finance: An Introduction in Discrete Time , 2002 .

[3] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.

[4] David W. Lu,et al. Agent Inspired Trading Using Recurrent Reinforcement Learning and LSTM Neural Networks , 2017, 1707.07338.

[5] H. Soner,et al. There is no nontrivial hedging portfolio for option pricing with transaction costs , 1995 .

[6] Igor Halperin. QLBS: Q-Learner in the Black-Scholes (-Merton) Worlds , 2017, ArXiv.

[7] Xin Du,et al. Algorithm Trading using Q-Learning and Recurrent Reinforcement Learning , 2022 .

[8] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.

[9] Francis A. Longstaff,et al. Valuing American Options by Simulation: A Simple Least-Squares Approach , 2001 .

[10] Kurt Hornik,et al. Approximation capabilities of multilayer feedforward networks , 1991, Neural Networks.

[11] F. Black,et al. The Pricing of Options and Corporate Liabilities , 1973, Journal of Political Economy.

[12] S. Heston. A Closed-Form Solution for Options with Stochastic Volatility with Applications to Bond and Currency Options , 1993 .

[13] Peter Bank,et al. Hedging with temporary price impact , 2015, 1510.03223.

[14] S. Peng,et al. Backward Stochastic Differential Equations in Finance , 1997 .

[15] Thomas G. Dietterich. Adaptive computation and machine learning , 1998 .

[16] Zhengyao Jiang,et al. A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem , 2017, ArXiv.

[17] L. Rogers,et al. THE COST OF ILLIQUIDITY AND ITS EFFECTS ON HEDGING , 2010 .