QLBS: Q-Learner in the Black-Scholes (-Merton) Worlds
[1] Peter Dayan, et al. Q-learning, 1992, Machine Learning.
[2] Raphaël Fonteneau, et al. Contributions to Batch Mode Reinforcement Learning, 2011.
[3] L. Bachelier, et al. Théorie de la spéculation, 1900.
[4] Jan Kallsen, et al. Hedging by Sequential Regressions Revisited, 2007.
[5] A. Stuart, et al. Portfolio Selection: Efficient Diversification of Investments, 1959.
[6] Paul A. Samuelson, et al. Rational Theory of Warrant Pricing, 2015.
[7] Andreas J. Grau. Applications of Least-Squares Regressions to Pricing and Hedging of Financial Derivatives, 2008.
[8] Martin Schweizer, et al. Variance-Optimal Hedging in Discrete Time, 1995, Math. Oper. Res.
[9] Hans Föllmer, et al. Hedging by Sequential Regression: An Introduction to the Mathematics of Option Trading, 1988.
[10] Francis A. Longstaff, et al. Valuing American Options by Simulation: A Simple Least-Squares Approach, 2001.
[11] F. Black, et al. The Pricing of Options and Corporate Liabilities, 1973, Journal of Political Economy.
[12] R. C. Merton, et al. Theory of Rational Option Pricing, 2015, World Scientific Reference on Contingent Claims Analysis in Corporate Finance.
[13] Jonas Schmitt. Portfolio Selection Efficient Diversification of Investments, 2016.
[14] A. Gosavi. Finite horizon Markov control with one-step variance penalties, 2010, 2010 48th Annual Allerton Conference on Communication, Control, and Computing (Allerton).
[15] Jean-Philippe Bouchaud, et al. Hedged Monte-Carlo: low variance derivative pricing with objective probabilities, 2000.
[16] Chris Watkins, et al. Learning from delayed rewards, 1989.
[17] Richard S. Sutton, et al. Reinforcement Learning: An Introduction, 1998, IEEE Trans. Neural Networks.
[18] Abhijit Gosavi, et al. Solving Markov Decision Processes via Simulation, 2015.
[19] Charles Elkan, et al. Reinforcement Learning with a Bilinear Q Function, 2011, EWRL.
[20] H. Robbins. A Stochastic Approximation Method, 1951.
[21] Hado van Hasselt, et al. Double Q-learning, 2010, NIPS.
[22] A Numerical Algorithm for Indifference Pricing in Incomplete Markets, 2006.
[23] Jin-Chuan Duan, et al. American option pricing under GARCH by a Markov chain approximation, 2001.
[24] Pierre Geurts, et al. Tree-Based Batch Mode Reinforcement Learning, 2005, J. Mach. Learn. Res.
[25] Susan A. Murphy, et al. A Generalization Error for Q-Learning, 2005, J. Mach. Learn. Res.