[1] Filip De Turck,et al. VIME: Variational Information Maximizing Exploration , 2016, NIPS.
[2] Tom Schaul,et al. Dueling Network Architectures for Deep Reinforcement Learning , 2015, ICML.
[3] Sean Gerrish,et al. Black Box Variational Inference , 2013, AISTATS.
[4] Benjamin Van Roy,et al. Bootstrapped Thompson Sampling and Deep Exploration , 2015, ArXiv.
[5] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[6] David M. Blei,et al. Variational Inference: A Review for Statisticians , 2016, ArXiv.
[7] Yuval Tassa,et al. Continuous control with deep reinforcement learning , 2015, ICLR.
[8] W. R. Thompson. On the Likelihood that One Unknown Probability Exceeds Another in View of the Evidence of Two Samples , 1933, Biometrika.
[9] Dustin Tran,et al. Automatic Differentiation Variational Inference , 2016, J. Mach. Learn. Res.
[10] David Barber,et al. Variational methods for Reinforcement Learning , 2010, AISTATS.
[11] R. J. Williams,et al. Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning , 1992, Machine Learning.
[12] Shane Legg,et al. Noisy Networks for Exploration , 2017, ICLR.
[13] Wojciech Zaremba,et al. OpenAI Gym , 2016, ArXiv.
[14] Tom Schaul,et al. Prioritized Experience Replay , 2015, ICLR.
[15] Dustin Tran,et al. Deep Probabilistic Programming , 2017, ICLR.
[16] Benjamin Van Roy,et al. Deep Exploration via Bootstrapped DQN , 2016, NIPS.
[17] Demis Hassabis,et al. Mastering the game of Go with deep neural networks and tree search , 2016, Nature.
[18] Pieter Abbeel,et al. Benchmarking Deep Reinforcement Learning for Continuous Control , 2016, ICML.
[19] Sergey Levine,et al. Trust Region Policy Optimization , 2015, ICML.
[20] Alex Graves,et al. Playing Atari with Deep Reinforcement Learning , 2013, ArXiv.
[21] David Silver,et al. Deep Reinforcement Learning with Double Q-Learning , 2015, AAAI.
[22] Sergey Levine,et al. End-to-End Training of Deep Visuomotor Policies , 2015, J. Mach. Learn. Res.
[23] Emanuel Todorov,et al. General duality between optimal control and estimation , 2008, 2008 47th IEEE Conference on Decision and Control.