How Does an Approximate Model Help in Reinforcement Learning
暂无分享,去创建一个
Wotao Yin | Lin F. Yang | Fei Feng | W. Yin | Fei Feng
[1] Germán Ros,et al. CARLA: An Open Urban Driving Simulator , 2017, CoRL.
[2] Daniele Calandriello,et al. Sparse multi-task reinforcement learning , 2014, Intelligenza Artificiale.
[3] Ambuj Tewari,et al. Sample Complexity of Reinforcement Learning using Linearly Combined Model Ensembles , 2019, AISTATS.
[4] Yusen Zhan,et al. Scalable lifelong reinforcement learning , 2017, Pattern Recognit..
[5] Marco Wiering,et al. Multi-Agent Reinforcement Learning for Traffic Light control , 2000 .
[6] Katja Hofmann,et al. Meta Reinforcement Learning with Latent Variable Gaussian Processes , 2018, UAI.
[7] Demis Hassabis,et al. Mastering the game of Go with deep neural networks and tree search , 2016, Nature.
[8] Eric Eaton,et al. Online Multi-Task Learning for Policy Gradient Methods , 2014, ICML.
[9] Mengdi Wang,et al. Model-Based Reinforcement Learning with Value-Targeted Regression , 2020, L4DC.
[10] Sergey Levine,et al. Meta-Reinforcement Learning of Structured Exploration Strategies , 2018, NeurIPS.
[11] Jennie Si,et al. Online learning control by association and reinforcement. , 2001, IEEE transactions on neural networks.
[12] Ben Tse,et al. Autonomous Inverted Helicopter Flight via Reinforcement Learning , 2004, ISER.
[13] Hilbert J. Kappen,et al. On the Sample Complexity of Reinforcement Learning with a Generative Model , 2012, ICML.
[14] Alessandro Lazaric,et al. Transfer in Reinforcement Learning: A Framework and a Survey , 2012, Reinforcement Learning.
[15] Richard S. Sutton,et al. Reinforcement Learning is Direct Adaptive Optimal Control , 1992, 1991 American Control Conference.
[16] Andre Esteva,et al. A guide to deep learning in healthcare , 2019, Nature Medicine.
[17] Alessandro Lazaric,et al. Bayesian Multi-Task Reinforcement Learning , 2010, ICML.
[18] Yoonsuck Choe,et al. Directed Exploration in Reinforcement Learning with Transferred Knowledge , 2012, EWRL.
[19] Nan Jiang,et al. PAC Reinforcement Learning With an Imperfect Model , 2018, AAAI.
[20] Lihong Li,et al. PAC-inspired Option Discovery in Lifelong Reinforcement Learning , 2014, ICML.
[21] Pieter Abbeel,et al. Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments , 2017, ICLR.
[22] John N. Tsitsiklis,et al. The Sample Complexity of Exploration in the Multi-Armed Bandit Problem , 2004, J. Mach. Learn. Res..
[23] Peter Stone,et al. Transfer Learning for Reinforcement Learning Domains: A Survey , 2009, J. Mach. Learn. Res..
[24] Alan Fern,et al. Multi-task reinforcement learning: a hierarchical Bayesian approach , 2007, ICML '07.
[25] Kenji Doya,et al. Meta-learning in Reinforcement Learning , 2003, Neural Networks.
[26] Michael L. Littman,et al. Policy and Value Transfer in Lifelong Reinforcement Learning , 2018, ICML.
[27] Xian Wu,et al. Near-Optimal Time and Sample Complexities for Solving Markov Decision Processes with a Generative Model , 2018, NeurIPS.
[28] Marcin Andrychowicz,et al. Hindsight Experience Replay , 2017, NIPS.
[29] Lihong Li,et al. Sample Complexity of Multi-task Reinforcement Learning , 2013, UAI.