Meta-Reinforcement Learning of Structured Exploration Strategies
暂无分享,去创建一个
Sergey Levine | Pieter Abbeel | Abhishek Gupta | Yuxuan Liu | Russell Mendonca | S. Levine | P. Abbeel | Abhishek Gupta | Yuxuan Liu | Russell Mendonca | Yuxuan Liu
[1] R. J. Williams,et al. Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning , 2004, Machine Learning.
[2] Sebastian Thrun,et al. Learning to Learn: Introduction and Overview , 1998, Learning to Learn.
[3] Sepp Hochreiter,et al. Learning to Learn Using Gradient Descent , 2001, ICANN.
[4] Ronen I. Brafman,et al. R-MAX - A General Polynomial Time Algorithm for Near-Optimal Reinforcement Learning , 2001, J. Mach. Learn. Res..
[5] Nuttapong Chentanez,et al. Intrinsically Motivated Reinforcement Learning , 2004, NIPS.
[6] Michael Kearns,et al. Near-Optimal Reinforcement Learning in Polynomial Time , 2002, Machine Learning.
[7] Emanuel Todorov,et al. Linearly-solvable Markov decision problems , 2006, NIPS.
[8] Yoshua Bengio,et al. On the Optimization of a Synaptic Learning Rule , 2007 .
[9] Michael L. Littman,et al. An analysis of model-based Interval Estimation for Markov Decision Processes , 2008, J. Comput. Syst. Sci..
[10] C. Stachniss,et al. Learning Omnidirectional Path Following Using Dimensionality Reduction , 2008 .
[11] Lihong Li,et al. An Empirical Evaluation of Thompson Sampling , 2011, NIPS.
[12] Pierre-Yves Oudeyer,et al. Exploration in Model-based Reinforcement Learning by Empirically Estimating Learning Progress , 2012, NIPS.
[13] Sergey Levine,et al. Incentivizing Exploration In Reinforcement Learning With Deep Predictive Models , 2015, ArXiv.
[14] Sergey Levine,et al. Trust Region Policy Optimization , 2015, ICML.
[15] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[16] Gregory R. Koch,et al. Siamese Neural Networks for One-Shot Image Recognition , 2015 .
[17] Yuval Tassa,et al. Continuous control with deep reinforcement learning , 2015, ICLR.
[18] Pieter Abbeel,et al. Benchmarking Deep Reinforcement Learning for Continuous Control , 2016, ICML.
[19] Filip De Turck,et al. VIME: Variational Information Maximizing Exploration , 2016, NIPS.
[20] Marcin Andrychowicz,et al. Learning to learn by gradient descent by gradient descent , 2016, NIPS.
[21] Benjamin Van Roy,et al. Deep Exploration via Bootstrapped DQN , 2016, NIPS.
[22] Tom Schaul,et al. Unifying Count-Based Exploration and Intrinsic Motivation , 2016, NIPS.
[23] Peter L. Bartlett,et al. RL$^2$: Fast Reinforcement Learning via Slow Reinforcement Learning , 2016, ArXiv.
[24] Sergey Levine,et al. End-to-End Training of Deep Visuomotor Policies , 2015, J. Mach. Learn. Res..
[25] Oriol Vinyals,et al. Matching Networks for One Shot Learning , 2016, NIPS.
[26] Bartunov Sergey,et al. Meta-Learning with Memory-Augmented Neural Networks , 2016 .
[27] Pieter Abbeel,et al. Meta-Learning with Temporal Convolutions , 2017, ArXiv.
[28] Zeb Kurth-Nelson,et al. Learning to reinforcement learn , 2016, CogSci.
[29] Hugo Larochelle,et al. Optimization as a Model for Few-Shot Learning , 2016, ICLR.
[30] Pieter Abbeel,et al. Stochastic Neural Networks for Hierarchical Reinforcement Learning , 2016, ICLR.
[31] Sergey Levine,et al. One-Shot Visual Imitation Learning via Meta-Learning , 2017, CoRL.
[32] Sergey Levine,et al. Reinforcement Learning with Deep Energy-Based Policies , 2017, ICML.
[33] Li Zhang,et al. Learning to Learn: Meta-Critic Networks for Sample Efficient Learning , 2017, ArXiv.
[34] Richard S. Zemel,et al. Prototypical Networks for Few-shot Learning , 2017, NIPS.
[35] Sergey Levine,et al. Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks , 2017, ICML.
[36] Hang Li,et al. Meta-SGD: Learning to Learn Quickly for Few Shot Learning , 2017, ArXiv.
[37] Marcin Andrychowicz,et al. Parameter Space Noise for Exploration , 2017, ICLR.
[38] Sergey Levine,et al. Meta-Learning and Universality: Deep Representations and Gradient Descent can Approximate any Learning Algorithm , 2017, ICLR.
[39] Shane Legg,et al. Noisy Networks for Exploration , 2017, ICLR.
[40] Karol Hausman,et al. Learning an Embedding Space for Transferable Robot Skills , 2018, ICLR.
[41] Pieter Abbeel,et al. Some Considerations on Learning to Explore via Meta-Reinforcement Learning , 2018, ICLR 2018.
[42] Sergey Levine,et al. Learning Complex Dexterous Manipulation with Deep Reinforcement Learning and Demonstrations , 2017, Robotics: Science and Systems.