Importance Weighted Policy Learning and Adaptation
暂无分享,去创建一个
N. Heess | Y. Teh | Leonard Hasenclever | Guillaume Desjardins | Jakub Sygnowski | Alexandre Galashov | Rae Jeong | Jan Humplik
[1] Rich Caruana,et al. Multitask Learning , 1998, Encyclopedia of Machine Learning and Data Mining.
[2] Jonathan Baxter,et al. A Model of Inductive Bias Learning , 2000, J. Artif. Intell. Res..
[3] Yasemin Altun,et al. Relative Entropy Policy Search , 2010 .
[4] Alex Graves,et al. Playing Atari with Deep Reinforcement Learning , 2013, ArXiv.
[5] Lihong Li,et al. PAC-inspired Option Discovery in Lifelong Reinforcement Learning , 2014, ICML.
[6] Yuval Tassa,et al. Learning Continuous Control Policies by Stochastic Value Gradients , 2015, NIPS.
[7] Roy Fox,et al. Taming the Noise in Reinforcement Learning via Soft Updates , 2015, UAI.
[8] Joshua B. Tenenbaum,et al. Building machines that learn and think like people , 2016, Behavioral and Brain Sciences.
[9] Peter L. Bartlett,et al. RL$^2$: Fast Reinforcement Learning via Slow Reinforcement Learning , 2016, ArXiv.
[10] Marc G. Bellemare,et al. Safe and Efficient Off-Policy Reinforcement Learning , 2016, NIPS.
[11] Yuval Tassa,et al. Learning and Transfer of Modulated Locomotor Controllers , 2016, ArXiv.
[12] Zeb Kurth-Nelson,et al. Learning to reinforcement learn , 2016, CogSci.
[13] Sergey Levine,et al. Reinforcement Learning with Deep Energy-Based Policies , 2017, ICML.
[14] Sergey Levine,et al. Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks , 2017, ICML.
[15] Yee Whye Teh,et al. Distral: Robust multitask reinforcement learning , 2017, NIPS.
[16] Tom Schaul,et al. Successor Features for Transfer in Reinforcement Learning , 2016, NIPS.
[17] Sergey Levine,et al. Probabilistic Model-Agnostic Meta-Learning , 2018, NeurIPS.
[18] Herke van Hoof,et al. Addressing Function Approximation Error in Actor-Critic Methods , 2018, ICML.
[19] Karol Hausman,et al. Learning an Embedding Space for Transferable Robot Skills , 2018, ICLR.
[20] Yuval Tassa,et al. Relative Entropy Regularized Policy Iteration , 2018, ArXiv.
[21] J. Schulman,et al. Reptile: a Scalable Metalearning Algorithm , 2018 .
[22] Sergey Levine,et al. Meta-Reinforcement Learning of Structured Exploration Strategies , 2018, NeurIPS.
[23] Pieter Abbeel,et al. A Simple Neural Attentive Meta-Learner , 2017, ICLR.
[24] Shane Legg,et al. IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures , 2018, ICML.
[25] Sergey Levine,et al. Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor , 2018, ICML.
[26] Martin A. Riedmiller,et al. Learning by Playing - Solving Sparse Reward Tasks from Scratch , 2018, ICML.
[27] Martin A. Riedmiller,et al. Regularized Hierarchical Policies for Compositional Transfer in Robotics , 2019, ArXiv.
[28] Sergey Levine,et al. Efficient Off-Policy Meta-Reinforcement Learning via Probabilistic Context Variables , 2019, ICML.
[29] Yee Whye Teh,et al. Information asymmetry in KL-regularized RL , 2019, ICLR.
[30] Yee Whye Teh,et al. Exploiting Hierarchy for Learning and Transfer in KL-regularized RL , 2019, ArXiv.
[31] Gabriel Dulac-Arnold,et al. Challenges of Real-World Reinforcement Learning , 2019, ArXiv.
[32] Yee Whye Teh,et al. Meta reinforcement learning as task inference , 2019, ArXiv.
[33] Yee Whye Teh,et al. Meta-learning of Sequential Strategies , 2019, ArXiv.
[34] Nicolas Heess,et al. Composing Entropic Policies using Divergence Correction , 2018, ICML.
[35] Luisa M. Zintgraf,et al. VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning , 2019, ICLR.
[36] Alex Smola,et al. Meta-Q-Learning , 2019, ICLR.
[37] Martin A. Riedmiller,et al. Compositional Transfer in Hierarchical Reinforcement Learning , 2019, Robotics: Science and Systems.
[38] Andriy Mnih,et al. Q-Learning in enormous action spaces via amortized approximate maximization , 2020, ArXiv.
[39] Oriol Vinyals,et al. Rapid Learning or Feature Reuse? Towards Understanding the Effectiveness of MAML , 2019, ICLR.