暂无分享,去创建一个
[1] Alex Graves,et al. Automated Curriculum Learning for Neural Networks , 2017, ICML.
[2] J. Andrew Bagnell,et al. Modeling Purposeful Adaptive Behavior with the Principle of Maximum Causal Entropy , 2010 .
[3] Sergey Levine,et al. Meta-World: A Benchmark and Evaluation for Multi-Task and Meta Reinforcement Learning , 2019, CoRL.
[4] Hugo Larochelle,et al. Optimization as a Model for Few-Shot Learning , 2016, ICLR.
[5] Katia Sycara,et al. MAME : Model-Agnostic Meta-Exploration , 2019, CoRL.
[6] Ronald J. Williams,et al. Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning , 2004, Machine Learning.
[7] Yoshua Bengio,et al. The effects of negative adaptation in Model-Agnostic Meta-Learning , 2018, ArXiv.
[8] Peter L. Bartlett,et al. RL$^2$: Fast Reinforcement Learning via Slow Reinforcement Learning , 2016, ArXiv.
[9] Marcin Andrychowicz,et al. Solving Rubik's Cube with a Robot Hand , 2019, ArXiv.
[10] Abhinav Gupta,et al. Robust Adversarial Reinforcement Learning , 2017, ICML.
[11] Jason Weston,et al. Curriculum learning , 2009, ICML '09.
[12] Yang Liu,et al. Stein Variational Policy Gradient , 2017, UAI.
[13] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[14] Sergey Levine,et al. Skew-Fit: State-Covering Self-Supervised Reinforcement Learning , 2019, ICML.
[15] Daan Wierstra,et al. Meta-Learning with Memory-Augmented Neural Networks , 2016, ICML.
[16] Joshua Achiam,et al. On First-Order Meta-Learning Algorithms , 2018, ArXiv.
[17] Pieter Abbeel,et al. Some Considerations on Learning to Explore via Meta-Reinforcement Learning , 2018, ICLR 2018.
[18] Christopher Joseph Pal,et al. Active Domain Randomization , 2019, CoRL.
[19] Rui Wang,et al. Paired Open-Ended Trailblazer (POET): Endlessly Generating Increasingly Complex and Diverse Learning Environments and Their Solutions , 2019, ArXiv.
[20] Gregory Dudek,et al. Learning Domain Randomization Distributions for Transfer of Locomotion Policies , 2019, ArXiv.
[21] Sergey Levine,et al. Trust Region Policy Optimization , 2015, ICML.
[22] Pieter Abbeel,et al. Reverse Curriculum Generation for Reinforcement Learning , 2017, CoRL.
[23] Wojciech Zaremba,et al. Domain randomization for transferring deep neural networks from simulation to the real world , 2017, 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).
[24] Sergey Levine,et al. Efficient Off-Policy Meta-Reinforcement Learning via Probabilistic Context Variables , 2019, ICML.
[25] Sergey Levine,et al. One-Shot Visual Imitation Learning via Meta-Learning , 2017, CoRL.
[26] Oriol Vinyals,et al. Matching Networks for One Shot Learning , 2016, NIPS.
[27] Sergey Levine,et al. Meta-Reinforcement Learning of Structured Exploration Strategies , 2018, NeurIPS.
[28] Tamim Asfour,et al. ProMP: Proximal Meta-Policy Search , 2018, ICLR.
[29] Hong Yu,et al. Meta Networks , 2017, ICML.
[30] Zeb Kurth-Nelson,et al. Learning to reinforcement learn , 2016, CogSci.
[31] Richard S. Zemel,et al. Prototypical Networks for Few-shot Learning , 2017, NIPS.
[32] Pieter Abbeel,et al. A Simple Neural Attentive Meta-Learner , 2017, ICLR.
[33] Yulia Tsvetkov,et al. Learning the Curriculum with Bayesian Optimization for Task-Specific Word Representation Learning , 2016, ACL.
[34] Gregory R. Koch,et al. Siamese Neural Networks for One-Shot Image Recognition , 2015 .
[35] Yuval Tassa,et al. Emergence of Locomotion Behaviours in Rich Environments , 2017, ArXiv.
[36] Sergey Levine,et al. Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks , 2017, ICML.