暂无分享,去创建一个
Yevgen Chebotar | Artem Molchanov | Sarah Bechtle | Ludovic Righetti | Franziska Meier | Gaurav Sukhatme | L. Righetti | Yevgen Chebotar | Franziska Meier | G. Sukhatme | Artem Molchanov | Sarah Bechtle
[1] Andrew W. Moore,et al. Efficient memory-based learning for robot control , 1990 .
[2] Dean Pomerleau,et al. Efficient Training of Artificial Neural Networks for Autonomous Navigation , 1991, Neural Computation.
[3] Yoshua Bengio,et al. Learning a synaptic learning rule , 1991, IJCNN-91-Seattle International Joint Conference on Neural Networks.
[4] Yoshua Bengio,et al. Gradient-based learning applied to document recognition , 1998, Proc. IEEE.
[5] Yishay Mansour,et al. Policy Gradient Methods for Reinforcement Learning with Function Approximation , 1999, NIPS.
[6] Andrew Y. Ng,et al. Pharmacokinetics of a novel formulation of ivermectin after administration to goats , 2000, ICML.
[7] Ronald J. Williams,et al. Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning , 2004, Machine Learning.
[8] Pieter Abbeel,et al. Apprenticeship learning via inverse reinforcement learning , 2004, ICML.
[9] Roy Featherstone,et al. Rigid Body Dynamics Algorithms , 2007 .
[10] G. Evans,et al. Learning to Optimize , 2008 .
[11] Stefan Schaal,et al. Robot Programming by Demonstration , 2009, Springer Handbook of Robotics.
[12] Henk Nijmeijer,et al. Robot Programming by Demonstration , 2010, SIMPAR.
[13] Yuval Tassa,et al. Control-limited differential dynamic programming , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).
[14] M. McCall,et al. Rigid Body Dynamics , 2008 .
[15] Sergey Levine,et al. Guided Policy Search , 2013, ICML.
[16] Pieter Abbeel,et al. Gradient Estimation Using Stochastic Computation Graphs , 2015, NIPS.
[17] Olivier Stasse,et al. Whole-body model-predictive control applied to the HRP-2 humanoid , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).
[18] Ryan P. Adams,et al. Gradient-based Hyperparameter Optimization through Reversible Learning , 2015, ICML.
[19] Marcin Andrychowicz,et al. Learning to learn by gradient descent by gradient descent , 2016, NIPS.
[20] Peter L. Bartlett,et al. RL$^2$: Fast Reinforcement Learning via Slow Reinforcement Learning , 2016, ArXiv.
[21] Giambattista Parascandolo,et al. Taming the waves: sine as activation function in deep neural networks , 2017 .
[22] C A Nelson,et al. Learning to Learn , 2017, Encyclopedia of Machine Learning and Data Mining.
[23] Li Zhang,et al. Learning to Learn: Meta-Critic Networks for Sample Efficient Learning , 2017, ArXiv.
[24] Sergey Levine,et al. Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks , 2017, ICML.
[25] Alec Radford,et al. Proximal Policy Optimization Algorithms , 2017, ArXiv.
[26] Paolo Frasconi,et al. Forward and Reverse Gradient-Based Hyperparameter Optimization , 2017, ICML.
[27] David Silver,et al. Meta-Gradient Reinforcement Learning , 2018, NeurIPS.
[28] Karol Hausman,et al. Learning an Embedding Space for Transferable Robot Skills , 2018, ICLR.
[29] Sergey Levine,et al. Meta-Reinforcement Learning of Structured Exploration Strategies , 2018, NeurIPS.
[30] Jascha Sohl-Dickstein,et al. Learning Unsupervised Learning Rules , 2018, ArXiv.
[31] A. Borisov,et al. Rigid Body Dynamics , 2018 .
[32] Lijun Wu,et al. Learning to Teach with Dynamic Loss Functions , 2018, NeurIPS.
[33] Stefan Schaal,et al. Online Learning of a Memory for Learning Rates , 2017, 2018 IEEE International Conference on Robotics and Automation (ICRA).
[34] Pieter Abbeel,et al. Evolved Policy Gradients , 2018, NeurIPS.
[35] Sergey Levine,et al. One-Shot Imitation from Observing Humans via Domain-Adaptive Meta-Learning , 2018, Robotics: Science and Systems.
[36] Dong Yan,et al. Reward Shaping via Meta-Learning , 2019, ArXiv.
[37] Sergey Levine,et al. Unsupervised Learning via Meta-Learning , 2018, ICLR.
[38] S. Levine,et al. Guided Meta-Policy Search , 2019, NeurIPS.
[39] Jan Peters,et al. Deep Lagrangian Networks: Using Physics as Model Prior for Deep Learning , 2019, ICLR.
[40] Artem Molchanov,et al. Generalized Inner Loop Meta-Learning , 2019, ArXiv.