Learning Fast Adaptation With Meta Strategy Optimization
暂无分享,去创建一个
Sehoon Ha | Wenhao Yu | Jie Tan | Yunfei Bai | Erwin Coumans
[1] Sergey Levine,et al. Efficient Off-Policy Meta-Reinforcement Learning via Probabilistic Context Variables , 2019, ICML.
[2] Daniel E. Koditschek,et al. Design Principles for a Family of Direct-Drive Legged Robots , 2016, IEEE Robotics and Automation Letters.
[3] Tamim Asfour,et al. ProMP: Proximal Meta-Policy Search , 2018, ICLR.
[4] Aaas News,et al. Book Reviews , 1893, Buffalo Medical and Surgical Journal.
[5] Jakub W. Pachocki,et al. Learning dexterous in-hand manipulation , 2018, Int. J. Robotics Res..
[6] Greg Turk,et al. Preparing for the Unknown: Learning a Universal Policy with Online System Identification , 2017, Robotics: Science and Systems.
[7] W. Marsden. I and J , 2012 .
[8] Stephen J. Wright. Coordinate descent algorithms , 2015, Mathematical Programming.
[9] Honglak Lee,et al. Data-Efficient Learning for Sim-to-Real Robotic Grasping using Deep Point Cloud Prediction Networks , 2019, ArXiv.
[10] Lorenzo Fagiano,et al. Adaptive model predictive control for constrained linear systems , 2013, 2013 European Control Conference (ECC).
[11] Joonho Lee,et al. Learning agile and dynamic motor skills for legged robots , 2019, Science Robotics.
[12] Siddhartha S. Srinivasa,et al. DART: Dynamic Animation and Robotics Toolkit , 2018, J. Open Source Softw..
[13] Emanuel Todorov,et al. Ensemble-CIO: Full-body dynamic motion planning that transfers to physical humanoids , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).
[14] Andrew J. Davison,et al. Task-Embedded Control Networks for Few-Shot Imitation Learning , 2018, CoRL.
[15] Sergey Levine,et al. Using Simulation and Domain Adaptation to Improve Efficiency of Deep Robotic Grasping , 2018, 2018 IEEE International Conference on Robotics and Automation (ICRA).
[16] Atil Iscen,et al. NoRML: No-Reward Meta Learning , 2019, AAMAS.
[17] Alec Radford,et al. Proximal Policy Optimization Algorithms , 2017, ArXiv.
[18] Danna Zhou,et al. d. , 1934, Microbial pathogenesis.
[19] C. Karen Liu,et al. Policy Transfer with Strategy Optimization , 2018, ICLR.
[20] Abhinav Gupta,et al. Robust Adversarial Reinforcement Learning , 2017, ICML.
[21] Benjamin Recht,et al. Simple random search provides a competitive approach to reinforcement learning , 2018, ArXiv.
[22] Eric Monmasson,et al. Optimization of Perturbative PV MPPT Methods Through Online System Identification , 2014, IEEE Transactions on Industrial Electronics.
[23] Libin Liu,et al. Learning to schedule control fragments for physics-based characters using deep Q-learning , 2017, TOGS.
[24] Doina Precup,et al. The Option-Critic Architecture , 2016, AAAI.
[25] Mrinal Kalakrishnan,et al. Multi-Task Domain Adaptation for Deep Learning of Instance Grasping from Simulation , 2017, 2018 IEEE International Conference on Robotics and Automation (ICRA).
[26] C. Karen Liu,et al. Sim-to-Real Transfer for Biped Locomotion , 2019, 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).
[27] Nikolaus Hansen,et al. On the Adaptation of Arbitrary Normal Mutation Distributions in Evolution Strategies: The Generating Set Adaptation , 1995, ICGA.
[28] Peter Stone,et al. Stochastic Grounded Action Transformation for Robot Learning in Simulation , 2017, 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).
[29] Yuval Tassa,et al. Continuous control with deep reinforcement learning , 2015, ICLR.
[30] Wojciech Zaremba,et al. Domain randomization for transferring deep neural networks from simulation to the real world , 2017, 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).
[31] J. Mockus. Bayesian Approach to Global Optimization: Theory and Applications , 1989 .
[32] Sergey Levine,et al. Sim-To-Real via Sim-To-Sim: Data-Efficient Robotic Grasping via Randomized-To-Canonical Adaptation Networks , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[33] Pieter Abbeel,et al. Evolved Policy Gradients , 2018, NeurIPS.
[34] Sergey Levine,et al. Learning to Adapt in Dynamic, Real-World Environments through Meta-Reinforcement Learning , 2018, ICLR.
[35] Atil Iscen,et al. Sim-to-Real: Learning Agile Locomotion For Quadruped Robots , 2018, Robotics: Science and Systems.
[36] Marcin Andrychowicz,et al. Sim-to-Real Transfer of Robotic Control with Dynamics Randomization , 2017, 2018 IEEE International Conference on Robotics and Automation (ICRA).
[37] Emanuel Todorov,et al. Reinforcement learning for non-prehensile manipulation: Transfer from simulation to physical system , 2018, 2018 IEEE International Conference on Simulation, Modeling, and Programming for Autonomous Robots (SIMPAR).
[38] Sergey Levine,et al. Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks , 2017, ICML.
[39] Alex Graves,et al. Asynchronous Methods for Deep Reinforcement Learning , 2016, ICML.
[40] Ross A. Knepper,et al. DeepMPC: Learning Deep Latent Features for Model Predictive Control , 2015, Robotics: Science and Systems.
[41] Antoine Cully,et al. Robots that can adapt like animals , 2014, Nature.
[42] Claire J. Tomlin,et al. Extensions of learning-based model predictive control for real-time application to a quadrotor helicopter , 2012, 2012 American Control Conference (ACC).
[43] Konstantinos Varelas,et al. Benchmarking large scale variants of CMA-ES and L-BFGS-B on the bbob-largescale testbed , 2019, GECCO.
[44] Sergey Levine,et al. Meta-Learning and Universality: Deep Representations and Gradient Descent can Approximate any Learning Algorithm , 2017, ICLR.
[45] Christopher G. Atkeson,et al. Bayesian Optimization Using Domain Knowledge on the ATRIAS Biped , 2017, 2018 IEEE International Conference on Robotics and Automation (ICRA).
[46] Peter L. Bartlett,et al. RL$^2$: Fast Reinforcement Learning via Slow Reinforcement Learning , 2016, ArXiv.
[47] Jonas Buchli,et al. Why off-the-shelf physics simulators fail in evaluating feedback controller performance - a case study for quadrupedal robots , 2016 .