Learning Domain Randomization Distributions for Training Robust Locomotion Policies
暂无分享,去创建一个
David Meger | Gregory Dudek | Juan Camilo Gamboa Higuera | Melissa Mozifian | Gregory Dudek | D. Meger | Melissa Mozifian | J. A. G. Higuera
[1] G. Zames. Feedback and optimal sensitivity: Model reference transformations, multiplicative seminorms, and approximate inverses , 1981 .
[2] Rich Caruana,et al. Multitask Learning: A Knowledge-Based Source of Inductive Bias , 1993, ICML.
[3] Demis Hassabis,et al. Mastering the game of Go with deep neural networks and tree search , 2016, Nature.
[4] OpenAI. Learning Dexterous In-Hand Manipulation. , 2018 .
[5] Wojciech Zaremba,et al. OpenAI Gym , 2016, ArXiv.
[6] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[7] Tao Chen,et al. Hardware Conditioned Policies for Multi-Robot Transfer Learning , 2018, NeurIPS.
[8] J. Doyle,et al. Essentials of Robust Control , 1997 .
[9] Marcin Andrychowicz,et al. Sim-to-Real Transfer of Robotic Control with Dynamics Randomization , 2017, 2018 IEEE International Conference on Robotics and Automation (ICRA).
[10] Inman Harvey,et al. Noise and the Reality Gap: The Use of Simulation in Evolutionary Robotics , 1995, ECAL.
[11] Jonathan Baxter,et al. A Bayesian/Information Theoretic Model of Learning to Learn via Multiple Task Sampling , 1997, Machine Learning.
[12] Greg Turk,et al. Preparing for the Unknown: Learning a Universal Policy with Online System Identification , 2017, Robotics: Science and Systems.
[13] Shimon Whiteson,et al. Contextual Policy Optimisation , 2018, ArXiv.
[14] Christopher Joseph Pal,et al. Active Domain Randomization , 2019, CoRL.
[15] Dieter Fox,et al. BayesSim: adaptive domain randomization via probabilistic inference for robotics simulators , 2019, Robotics: Science and Systems.
[16] Sergey Levine,et al. Efficient Off-Policy Meta-Reinforcement Learning via Probabilistic Context Variables , 2019, ICML.
[17] Balaraman Ravindran,et al. EPOpt: Learning Robust Neural Network Policies Using Model Ensembles , 2016, ICLR.
[18] Alec Radford,et al. Proximal Policy Optimization Algorithms , 2017, ArXiv.
[19] Dawn Xiaodong Song,et al. Assessing Generalization in Deep Reinforcement Learning , 2018, ArXiv.
[20] Yevgen Chebotar,et al. Closing the Sim-to-Real Loop: Adapting Simulation Randomization with Real World Experience , 2018, 2019 International Conference on Robotics and Automation (ICRA).
[21] C. Karen Liu,et al. Policy Transfer with Strategy Optimization , 2018, ICLR.
[22] Shie Mannor,et al. Optimizing the CVaR via Sampling , 2014, AAAI.