A Metric Space Perspective on Self-Supervised Policy Adaptation
暂无分享,去创建一个
Karol Hausman | Gabriel Dulac-Arnold | Rico Jonschkowski | Cristian Bodnar | Karol Hausman | Rico Jonschkowski | Gabriel Dulac-Arnold | Cristian Bodnar
[1] Marcin Andrychowicz,et al. Sim-to-Real Transfer of Robotic Control with Dynamics Randomization , 2017, 2018 IEEE International Conference on Robotics and Automation (ICRA).
[2] Yuval Tassa,et al. DeepMind Control Suite , 2018, ArXiv.
[3] Doina Precup,et al. Bisimulation Metrics are Optimal Value Functions , 2014, UAI.
[4] Trevor Darrell,et al. Adversarial Discriminative Domain Adaptation , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[5] Gaurav S. Sukhatme,et al. Never Stop Learning: The Effectiveness of Fine-Tuning in Robotic Reinforcement Learning , 2020 .
[6] Karl Johan Åström,et al. Optimal control of Markov processes with incomplete state information , 1965 .
[7] Wojciech Zaremba,et al. Domain randomization for transferring deep neural networks from simulation to the real world , 2017, 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).
[8] Rico Jonschkowski,et al. The Distracting Control Suite - A Challenging Benchmark for Reinforcement Learning from Pixels , 2021, ArXiv.
[9] Doina Precup,et al. Bisimulation Metrics for Continuous Markov Decision Processes , 2011, SIAM J. Comput..
[10] Doina Precup,et al. Bounding Performance Loss in Approximate MDP Homomorphisms , 2008, NIPS.
[11] Aaron C. Courville,et al. Improved Training of Wasserstein GANs , 2017, NIPS.
[12] Joelle Pineau,et al. Improving Sample Efficiency in Model-Free Reinforcement Learning from Images , 2019, AAAI.
[13] Sergey Levine,et al. Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor , 2018, ICML.
[14] Geoffrey E. Hinton,et al. Layer Normalization , 2016, ArXiv.
[15] Rowan McAllister,et al. Learning Invariant Representations for Reinforcement Learning without Reconstruction , 2020, ICLR.
[16] Leslie Pack Kaelbling,et al. Planning and Acting in Partially Observable Stochastic Domains , 1998, Artif. Intell..
[17] Peter Dayan,et al. Q-learning , 1992, Machine Learning.
[18] Alexei A. Efros,et al. Self-Supervised Policy Adaptation during Deployment , 2020, ICLR.
[19] Dieter Fox,et al. BayesSim: adaptive domain randomization via probabilistic inference for robotics simulators , 2019, Robotics: Science and Systems.
[20] Marc G. Bellemare,et al. DeepMDP: Learning Continuous Latent Space Models for Representation Learning , 2019, ICML.
[21] Karol Hausman,et al. Quantile QT-Opt for Risk-Aware Vision-Based Robotic Grasping , 2019, Robotics: Science and Systems.
[22] Ilya Kostrikov,et al. Image Augmentation Is All You Need: Regularizing Deep Reinforcement Learning from Pixels , 2020, ArXiv.