Adversarial Inverse Reinforcement Learning With Self-Attention Dynamics Model
暂无分享,去创建一个
Lantao Yu | Bolei Zhou | Jiankai Sun | Pinqian Dong | Bo Lu
[1] Jürgen Schmidhuber,et al. Recurrent World Models Facilitate Policy Evolution , 2018, NeurIPS.
[2] Pieter Abbeel,et al. Learning for control from multiple demonstrations , 2008, ICML '08.
[3] Sergey Levine,et al. Trust Region Policy Optimization , 2015, ICML.
[4] Sergey Levine,et al. Guided Cost Learning: Deep Inverse Optimal Control via Policy Optimization , 2016, ICML.
[5] Eduardo F. Morales,et al. An Introduction to Reinforcement Learning , 2011 .
[6] Stefano Ermon,et al. Generative Adversarial Imitation Learning , 2016, NIPS.
[7] Cewu Lu,et al. Transferable Active Grasping and Real Embodied Dataset , 2020, 2020 IEEE International Conference on Robotics and Automation (ICRA).
[8] Sergey Levine,et al. Learning Robust Rewards with Adversarial Inverse Reinforcement Learning , 2017, ICLR 2017.
[9] Yee Whye Teh,et al. The Concrete Distribution: A Continuous Relaxation of Discrete Random Variables , 2016, ICLR.
[10] Karol Gregor,et al. Neural Variational Inference and Learning in Belief Networks , 2014, ICML.
[11] Learning a Decision Module by Imitating Driver's Control Behaviors , 2019, CoRL.
[12] Ronald J. Williams,et al. Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning , 2004, Machine Learning.
[13] Carl E. Rasmussen,et al. PILCO: A Model-Based and Data-Efficient Approach to Policy Search , 2011, ICML.
[14] Shie Mannor,et al. End-to-End Differentiable Adversarial Imitation Learning , 2017, ICML.
[15] Sergey Levine,et al. Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models , 2018, NeurIPS.
[16] James Bergstra,et al. Benchmarking Reinforcement Learning Algorithms on Real-World Robots , 2018, CoRL.
[17] Anind K. Dey,et al. Maximum Entropy Inverse Reinforcement Learning , 2008, AAAI.
[18] Mohammad Norouzi,et al. Dream to Control: Learning Behaviors by Latent Imagination , 2019, ICLR.
[19] Bolei Zhou,et al. Neuro-Symbolic Program Search for Autonomous Driving Decision Module Design , 2020, CoRL.
[20] Wolfram Burgard,et al. Learning driving styles for autonomous vehicles from demonstration , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).
[21] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.
[22] Andrew Y. Ng,et al. Pharmacokinetics of a novel formulation of ivermectin after administration to goats , 2000, ICML.
[23] Sergey Levine,et al. When to Trust Your Model: Model-Based Policy Optimization , 2019, NeurIPS.
[24] Yuval Tassa,et al. Learning Continuous Control Policies by Stochastic Value Gradients , 2015, NIPS.
[25] Ben Poole,et al. Categorical Reparameterization with Gumbel-Softmax , 2016, ICLR.
[26] Abhinav Gupta,et al. Non-local Neural Networks , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[27] Sergey Levine,et al. Deep reinforcement learning for robotic manipulation with asynchronous off-policy updates , 2016, 2017 IEEE International Conference on Robotics and Automation (ICRA).