暂无分享,去创建一个
[1] Stefan Schaal,et al. Robot Programming by Demonstration , 2009, Springer Handbook of Robotics.
[2] Peter Stone,et al. Generative Adversarial Imitation from Observation , 2018, ArXiv.
[3] J. Andrew Bagnell,et al. Modeling Purposeful Adaptive Behavior with the Principle of Maximum Causal Entropy , 2010 .
[4] Demis Hassabis,et al. Mastering the game of Go with deep neural networks and tree search , 2016, Nature.
[5] Brett Browning,et al. A survey of robot learning from demonstration , 2009, Robotics Auton. Syst..
[6] Stefano Ermon,et al. Generative Adversarial Imitation Learning , 2016, NIPS.
[7] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[8] Sergey Levine,et al. Reinforcement Learning with Deep Energy-Based Policies , 2017, ICML.
[9] Sergey Levine,et al. High-Dimensional Continuous Control Using Generalized Advantage Estimation , 2015, ICLR.
[10] Yuval Tassa,et al. Continuous control with deep reinforcement learning , 2015, ICLR.
[11] Dean Pomerleau,et al. Efficient Training of Artificial Neural Networks for Autonomous Navigation , 1991, Neural Computation.
[12] Yoshua Bengio,et al. Generative Adversarial Nets , 2014, NIPS.
[13] Geoffrey J. Gordon,et al. A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning , 2010, AISTATS.
[14] Sergey Levine,et al. A Connection between Generative Adversarial Networks, Inverse Reinforcement Learning, and Energy-Based Models , 2016, ArXiv.
[15] Qiang Liu,et al. Learning Self-Imitating Diverse Policies , 2018, ICLR.
[16] Henk Nijmeijer,et al. Robot Programming by Demonstration , 2010, SIMPAR.
[17] Sergey Levine,et al. Learning Robust Rewards with Adversarial Inverse Reinforcement Learning , 2017, ICLR 2017.
[18] Long Ji Lin,et al. Self-improving reactive agents based on reinforcement learning, planning and teaching , 1992, Machine Learning.
[19] Alec Radford,et al. Proximal Policy Optimization Algorithms , 2017, ArXiv.
[20] Sergey Levine,et al. Learning Invariant Feature Spaces to Transfer Skills with Reinforcement Learning , 2017, ICLR.
[21] Ryuki Tachibana,et al. Internal Model from Observations for Reward Shaping , 2018, ArXiv.
[22] Pieter Abbeel,et al. Third-Person Imitation Learning , 2017, ICLR.
[23] Andrew Y. Ng,et al. Pharmacokinetics of a novel formulation of ivermectin after administration to goats , 2000, ICML.
[24] Peter Stone,et al. Behavioral Cloning from Observation , 2018, IJCAI.
[25] Anind K. Dey,et al. Maximum Entropy Inverse Reinforcement Learning , 2008, AAAI.
[26] Satinder Singh,et al. Generative Adversarial Self-Imitation Learning , 2018, ArXiv.
[27] Sergey Levine,et al. Imitation from Observation: Learning to Imitate Behaviors from Raw Video via Context Translation , 2017, 2018 IEEE International Conference on Robotics and Automation (ICRA).
[28] Byron Boots,et al. Provably Efficient Imitation Learning from Observation Alone , 2019, ICML.
[29] Sergey Levine,et al. End-to-End Training of Deep Visuomotor Policies , 2015, J. Mach. Learn. Res..
[30] Martin L. Puterman,et al. Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .