暂无分享,去创建一个
[1] John Schulman,et al. Concrete Problems in AI Safety , 2016, ArXiv.
[2] Nan Jiang,et al. Repeated Inverse Reinforcement Learning , 2017, NIPS.
[3] Yoshua Bengio,et al. Generative Adversarial Nets , 2014, NIPS.
[4] Sergey Levine,et al. Trust Region Policy Optimization , 2015, ICML.
[5] Eiji Uchibe,et al. Model-Free Deep Inverse Reinforcement Learning by Logistic Regression , 2018, Neural Processing Letters.
[6] Sergey Levine,et al. Guided Cost Learning: Deep Inverse Optimal Control via Policy Optimization , 2016, ICML.
[7] David M. Bradley,et al. Boosting Structured Prediction for Imitation Learning , 2006, NIPS.
[8] Pieter Abbeel,et al. Apprenticeship learning via inverse reinforcement learning , 2004, ICML.
[9] J. Andrew Bagnell,et al. Maximum margin planning , 2006, ICML.
[10] Anind K. Dey,et al. Maximum Entropy Inverse Reinforcement Learning , 2008, AAAI.
[11] Sergey Levine,et al. Nonlinear Inverse Reinforcement Learning with Gaussian Processes , 2011, NIPS.
[12] Honglak Lee,et al. Control of Memory, Active Perception, and Action in Minecraft , 2016, ICML.
[13] Sergey Levine,et al. End-to-End Training of Deep Visuomotor Policies , 2015, J. Mach. Learn. Res..
[14] J. Andrew Bagnell,et al. Modeling Purposeful Adaptive Behavior with the Principle of Maximum Causal Entropy , 2010 .
[15] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[16] Sergey Levine,et al. Reinforcement Learning with Deep Energy-Based Policies , 2017, ICML.
[17] Csaba Szepesvári,et al. Algorithms for Reinforcement Learning , 2010, Synthesis Lectures on Artificial Intelligence and Machine Learning.
[18] Andrew Y. Ng,et al. Policy Invariance Under Reward Transformations: Theory and Application to Reward Shaping , 1999, ICML.
[19] Sergey Levine,et al. A Connection between Generative Adversarial Networks, Inverse Reinforcement Learning, and Energy-Based Models , 2016, ArXiv.
[20] Jan Peters,et al. Relative Entropy Inverse Reinforcement Learning , 2011, AISTATS.
[21] Markus Wulfmeier,et al. Maximum Entropy Deep Inverse Reinforcement Learning , 2015, 1507.04888.
[22] Brett Browning,et al. A survey of robot learning from demonstration , 2009, Robotics Auton. Syst..
[23] Stefano Ermon,et al. Generative Adversarial Imitation Learning , 2016, NIPS.