[1] Michael H. Bowling, et al. Apprenticeship learning using linear programming, 2008, ICML '08.
[2] Yoshua Bengio, et al. Generative Adversarial Nets, 2014, NIPS.
[3] Sergey Levine, et al. Nonlinear Inverse Reinforcement Learning with Gaussian Processes, 2011, NIPS.
[4] Ronald J. Williams, et al. Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning, 1992, Machine Learning.
[5] Yishay Mansour, et al. Policy Gradient Methods for Reinforcement Learning with Function Approximation, 1999, NIPS.
[6] David Silver, et al. Learning to search: Functional gradient techniques for imitation learning, 2009, Auton. Robots.
[7] Robert E. Schapire, et al. A Game-Theoretic Approach to Apprenticeship Learning, 2007, NIPS.
[8] Robert E. Schapire, et al. A Reduction from Apprenticeship Learning to Classification, 2010, NIPS.
[9] J. Andrew Bagnell, et al. Maximum margin planning, 2006, ICML.
[10] Stuart J. Russell. Learning agents for uncertain environments (extended abstract), 1998, COLT '98.
[11] John Langford, et al. Search-based structured prediction, 2009, Machine Learning.
[12] J. Andrew Bagnell, et al. An Invitation to Imitation, 2015.
[13] Sergey Levine, et al. Trust Region Policy Optimization, 2015, ICML.
[14] Stefan Schaal, et al. Reinforcement learning of motor skills with policy gradients, 2008, Neural Networks.
[15] J. Andrew Bagnell, et al. Efficient Reductions for Imitation Learning, 2010, AISTATS.
[16] Sergey Levine, et al. Continuous Inverse Optimal Control with Locally Optimal Examples, 2012, ICML.
[17] Dean Pomerleau, et al. Efficient Training of Artificial Neural Networks for Autonomous Navigation, 1991, Neural Computation.
[18] John Langford, et al. Approximately Optimal Approximate Reinforcement Learning, 2002, ICML.
[19] Anind K. Dey, et al. Maximum Entropy Inverse Reinforcement Learning, 2008, AAAI.
[20] Stefano Ermon, et al. Learning Large-Scale Dynamic Discrete Choice Models of Spatio-Temporal Preferences with Application to Migratory Pastoralism in East Africa, 2015, AAAI.
[21] Andrew Y. Ng, et al. Algorithms for Inverse Reinforcement Learning, 2000, ICML.
[22] Pieter Abbeel, et al. Apprenticeship learning via inverse reinforcement learning, 2004, ICML.
[23] Joelle Pineau, et al. Learning from Limited Demonstrations, 2013, NIPS.
[24] Csaba Szepesvári, et al. Training parsers by inverse reinforcement learning, 2009, Machine Learning.
[25] Geoffrey J. Gordon, et al. A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning, 2010, AISTATS.