A Bayesian Approach to Generative Adversarial Imitation Learning
暂无分享,去创建一个
[1] Yuval Tassa,et al. Maximum a Posteriori Policy Optimisation , 2018, ICLR.
[2] Stefano Ermon,et al. Model-Free Imitation Learning with Policy Optimization , 2016, ICML.
[3] Qiang Liu,et al. A Kernelized Stein Discrepancy for Goodness-of-fit Tests , 2016, ICML.
[4] Stefano Ermon,et al. InfoGAIL: Interpretable Imitation Learning from Visual Demonstrations , 2017, NIPS.
[5] Yoshua Bengio,et al. Generative Adversarial Nets , 2014, NIPS.
[6] Yang Liu,et al. Stein Variational Policy Gradient , 2017, UAI.
[7] Stuart J. Russell. Learning agents for uncertain environments (extended abstract) , 1998, COLT' 98.
[8] Nando de Freitas,et al. Robust Imitation of Diverse Behaviors , 2017, NIPS.
[9] Kee-Eung Kim,et al. Imitation Learning via Kernel Mean Embedding , 2018, AAAI.
[10] Richard S. Zemel,et al. Generative Moment Matching Networks , 2015, ICML.
[11] Dilin Wang,et al. Stein Variational Gradient Descent: A General Purpose Bayesian Inference Algorithm , 2016, NIPS.
[12] J. Andrew Bagnell,et al. Maximum margin planning , 2006, ICML.
[13] Yuval Tassa,et al. MuJoCo: A physics engine for model-based control , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.
[14] Gerhard Neumann,et al. Variational Inference for Policy Search in changing situations , 2011, ICML.
[15] Csaba Szepesvári,et al. Apprenticeship Learning using Inverse Reinforcement Learning and Gradient Methods , 2007, UAI.
[16] Dean Pomerleau,et al. Efficient Training of Artificial Neural Networks for Autonomous Navigation , 1991, Neural Computation.
[17] Eyal Amir,et al. Bayesian Inverse Reinforcement Learning , 2007, IJCAI.
[18] Marc Toussaint,et al. Robot trajectory optimization using approximate inference , 2009, ICML '09.
[19] Sergey Levine,et al. Trust Region Policy Optimization , 2015, ICML.
[20] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[21] J A Bagnell,et al. An Invitation to Imitation , 2015 .
[22] Sergey Levine,et al. A Connection between Generative Adversarial Networks, Inverse Reinforcement Learning, and Energy-Based Models , 2016, ArXiv.
[23] Martín Abadi,et al. TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems , 2016, ArXiv.
[24] Kee-Eung Kim,et al. MAP Inference for Bayesian Inverse Reinforcement Learning , 2011, NIPS.
[25] Andrew Y. Ng,et al. Pharmacokinetics of a novel formulation of ivermectin after administration to goats , 2000, ICML.
[26] Stefano Ermon,et al. Generative Adversarial Imitation Learning , 2016, NIPS.
[27] Andrew Gordon Wilson,et al. Bayesian GAN , 2017, NIPS.
[28] Michael H. Bowling,et al. Apprenticeship learning using linear programming , 2008, ICML '08.
[29] Sergey Levine,et al. Learning Robust Rewards with Adversarial Inverse Reinforcement Learning , 2017, ICLR 2017.
[30] Anind K. Dey,et al. Maximum Entropy Inverse Reinforcement Learning , 2008, AAAI.