Robust Imitation of Diverse Behaviors
暂无分享,去创建一个
Nando de Freitas | Ziyu Wang | Nicolas Heess | Josh Merel | Scott E. Reed | Gregory Wayne | Ziyun Wang | N. Heess | Greg Wayne | J. Merel | N. D. Freitas
[1] Michiel van de Panne,et al. Synthesis of Controllers for Stylized Planar Bipedal Walking , 2005, Proceedings of the 2005 IEEE International Conference on Robotics and Automation.
[2] Alex Graves,et al. Conditional Image Generation with PixelCNN Decoders , 2016, NIPS.
[3] Yuval Tassa,et al. Continuous control with deep reinforcement learning , 2015, ICLR.
[4] Daan Wierstra,et al. Stochastic Backpropagation and Approximate Inference in Deep Generative Models , 2014, ICML.
[5] Stefan Schaal,et al. Robot Programming by Demonstration , 2009, Springer Handbook of Robotics.
[6] Dean Pomerleau,et al. Efficient Training of Artificial Neural Networks for Autonomous Navigation , 1991, Neural Computation.
[7] Yoshua Bengio,et al. Generative Adversarial Nets , 2014, NIPS.
[8] Ian J. Goodfellow,et al. NIPS 2016 Tutorial: Generative Adversarial Networks , 2016, ArXiv.
[9] Sergey Levine,et al. Trust Region Policy Optimization , 2015, ICML.
[10] Razvan Pascanu,et al. Policy Distillation , 2015, ICLR.
[11] Shie Mannor,et al. Model-based Adversarial Imitation Learning , 2016, ArXiv.
[12] KangKang Yin,et al. SIMBICON: simple biped locomotion control , 2007, ACM Trans. Graph..
[13] Simon Osindero,et al. Conditional Generative Adversarial Nets , 2014, ArXiv.
[14] Raymond Y. K. Lau,et al. Least Squares Generative Adversarial Networks , 2016, 2017 IEEE International Conference on Computer Vision (ICCV).
[15] Stefan Schaal,et al. Robot Program 59. Robot Programming by Demonstration , 2008 .
[16] Guo-Jun Qi,et al. Loss-Sensitive Generative Adversarial Networks on Lipschitz Densities , 2017, International Journal of Computer Vision.
[17] Kuldip K. Paliwal,et al. Bidirectional recurrent neural networks , 1997, IEEE Trans. Signal Process..
[18] Pieter Abbeel,et al. Third-Person Imitation Learning , 2017, ICLR.
[19] Jehee Lee,et al. Simulating biped behaviors from human motion data , 2007, SIGGRAPH 2007.
[20] Heiga Zen,et al. WaveNet: A Generative Model for Raw Audio , 2016, SSW.
[21] J. Andrew Bagnell,et al. Efficient Reductions for Imitation Learning , 2010, AISTATS.
[22] Mykel J. Kochenderfer,et al. Imitating driver behavior with generative adversarial networks , 2017, 2017 IEEE Intelligent Vehicles Symposium (IV).
[23] Mohamed Medhat Gaber,et al. Imitation Learning , 2017, ACM Comput. Surv..
[24] Yoshua Bengio,et al. Boundary-Seeking Generative Adversarial Networks , 2017, ICLR 2017.
[25] Brett Browning,et al. A survey of robot learning from demonstration , 2009, Robotics Auton. Syst..
[26] Stefano Ermon,et al. Generative Adversarial Imitation Learning , 2016, NIPS.
[27] Léon Bottou,et al. Wasserstein GAN , 2017, ArXiv.
[28] Stefano Ermon,et al. Inferring The Latent Structure of Human Decision-Making from Raw Visual Inputs , 2017, NIPS 2017.
[29] Gaurav S. Sukhatme,et al. Multi-Modal Imitation Learning from Unstructured Demonstrations using Generative Adversarial Nets , 2017, NIPS.
[30] M. V. D. Panne,et al. SIMBICON: simple biped locomotion control , 2007, SIGGRAPH 2007.
[31] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.
[32] Shakir Mohamed,et al. Variational Approaches for Auto-Encoding Generative Adversarial Networks , 2017, ArXiv.
[33] Zoran Popović,et al. Contact-aware nonlinear control of dynamic characters , 2009, SIGGRAPH 2009.
[34] Yuval Tassa,et al. MuJoCo: A physics engine for model-based control , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.
[35] Geoffrey E. Hinton,et al. Visualizing Data using t-SNE , 2008 .
[36] Jürgen Schmidhuber,et al. Bidirectional LSTM Networks for Improved Phoneme Classification and Recognition , 2005, ICANN.
[37] Stefano Ermon,et al. InfoGAIL: Interpretable Imitation Learning from Visual Demonstrations , 2017, NIPS.
[38] Guy Lever,et al. Deterministic Policy Gradient Algorithms , 2014, ICML.
[39] Matthias Bethge,et al. A note on the evaluation of generative models , 2015, ICLR.
[40] Yuval Tassa,et al. Learning human behaviors from motion capture by adversarial imitation , 2017, ArXiv.
[41] Marcin Andrychowicz,et al. One-Shot Imitation Learning , 2017, NIPS.
[42] David Berthelot,et al. BEGAN: Boundary Equilibrium Generative Adversarial Networks , 2017, ArXiv.
[43] Max Welling,et al. Auto-Encoding Variational Bayes , 2013, ICLR.
[44] Yiannis Demiris,et al. MAGAN: Margin Adaptation for Generative Adversarial Networks , 2017, ArXiv.
[45] Geoffrey J. Gordon,et al. A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning , 2010, AISTATS.
[46] Glen Berseth,et al. DeepLoco: dynamic locomotion skills using hierarchical deep reinforcement learning , 2017, ACM Trans. Graph..