暂无分享,去创建一个
[1] Florentin Wörgötter,et al. Distributed recurrent neural forward models with synaptic adaptation and CPG-based control for complex behaviors of walking robots , 2015, Front. Neurorobot..
[2] Yuval Tassa,et al. Continuous control with deep reinforcement learning , 2015, ICLR.
[3] Nando de Freitas,et al. Robust Imitation of Diverse Behaviors , 2017, NIPS.
[4] Dean Pomerleau,et al. Efficient Training of Artificial Neural Networks for Autonomous Navigation , 1991, Neural Computation.
[5] Markus Wulfmeier,et al. Maximum Entropy Deep Inverse Reinforcement Learning , 2015, 1507.04888.
[6] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[7] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.
[8] Stefano Ermon,et al. Generative Adversarial Imitation Learning , 2016, NIPS.
[9] Stefan Schaal,et al. Learning from Demonstration , 1996, NIPS.
[10] Peter Stone,et al. Behavioral Cloning from Observation , 2018, IJCAI.
[11] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[12] Ming Yang,et al. 3D Convolutional Neural Networks for Human Action Recognition , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[13] Alec Radford,et al. Proximal Policy Optimization Algorithms , 2017, ArXiv.
[14] Andrew Y. Ng,et al. Pharmacokinetics of a novel formulation of ivermectin after administration to goats , 2000, ICML.
[15] Pieter Abbeel,et al. Apprenticeship learning via inverse reinforcement learning , 2004, ICML.
[16] Eric Wiewiora,et al. Potential-Based Shaping and Q-Value Initialization are Equivalent , 2003, J. Artif. Intell. Res..
[17] Alexei A. Efros,et al. Curiosity-Driven Exploration by Self-Supervised Prediction , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).
[18] Marcin Andrychowicz,et al. One-Shot Imitation Learning , 2017, NIPS.
[19] Daiki Kimura,et al. DAQN: Deep Auto-encoder and Q-Network , 2018, ArXiv.
[20] Alex Graves,et al. Asynchronous Methods for Deep Reinforcement Learning , 2016, ICML.
[21] Sonia Chernova,et al. Reinforcement Learning from Demonstration through Shaping , 2015, IJCAI.
[22] Geoffrey E. Hinton,et al. Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.
[23] Sonia Chernova,et al. Learning from Demonstration for Shaping through Inverse Reinforcement Learning , 2016, AAMAS.
[24] Bram Bakker,et al. Reinforcement Learning with Long Short-Term Memory , 2001, NIPS.
[25] Sergey Levine,et al. Trust Region Policy Optimization , 2015, ICML.
[26] Shie Mannor,et al. End-to-End Differentiable Adversarial Imitation Learning , 2017, ICML.
[27] R. Mazo. On the theory of brownian motion , 1973 .
[28] Anind K. Dey,et al. Maximum Entropy Inverse Reinforcement Learning , 2008, AAAI.