Applying the Episodic Natural Actor-Critic Architecture to Motor Primitive Learning
暂无分享,去创建一个
[1] Shun-ichi Amari,et al. Natural Gradient Works Efficiently in Learning , 1998, Neural Computation.
[2] Yishay Mansour,et al. Policy Gradient Methods for Reinforcement Learning with Function Approximation , 1999, NIPS.
[3] Sham M. Kakade,et al. A Natural Policy Gradient , 2001, NIPS.
[4] Jun Nakanishi,et al. Learning rhythmic movements by demonstration using nonlinear oscillators , 2002, IEEE/RSJ International Conference on Intelligent Robots and Systems.
[5] Stefan Schaal,et al. Reinforcement Learning for Humanoid Robotics , 2003 .
[6] Jeff G. Schneider,et al. Covariant policy search , 2003, IJCAI 2003.
[7] Jan Wessnitzer,et al. ESANN'2007 proceedings - European Symposium on Artificial Neural Networks , 2007 .