Modeling reward functions for incomplete state representations via echo state networks
[1] A. P. Wieland,et al. Evolving neural network controllers for unstable systems , 1991, IJCNN-91-Seattle International Joint Conference on Neural Networks.
[2] P. J. Werbos. Backpropagation Through Time: What It Does and How to Do It , 1990 .
[3] Herbert Jaeger,et al. The "echo state" approach to analysing and training recurrent neural networks , 2001 .
[4] Herbert Jaeger,et al. A tutorial on training recurrent neural networks, covering BPPT, RTRL, EKF and the "echo state network" approach , 2005 .
[5] Stuart E. Dreyfus,et al. On using discretized Cohen-Grossberg node dynamics for model-free actor-critic neural learning in non-Markovian domains , 2003, Proceedings 2003 IEEE International Symposium on Computational Intelligence in Robotics and Automation. Computational Intelligence in Robotics and Automation for the New Millennium (Cat. No.03EX694).
[6] Jennie Si,et al. Backpropagation Through Time and Derivative Adaptive Critics: A Common Framework for Comparison , 2004 .
[7] Eiji Mizutani,et al. Two stochastic dynamic programming problems by model-free actor-critic recurrent-network learning in non-Markovian settings , 2004, 2004 IEEE International Joint Conference on Neural Networks (IEEE Cat. No.04CH37541).
[8] Steven Seidman,et al. A synthesis of reinforcement learning and robust control theory , 2000 .
[9] Paul-Gerhard Plöger,et al. Echo State Networks for Mobile Robot Modeling and Control , 2003, RoboCup.
[10] Henry Markram,et al. Real-Time Computing Without Stable States: A New Framework for Neural Computation Based on Perturbations , 2002, Neural Computation.
[11] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[12] Jilles Vreeken,et al. On real-world temporal pattern recognition using Liquid State Machines , 2003 .