Conditions for the convergence of one-layer networks under reinforcement learning
[1] P. Anandan,et al. Pattern-recognizing stochastic learning automata , 1985, IEEE Transactions on Systems, Man, and Cybernetics.
[2] Richard S. Sutton,et al. Temporal credit assignment in reinforcement learning , 1984 .
[3] Geoffrey E. Hinton,et al. Learning internal representations by error propagation , 1986 .
[4] M. Raibert. Analytical equations vs. table look-up for manipulation: A unifying concept , 1977, 1977 IEEE Conference on Decision and Control including the 16th Symposium on Adaptive Processes and A Special Symposium on Fuzzy Set Theory and Applications.
[5] M. Minsky. The Society of Mind , 1986 .
[6] F. Downton. Stochastic Approximation , 1969, Nature.
[7] A. G. Barto,et al. Learning by statistical cooperation of self-interested neuron-like computing elements , 1985, Human neurobiology.
[8] P. Anandan,et al. Cooperativity in Networks of Pattern Recognizing Stochastic Learning Automata , 1986 .
[9] Charles W. Anderson,et al. Learning and problem-solving with multilayer connectionist systems (adaptive, strategy learning, neural networks, reinforcement learning) , 1986 .
[10] Kumpati S. Narendra,et al. Learning automata - an introduction , 1989 .
[11] R. Kashyap,et al. Stochastic Approximation , 1970 .
[12] S. Lakshmivarahan,et al. Learning Algorithms Theory and Applications , 1981 .
[13] David Zipser,et al. Feature Discovery by Competitive Learning , 1985, Cogn. Sci..