论文信息 - Integrated Architectures for Learning, Planning, and Reacting Based on Approximating Dynamic Programming - 字舞流文

Integrated Architectures for Learning, Planning, and Reacting Based on Approximating Dynamic Programming

Richard S. Sutton | R. Sutton

[1] Richard S. Sutton,et al. Dyna, an integrated architecture for learning, planning, and reacting , 1990, SGAR.

[2] Richard S. Sutton,et al. Planning by Incremental Dynamic Programming , 1991, ML.

[3] Richard E. Korf,et al. Real-Time Heuristic Search , 1990, Artif. Intell..

[4] Michael C. Mozer,et al. Discovering the Structure of a Reactive Environment by Exploration , 1990, Neural Computation.

[5] Stuart J. Russell. Execution Architectures and Compilation , 1989, IJCAI.

[6] Richard S. Sutton,et al. Sequential Decision Problems and Neural Networks , 1989, NIPS 1989.

[7] A. Barto,et al. Learning and Sequential Decision Making , 1989 .

[8] Robert E. Schapire,et al. A new approach to unsupervised learning in deterministic environments , 1990 .

[9] Charles W. Anderson,et al. Strategy Learning with Multilayer Connectionist Representations , 1987 .

[10] Paul J. Werbos,et al. Building and Understanding Adaptive Systems: A Statistical/Numerical Approach to Factory Automation and Brain Research , 1987, IEEE Transactions on Systems, Man, and Cybernetics.

[11] Geoffrey E. Hinton,et al. Schemata and Sequential Thought Processes in PDP Models , 1986 .

[12] Richard S. Sutton,et al. Temporal credit assignment in reinforcement learning , 1984 .

[13] Richard S. Sutton,et al. Neuronlike adaptive elements that can solve difficult learning control problems , 1983, IEEE Transactions on Systems, Man, and Cybernetics.

[14] D. Dennett. Why the Law of Effect will not Go Away , 1975 .

[15] R. Howard. Dynamic Programming and Markov Processes , 1960 .

[16] W. H. F. Barnes. The Nature of Explanation , 1944, Nature.