Integrated Architectures for Learning, Planning, and Reacting Based on Approximating Dynamic Programming

[1]  Richard S. Sutton,et al.  Dyna, an integrated architecture for learning, planning, and reacting , 1990, SGAR.

[2]  Richard S. Sutton,et al.  Planning by Incremental Dynamic Programming , 1991, ML.

[3]  Richard E. Korf,et al.  Real-Time Heuristic Search , 1990, Artif. Intell..

[4]  Michael C. Mozer,et al.  Discovering the Structure of a Reactive Environment by Exploration , 1990, Neural Computation.

[5]  Stuart J. Russell Execution Architectures and Compilation , 1989, IJCAI.

[6]  Richard S. Sutton,et al.  Sequential Decision Problems and Neural Networks , 1989, NIPS 1989.

[7]  A. Barto,et al.  Learning and Sequential Decision Making , 1989 .

[8]  Robert E. Schapire,et al.  A new approach to unsupervised learning in deterministic environments , 1990 .

[9]  Charles W. Anderson,et al.  Strategy Learning with Multilayer Connectionist Representations , 1987 .

[10]  Paul J. Werbos,et al.  Building and Understanding Adaptive Systems: A Statistical/Numerical Approach to Factory Automation and Brain Research , 1987, IEEE Transactions on Systems, Man, and Cybernetics.

[11]  Geoffrey E. Hinton,et al.  Schemata and Sequential Thought Processes in PDP Models , 1986 .

[12]  Richard S. Sutton,et al.  Temporal credit assignment in reinforcement learning , 1984 .

[13]  Richard S. Sutton,et al.  Neuronlike adaptive elements that can solve difficult learning control problems , 1983, IEEE Transactions on Systems, Man, and Cybernetics.

[14]  D. Dennett Why the Law of Effect will not Go Away , 1975 .

[15]  R. Howard Dynamic Programming and Markov Processes , 1960 .

[16]  W. H. F. Barnes The Nature of Explanation , 1944, Nature.