Integrated Architectures for Learning, Planning, and Reacting Based on Approximating Dynamic Programming
暂无分享,去创建一个
[1] Richard S. Sutton,et al. Dyna, an integrated architecture for learning, planning, and reacting , 1990, SGAR.
[2] Richard S. Sutton,et al. Planning by Incremental Dynamic Programming , 1991, ML.
[3] Richard E. Korf,et al. Real-Time Heuristic Search , 1990, Artif. Intell..
[4] Michael C. Mozer,et al. Discovering the Structure of a Reactive Environment by Exploration , 1990, Neural Computation.
[5] Stuart J. Russell. Execution Architectures and Compilation , 1989, IJCAI.
[6] Richard S. Sutton,et al. Sequential Decision Problems and Neural Networks , 1989, NIPS 1989.
[7] A. Barto,et al. Learning and Sequential Decision Making , 1989 .
[8] Robert E. Schapire,et al. A new approach to unsupervised learning in deterministic environments , 1990 .
[9] Charles W. Anderson,et al. Strategy Learning with Multilayer Connectionist Representations , 1987 .
[10] Paul J. Werbos,et al. Building and Understanding Adaptive Systems: A Statistical/Numerical Approach to Factory Automation and Brain Research , 1987, IEEE Transactions on Systems, Man, and Cybernetics.
[11] Geoffrey E. Hinton,et al. Schemata and Sequential Thought Processes in PDP Models , 1986 .
[12] Richard S. Sutton,et al. Temporal credit assignment in reinforcement learning , 1984 .
[13] Richard S. Sutton,et al. Neuronlike adaptive elements that can solve difficult learning control problems , 1983, IEEE Transactions on Systems, Man, and Cybernetics.
[14] D. Dennett. Why the Law of Effect will not Go Away , 1975 .
[15] R. Howard. Dynamic Programming and Markov Processes , 1960 .
[16] W. H. F. Barnes. The Nature of Explanation , 1944, Nature.