Multi-time Models for Temporally Abstract Planning
[1] Earl David Sacerdoti, et al. A Structure for Plans and Behavior, 1977.
[2] R. Korf. Learning to solve problems by searching for macro-operators, 1983.
[3] Geoffrey E. Hinton, et al. Feudal Reinforcement Learning, 1992, NIPS.
[4] Satinder P. Singh, et al. Scaling Reinforcement Learning Algorithms by Learning Variable Temporal Resolution Models, 1992, ML.
[5] C. Atkeson, et al. Prioritized Sweeping: Reinforcement Learning with Less Data and Less Time, 1993, Machine Learning.
[6] Peter Dayan, et al. Improving Generalization for Temporal Difference Learning: The Successor Representation, 1993, Neural Computation.
[7] Jing Peng, et al. Efficient Learning and Planning Within the Dyna Framework, 1993, Adapt. Behav.
[8] Leslie Pack Kaelbling, et al. Hierarchical Learning in Stochastic Domains: Preliminary Results, 1993, ICML.
[9] Richard S. Sutton, et al. TD Models: Modeling the World at a Mixture of Time Scales, 1995, ICML.
[10] Ben J. A. Kröse, et al. Learning from delayed rewards, 1995, Robotics Auton. Syst.
[11] Richard S. Sutton, et al. Introduction to Reinforcement Learning, 1998.