Planning by Incremental Dynamic Programming
暂无分享,去创建一个
[1] Rick L. Riolo,et al. Lookahead planning and latent learning in a classifier system , 1991 .
[2] Long-Ji Lin,et al. Self-improving reactive agents: case studies of reinforcement learning frameworks , 1991 .
[3] Jean-Arcady Meyer,et al. Lookahead Planning and Latent Learning in a Classifier System , 1991 .
[4] Jean-Arcady Meyer,et al. Self-improving Reactive Agents: Case Studies of Reinforcement Learning Frameworks , 1991 .
[5] John E. Laird,et al. Integrating, Execution, Planning, and Learning in Soar for External Environments , 1990, AAAI.
[6] Tom M. Mitchell,et al. Becoming Increasingly Reactive , 1990, AAAI.
[7] Dana H. Ballard,et al. Active Perception and Reinforcement Learning , 1990, Neural Computation.
[8] Richard S. Sutton,et al. Integrated Architectures for Learning, Planning, and Reacting Based on Approximating Dynamic Programming , 1990, ML.
[9] Richard E. Korf,et al. Real-Time Heuristic Search , 1990, Artif. Intell..
[10] Andrew W. Moore,et al. Efficient memory-based learning for robot control , 1990 .
[11] Stuart J. Russell. Execution Architectures and Compilation , 1989, IJCAI.
[12] A. Barto,et al. Learning and Sequential Decision Making , 1989 .
[13] Philip E. Agre,et al. The dynamic structure of everyday life , 1988 .
[14] Mark S. Boddy,et al. An Analysis of Time-Dependent Planning , 1988, AAAI.
[15] Marcel Schoppers,et al. Universal Plans for Reactive Robots in Unpredictable Environments , 1987, IJCAI.
[16] Dimitri P. Bertsekas,et al. Dynamic Programming: Deterministic and Stochastic Models , 1987 .
[17] Paul J. Werbos,et al. Building and Understanding Adaptive Systems: A Statistical/Numerical Approach to Factory Automation and Brain Research , 1987, IEEE Transactions on Systems, Man, and Cybernetics.
[18] John H. Holland,et al. Escaping brittleness: the possibilities of general-purpose learning algorithms applied to parallel rule-based systems , 1995 .
[19] Arthur L. Samuel,et al. Some Studies in Machine Learning Using the Game of Checkers , 1967, IBM J. Res. Dev..