Self-Improving Reactive Agents Based on Reinforcement Learning, Planning and Teaching
[1] R. Bellman, et al. Dynamic Programming and Markov Processes, 1960.
[2] Tom M. Mitchell, et al. Generalization as Search, 1982.
[3] Richard S. Sutton, et al. Temporal credit assignment in reinforcement learning, 1984.
[4] Geoffrey E. Hinton, et al. Learning internal representations by error propagation, 1986.
[5] Charles W. Anderson, et al. Strategy Learning with Multilayer Connectionist Representations, 1987.
[6] Dean Pomerleau, et al. ALVINN, an autonomous land vehicle in a neural network, 1989.
[7] D. Ballard, et al. A Role for Anticipation in Reactive Systems that Learn, 1989, ML.
[8] C. Watkins. Learning from delayed rewards, 1989.
[9] Richard S. Sutton, et al. Learning and Sequential Decision Making, 1989.
[10] Ronald J. Williams, et al. A Learning Algorithm for Continually Running Fully Recurrent Neural Networks, 1989, Neural Computation.
[11] Kevin J. Lang. A time delay neural network architecture for speech recognition, 1989.
[12] Sebastian Thrun, et al. Planning with an Adaptive World Model, 1990, NIPS.
[13] Richard S. Sutton, et al. Integrated Architectures for Learning, Planning, and Reacting Based on Approximating Dynamic Programming, 1990, ML.
[14] M. Gabriel, et al. Learning and Computational Neuroscience: Foundations of Adaptive Networks, 1990.
[15] Ming Tan, et al. Learning a Cost-Sensitive Internal Representation for Reinforcement Learning, 1991, ML.
[16] Leslie Pack Kaelbling, et al. Input Generalization in Delayed Reinforcement Learning: An Algorithm and Performance Comparisons, 1991, IJCAI.
[17] Sridhar Mahadevan, et al. Scaling Reinforcement Learning to Robotics by Exploiting the Subsumption Architecture, 1991, ML.
[18] Steven D. Whitehead, et al. Complexity and Cooperation in Q-Learning, 1991, ML.
[19] Long-Ji Lin, et al. Programming Robots Using Reinforcement Learning and Teaching, 1991, AAAI.
[20] Sebastian Thrun, et al. Active Exploration in Dynamic Environments, 1991, NIPS.
[21] Long-Ji Lin, et al. Self-improving reactive agents: case studies of reinforcement learning frameworks, 1991.
[22] Long-Ji Lin, et al. Self-improvement Based on Reinforcement Learning, Planning and Teaching, 1991, ML.
[23] Leslie Pack Kaelbling, et al. Learning in embedded systems, 1993.
[24] John J. Grefenstette, et al. Learning sequential decision rules using simulation models and competition, 1990, Machine Learning.
[25] Dana H. Ballard, et al. Learning to perceive and act by trial and error, 1991, Machine Learning.
[26] Richard S. Sutton, et al. Learning to predict by the methods of temporal differences, 1988, Machine Learning.