Purposive behavior acquisition for a real robot by vision-based reinforcement learning
Minoru Asada | Shoichi Noda | Koh Hosoda | Sukoya Tawaratsumida
[1] C. Watkins. Learning from delayed rewards, 1989.
[2] Dana H. Ballard, et al. Active Perception and Reinforcement Learning, 1990, Neural Computation.
[3] Leslie Pack Kaelbling, et al. Input Generalization in Delayed Reinforcement Learning: An Algorithm and Performance Comparisons, 1991, IJCAI.
[4] Steven D. Whitehead, et al. A Complexity Analysis of Cooperative Mechanisms in Reinforcement Learning, 1991, AAAI.
[5] Sridhar Mahadevan, et al. Automatic Programming of Behavior-Based Robots Using Reinforcement Learning, 1991, Artif. Intell.
[6] Sridhar Mahadevan, et al. Robot Learning, 1993.
[7] Leslie Pack Kaelbling, et al. Learning to Achieve Goals, 1993, IJCAI.
[8] Masayuki Inaba, et al. Remote-Brained Robotics: Interfacing AI with Real World Behaviors, 1993.
[9] Sridhar Mahadevan, et al. Rapid Task Learning for Real Robots, 1993.
[10] Dean A. Pomerleau, et al. Knowledge-Based Training of Artificial Neural Networks for Autonomous Robot Driving, 1993.
[11] George A. Bekey, et al. A reinforcement-learning approach to reactive control policy design for autonomous robots, 1994, Proceedings of the 1994 IEEE International Conference on Robotics and Automation.
[12] Maja J. Mataric, et al. Reward Functions for Accelerated Learning, 1994, ICML.
[13] Fuminori Saito, et al. Learning architecture for real robotic systems-extension of connectionist Q-learning for continuous robot control domain, 1994, Proceedings of the 1994 IEEE International Conference on Robotics and Automation.
[14] Ben J. A. Kröse, et al. Learning from delayed rewards, 1995, Robotics Auton. Syst.
[15] Long Ji Lin, et al. Self-improving reactive agents based on reinforcement learning, planning and teaching, 1992, Machine Learning.