Vision-based reinforcement learning for purposive behavior acquisition
Minoru Asada | Koh Hosoda | Shoichi Noda | Sukoya Tawaratsumida
[1] Editors, 1986, Brain Research Bulletin.
[2] Dana H. Ballard, et al. Active Perception and Reinforcement Learning, 1990, Neural Computation.
[3] Leslie Pack Kaelbling, et al. Input Generalization in Delayed Reinforcement Learning: An Algorithm and Performance Comparisons, 1991, IJCAI.
[4] Steven D. Whitehead, et al. A Complexity Analysis of Cooperative Mechanisms in Reinforcement Learning, 1991, AAAI.
[5] Sridhar Mahadevan, et al. Automatic Programming of Behavior-Based Robots Using Reinforcement Learning, 1991, Artif. Intell.
[6] Sridhar Mahadevan, et al. Robot Learning, 1993.
[7] Leslie Pack Kaelbling, et al. Learning to Achieve Goals, 1993, IJCAI.
[8] Masayuki Inaba, et al. Remote-Brained Robotics: Interfacing AI with Real World Behaviors, 1993.
[9] Sridhar Mahadevan, et al. Rapid Task Learning for Real Robots, 1993.
[10] George A. Bekey, et al. A reinforcement-learning approach to reactive control policy design for autonomous robots, 1994, Proceedings of the 1994 IEEE International Conference on Robotics and Automation.
[11] Fuminori Saito, et al. Learning architecture for real robotic systems-extension of connectionist Q-learning for continuous robot control domain, 1994, Proceedings of the 1994 IEEE International Conference on Robotics and Automation.
[12] Ben J. A. Kröse, et al. Learning from delayed rewards, 1995, Robotics Auton. Syst.