Module Based Reinforcement Learning: An Application to a Real Robot
暂无分享,去创建一个
[1] Leslie Pack Kaelbling,et al. Learning Policies for Partially Observable Environments: Scaling Up , 1997, ICML.
[2] Maja J. Mataric,et al. Reinforcement Learning in the Multi-Robot Domain , 1997, Auton. Robots.
[3] Rodney A. Brooks,et al. Elephants don't play chess , 1990, Robotics Auton. Syst..
[4] Sebastian Thrun,et al. The role of exploration in learning control , 1992 .
[5] Michael L. Littman,et al. Algorithms for Sequential Decision Making , 1996 .
[6] Andrew G. Barto,et al. Learning to Act Using Real-Time Dynamic Programming , 1995, Artif. Intell..
[7] Sridhar Mahadevan,et al. Automatic Programming of Behavior-Based Robots Using Reinforcement Learning , 1991, Artif. Intell..
[8] Minoru Asada,et al. Behavior coordination for a mobile robot using modular reinforcement learning , 1996, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems. IROS '96.
[9] András Lörincz,et al. Behavior of an Adaptive Self-organizing Autonomous Agent Working with Cues and Competing Concepts , 1993, Adapt. Behav..
[10] Csaba Szepesvári,et al. A Generalized Reinforcement-Learning Model: Convergence and Applications , 1996, ICML.
[11] John N. Tsitsiklis,et al. Analysis of Temporal-Diffference Learning with Function Approximation , 1996, NIPS.
[12] D. J. Hand,et al. Artificial intelligence , 1981, Psychological Medicine.
[13] Michael I. Jordan,et al. Learning Without State-Estimation in Partially Observable Markovian Decision Processes , 1994, ICML.
[14] Pattie Maes,et al. Designing autonomous agents: Theory and practice from biology to engineering and back , 1990, Robotics Auton. Syst..
[15] Z. Kalmar,et al. Generalization in an autonomous agent , 1994, Proceedings of 1994 IEEE International Conference on Neural Networks (ICNN'94).
[16] Richard S. Sutton,et al. Temporal credit assignment in reinforcement learning , 1984 .
[17] Peter Dayan,et al. Q-learning , 1992, Machine Learning.
[18] Michael I. Jordan,et al. MASSACHUSETTS INSTITUTE OF TECHNOLOGY ARTIFICIAL INTELLIGENCE LABORATORY and CENTER FOR BIOLOGICAL AND COMPUTATIONAL LEARNING DEPARTMENT OF BRAIN AND COGNITIVE SCIENCES , 1996 .
[19] R. Bellman. Dynamic programming. , 1957, Science.
[20] Pattie Maes,et al. A bottom-up mechanism for behavior selection in an artificial creature , 1991 .
[21] Richard S. Sutton,et al. Generalization in Reinforcement Learning: Successful Examples Using Sparse Coarse Coding , 1995, NIPS.