Simultaneous learning of situation classification based on rewards and behavior selection based on the situation
暂无分享,去创建一个
[1] Seiji Yamada. Reactive Planning with Uncertainty of a Plan , 1992, Proceedings of the Third Annual Conference of AI, Simulation, and Planning in High Autonomy Systems 'Integrating Perception, Planning and Action'..
[2] Minoru Asada,et al. Non-Physical Intervention in Robot Learning Based on LfE Method , 1995 .
[3] Luis Moreno,et al. Learning emergent tasks for an autonomous mobile robot , 1994, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS'94).
[4] Sridhar Mahadevan,et al. Automatic Programming of Behavior-Based Robots Using Reinforcement Learning , 1991, Artif. Intell..
[5] Hiroshi Ishiguro,et al. Robot oriented state space construction , 1996, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems. IROS '96.
[6] G. Tesauro. Practical Issues in Temporal Difference Learning , 1992 .
[7] Setsuo Ohsuga,et al. Articulation problem—a basic problem for information modelling , 1990 .
[8] Leslie Pack Kaelbling,et al. Input Generalization in Delayed Reinforcement Learning: An Algorithm and Performance Comparisons , 1991, IJCAI.