State generalization method based on uncertainty minimization of behavior outcomes for reactive agents

Autonomous state generalization is one of the key issues in the behavior acquisition problem of reactive agents. The paper proposes a novel state generalization method which discretizes the continuous sensor space based on entropy minimization of agents' behavior outcomes. This general framework unifies the heuristic criteria for state generalization used in conventional works. An experimental study in the latter part suggests that our method increases the adaptability of agents to the environment and improves the overall behavior performance.

[1]  Minoru Asada,et al.  Action-based sensor space categorization for robot learning , 1996, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems. IROS '96.

[2]  Hiroshi Ishiguro,et al.  Robot oriented state space construction , 1996, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems. IROS '96.

[3]  Leslie Pack Kaelbling,et al.  Input Generalization in Delayed Reinforcement Learning: An Algorithm and Performance Comparisons , 1991, IJCAI.

[4]  Shinichi Nakasuka,et al.  Simultaneous learning of situation classification based on rewards and behavior selection based on the situation , 1996, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems. IROS '96.

[5]  Minoru Asada,et al.  Reasonable performance in less learning time by real robot based on incremental state space segmentation , 1996, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems. IROS '96.

[6]  Andrew W. Moore,et al.  The parti-game algorithm for variable resolution reinforcement learning in multidimensional state-spaces , 2004, Machine Learning.

[7]  A. Meystel,et al.  Multiresolutional intelligent controller for baby robot , 1995, Proceedings of Tenth International Symposium on Intelligent Control.