State space partitioning and clustering with sensor alignment for autonomous robots

The goal of this study is to develop an intelligent wheelchair (IWC) acquiring autonomous, cooperative, and collaborative behavior. An important part of achieving the goal is that the agent controlling the IWC can realize suitable partitioning state of new environment for developing a behavior policy without human intervention. In particular, domestic robots like the IWCs which have low-cost, discrete simple range sensors have difficulty of efficient learning of environment because the deviation of sensor configuration in each robot causes serious problem in which each learning policy can not be shared among them. To overcome this problem, a new method of alignment and clustering of sensor space is conducted. The alignment process reconfigures sensor configurations on concentric circle from spot-turn behavior of IWC robots in order to compensate the deviation of the configuration. The clustering process realizes self-organized state space with simple vector partitioning method. The simulation experiment results have shown the method gives the IWC agent an ability of autonomous partitioning of environment and efficient constructing state space.

[1]  Holly A. Yanco Integrating Robotic Research: A Survey of Robotic Wheelchair Development , 1998 .

[2]  S P Levine,et al.  The NavChair Assistive Wheelchair Navigation System. , 1999, IEEE transactions on rehabilitation engineering : a publication of the IEEE Engineering in Medicine and Biology Society.

[3]  H. Hirata,et al.  Reinforcement learning to compensate for perceptual aliasing using dynamic additional parameter: motivational value , 2002, IEEE International Conference on Systems, Man and Cybernetics.

[4]  Jürgen Schmidhuber,et al.  HQ-Learning , 1997, Adapt. Behav..

[5]  Axel Lankenau,et al.  Smart Wheelchairs - State of the Art in an Emerging Market , 2000, Künstliche Intell..

[6]  P. Lanzi,et al.  Adaptive Agents with Reinforcement Learning and Internal Memory , 2000 .

[7]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[8]  Michael E. Cleary,et al.  Progress on the Deictic Controlled Wheelchair , 1996 .

[9]  Takashi Gomi,et al.  Developing Intelligent Wheelchairs for the Handicapped , 1998, Assistive Technology and Artificial Intelligence.

[10]  Lonnie Chrisman,et al.  Reinforcement Learning with Perceptual Aliasing: The Perceptual Distinctions Approach , 1992, AAAI.

[11]  Michael E. Cleary,et al.  Progress on the Deictically Controlled Wheelchair , 1998, Assistive Technology and Artificial Intelligence.

[12]  Hironori Hirata,et al.  Development of intelligent wheelchair acquiring autonomous, cooperative, and collaborative behavior , 2004, 2004 IEEE International Conference on Systems, Man and Cybernetics (IEEE Cat. No.04CH37583).