Adaptive State Space Quantisation for Reinforcement Learning of Collision-Free Navigation

The paper describes a self-learning control system for a mobile robot. Based on sensor information, the control system must provide a steering signal such that collisions are avoided. Since in our case no 'examples' are available, the system learns on the basis of an external reinforcement signal, which is negative in the case of a collision and zero otherwise. Rules from Temporal Difference learning are used to find the correct mapping between the (discrete) sensor input space and the steering signal. We describe the algorithm for learning the correct mapping from the input (state) vector to the output (steering) signal, and the algorithm used for a discrete coding of the input state space.

Keywords: reinforcement learning, neural networks, state-space quantisation, mobile robot navigation.
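The learning setup outlined above, a discrete state space, a reinforcement signal that is negative only on collision, and a Temporal Difference update toward a steering decision, can be illustrated with a minimal tabular sketch. Note that the paper learns the state-space quantisation adaptively; the fixed sensor binning, the Q-table, and all parameter values below are illustrative assumptions rather than the authors' algorithm.

```python
import numpy as np

# Illustrative assumptions (not taken from the paper):
N_SENSORS = 3          # number of range sensors
N_BINS = 4             # fixed bins per sensor (the paper adapts its quantisation)
N_STATES = N_BINS ** N_SENSORS
N_ACTIONS = 5          # discrete steering commands

ALPHA = 0.1            # learning rate
GAMMA = 0.95           # discount factor
EPSILON = 0.1          # exploration rate

rng = np.random.default_rng(0)
q_table = np.zeros((N_STATES, N_ACTIONS))

def discretise(sensor_readings, max_range=1.0):
    """Map continuous sensor readings to a single discrete state index."""
    bins = np.clip((np.asarray(sensor_readings) / max_range * N_BINS).astype(int),
                   0, N_BINS - 1)
    index = 0
    for b in bins:
        index = index * N_BINS + int(b)
    return index

def select_steering(state):
    """Epsilon-greedy choice among the discrete steering commands."""
    if rng.random() < EPSILON:
        return int(rng.integers(N_ACTIONS))
    return int(np.argmax(q_table[state]))

def td_update(state, action, reinforcement, next_state):
    """Temporal Difference update; reinforcement is -1 on collision, 0 otherwise."""
    target = reinforcement + GAMMA * np.max(q_table[next_state])
    q_table[state, action] += ALPHA * (target - q_table[state, action])
```

In this sketch the external reinforcement carries no gradation: every transition is rewarded 0 except those ending in a collision, which receive -1, so the TD rule must propagate this sparse penalty backwards through the visited states to shape the steering policy.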