论文信息 - A partially recurrent gating network approach to learning action selection by reinforcement

A partially recurrent gating network approach to learning action selection by reinforcement

We describe a neural network approach to the problem of reactive navigation, using a simulated mobile robot. Specifically, it is shown that complementary reinforcement backpropagation learning can be a means for modular networks to acquire different navigation related skills concurrently, Further, it is demonstrated that a partially recurrent net can function as a gating network to coordinate the reinforcement learning across modules and across time steps. In effect, the recurrent gating network performs action selection by choosing developing experts to make control decisions in the context of previous actions in the temporally extended domain.

R. M. Rylatt | Chris Czarnecki | T. W. Routen | C. Czarnecki | T. Routen

[1] R. A. Brooks,et al. Intelligence without Representation , 1991, Artif. Intell..

[2] John F. Kolen,et al. The importance of leaky levels for behavior-based AI , 1994 .

[3] R. J. Williams,et al. On the use of backpropagation in associative reinforcement learning , 1988, IEEE 1988 International Conference on Neural Networks.

[4] Pattie Maes,et al. Designing autonomous agents: Theory and practice from biology to engineering and back , 1990, Robotics Auton. Syst..

[5] David H. Ackley,et al. Generalization and Scaling in Reinforcement Learning , 1989, NIPS.

[6] Rodney A. Brooks,et al. A Robust Layered Control Syste For A Mobile Robot , 2022 .

[7] Geoffrey E. Hinton,et al. Adaptive Mixtures of Local Experts , 1991, Neural Computation.

[8] Gary McGraw,et al. Emergent Control and Planning in an Autonomous Vehicle , 1993 .