REINFORCEMENT LEARNING FOR COORDINATED REACTIVE CONTROL

The demands of rapid response and the complexity of many environments make it diicult to decompose, tune and coordinate reactive behaviors while ensuring consistency. Reinforcement learning networks can address the tuning problem, but do not address the problem of decomposition and coordination. We hypothesize that interacting reactions can often be decomposed into separate control tasks resident in separate networks and that the interaction can be coordinated through the tuning mechanism and a higher level controller. To explore these issues, we have implemented a reinforcement learning architecture as the reactive component of a two layer control system for a simulated race car. By varying the architecture, we test whether decomposing reactivity into separate controllers leads to superior overall performance and learning convergence in our domain.

[1]  Richard S. Sutton,et al.  Temporal credit assignment in reinforcement learning , 1984 .

[2]  Charles W. Anderson,et al.  Strategy Learning with Multilayer Connectionist Representations , 1987 .

[3]  T. Iberall,et al.  Neural network architecture for robot hand control , 1989, IEEE Control Systems Magazine.

[4]  A. Barto,et al.  Learning and Sequential Decision Making , 1989 .

[5]  Stephen H. Lane,et al.  Integrating Knowledge-Based System and Neural Network Techniques for Robotic Skill Acquisition , 1989, IJCAI.

[6]  G. B. Andeen,et al.  Structured neural-network approach to robot motion control , 1991, [Proceedings] 1991 IEEE International Joint Conference on Neural Networks.

[7]  Stephen Grossberg,et al.  A self-organizing neural network model for redundant sensory-motor control, motor equivalence, and tool use , 1992, [Proceedings 1992] IJCNN International Joint Conference on Neural Networks.

[8]  Abhijit S. Pandya,et al.  Using a cerebellar model arithmetic computer (CMAC) neural network to control an autonomous underwater vehicle , 1992, [Proceedings 1992] IJCNN International Joint Conference on Neural Networks.

[9]  F. J. Radermacher,et al.  Integration of Sub-Symbolic and Symbolic Information Processing in Robot Control , 1992, Proceedings of the Third Annual Conference of AI, Simulation, and Planning in High Autonomy Systems 'Integrating Perception, Planning and Action'..

[10]  Marcelo H. Ang,et al.  Performance of a neuro-model-based robot controller: adaptability and noise rejection , 1992 .

[11]  Marco Colombetti,et al.  Robot Shaping: Developing Autonomous Agents Through Learning , 1994, Artif. Intell..

[12]  John D. Lowrance,et al.  Planning and reacting in uncertain and dynamic environments , 1995, J. Exp. Theor. Artif. Intell..