A reinforcement-learning approach to reactive control policy design for autonomous robots

Within the field of robotics, much recent attention has been given to control techniques that have been termed reactive or behavior-based. The design of such control systems for even a remotely interesting task is typically a laborious effort, requiring many hours of experimental "tweaking" as the actual behavior of the system is observed by the system designer. In this paper, the authors present a neural-based reinforcement learning approach to the design of reactive control policies in which the designer specifies the the desired behavior of the system, rather than the control program that produces the desired behavior.<<ETX>>

[1]  Arthur L. Samuel,et al.  Some Studies in Machine Learning Using the Game of Checkers , 1967, IBM J. Res. Dev..

[2]  A. L. Samuel,et al.  Some studies in machine learning using the game of checkers. II: recent progress , 1967 .

[3]  Richard S. Sutton,et al.  Neuronlike adaptive elements that can solve difficult learning control problems , 1983, IEEE Transactions on Systems, Man, and Cybernetics.

[4]  Rodney A. Brooks,et al.  A Robust Layered Control Syste For A Mobile Robot , 2022 .

[5]  George A. Bekey,et al.  Robot control by reflex actions , 1986, Proceedings. 1986 IEEE International Conference on Robotics and Automation.

[6]  Michael A. Arbib,et al.  Schemas and Neural Networks for Sixth Generation Computing. Invited Survey , 1989, J. Parallel Distributed Comput..

[7]  Rodney A. Brooks,et al.  Learning to Coordinate Behaviors , 1990, AAAI.

[8]  Ronald C. Arkin,et al.  Integrating behavioral, perceptual, and world knowledge in reactive navigation , 1990, Robotics Auton. Syst..

[9]  Richard S. Sutton,et al.  Integrated Architectures for Learning, Planning, and Reacting Based on Approximating Dynamic Programming , 1990, ML.

[10]  Rodney A. Brooks,et al.  Intelligence Without Reason , 1991, IJCAI.

[11]  H. Harry Asada,et al.  Transferring Manipulative Skills to Robots-Representation and Acquisition of Tool Manipulative Skill , 1992 .

[12]  Andrew H. Fagg,et al.  Genetic programming approach to the construction of a neural network for control of a walking robot , 1992, Proceedings 1992 IEEE International Conference on Robotics and Automation.

[13]  Ben J. A. Kröse,et al.  Distributed adaptive control: The self-organization of structured behavior , 1992, Robotics Auton. Syst..

[14]  Sridhar Mahadevan,et al.  Automatic Programming of Behavior-Based Robots Using Reinforcement Learning , 1991, Artif. Intell..

[15]  Michael A. Arbib,et al.  A Model of Primate Visual-Motor Conditional Learning , 1992, Adapt. Behav..

[16]  L.-J. Lin,et al.  Hierarchical learning of robot skills by reinforcement , 1993, IEEE International Conference on Neural Networks.

[17]  Stephen H. Lane,et al.  Fast Sensorimotor Skill Acquisition based on Rule-Based Training of Neural Networks , 1993 .

[18]  Andrew H. Fagg,et al.  Developmental Robotics: A New Approach to the Specification of Robot Programs , 1993 .

[19]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.