Refinement of robot motor skills through reinforcement learning

An extension of earlier work in the refinement of robotic motor control using reinforcement learning is described. It is no longer assumed that the magnitude of the state-dependent nonlinear torque is known. The learning controller learns about not only the presence of the torque, but also its magnitude. The ability of the learning system to learn this real-valued mapping from output feedback and reference input to control signal is facilitated by a stochastic algorithm that uses reinforcement feedback. A learning controller that can learn nonlinear mappings holds many possibilities for extending existing adaptive control research.<<ETX>>

[1]  Kumpati S. Narendra,et al.  Stochastic Automata Models with Applications to Learning Systems , 1973, IEEE Trans. Syst. Man Cybern..

[2]  E. Freund Fast Nonlinear Control with Arbitrary Pole-Placement for Industrial Robots and Manipulators , 1982 .

[3]  C. Desoer,et al.  Multivariable Feedback Systems , 1982 .

[4]  Richard S. Sutton,et al.  Neuronlike adaptive elements that can solve difficult learning control problems , 1983, IEEE Transactions on Systems, Man, and Cybernetics.

[5]  H. Elliott,et al.  Nonlinear Adaptive Control of Mechanical Linkage Systems with Application to Robotics , 1983, 1983 American Control Conference.

[6]  P. Anandan,et al.  Pattern-recognizing stochastic learning automata , 1985, IEEE Transactions on Systems, Man, and Cybernetics.

[7]  Charles W. Anderson,et al.  Learning and problem-solving with multilayer connectionist systems (adaptive, strategy learning, neural networks, reinforcement learning) , 1986 .

[8]  Christopher G. Atkeson,et al.  Experimental determination of the effect of feedforward control on trajectory tracking errors , 1986, Proceedings. 1986 IEEE International Conference on Robotics and Automation.

[9]  Tien C. Hsia,et al.  Adaptive control of robot manipulators - A review , 1986, Proceedings. 1986 IEEE International Conference on Robotics and Automation.

[10]  King-Sun Fu,et al.  Learning Control Systems-Review and Outlook , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Christopher G. Atkeson,et al.  Roles of Knowledge in Motor Learning , 1987 .

[12]  Filson H. Glanz,et al.  Application of a General Learning Algorithm to the Control of Robotic Manipulators , 1987 .

[13]  Carlos Canudas de Wit,et al.  Adaptive friction compensation in DC motor drives , 1986, Proceedings. 1986 IEEE International Conference on Robotics and Automation.

[14]  Judy A. Franklin Compliance and learning: control skills for a robot operating in an uncertain world , 1988 .