Neural network learning control of robot manipulators using gradually increasing task difficulty

Trajectory extension learning is an incremental method for training an artificial neural network to approximate the inverse dynamics of a robot manipulator. Training data near a desired trajectory is obtained by slowly varying a parameter of the trajectory from a region of easy solvability of the inverse dynamics toward the desired behavior. The parameter can be average speed, path shape, feedback gain, or any other controllable variable. As learning proceeds, an approximate solution to the local inverse dynamics for each value of the parameter is used to guide learning for the next value of the parameter. Convergence conditions are given for two variations on the algorithm. Examples are shown of application to a real 2-joint direct drive robot arm and a simulated 3-joint redundant arm, both using simulated equilibrium point control. >

[1]  King-Sun Fu,et al.  Learning control systems--Review and outlook , 1970 .

[2]  John R. Anderson Cognitive Psychology and Its Implications , 1980 .

[3]  H. Dr,et al.  Separate cortical systems for control of joint movement and joint stiffness: reciprocal activation and coactivation of antagonist muscles. , 1983 .

[4]  Richard S. Sutton,et al.  Neuronlike adaptive elements that can solve difficult learning control problems , 1983, IEEE Transactions on Systems, Man, and Cybernetics.

[5]  D. Humphrey,et al.  Separate cortical systems for control of joint movement and joint stiffness: reciprocal activation and coactivation of antagonist muscles. , 1983, Advances in neurology.

[6]  E. Bizzi,et al.  Posture control and trajectory formation during arm movement , 1984, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[7]  N. Hogan An organizing principle for a class of voluntary movements , 1984, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[8]  N. Hogan Adaptive control of mechanical impedance by coactivation of antagonist muscles , 1984 .

[9]  Suguru Arimoto,et al.  Bettering operation of Robots by learning , 1984, J. Field Robotics.

[10]  Marc H. Raibert,et al.  Tabular control of balance in a dynamic legged system , 1984, IEEE Transactions on Systems, Man, and Cybernetics.

[11]  E. Bizzi,et al.  Neural, mechanical, and geometric factors subserving arm posture in humans , 1985, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[12]  J. Sloboda The Musical Mind: The Cognitive Psychology of Music , 1987 .

[13]  Geoffrey E. Hinton,et al.  Learning internal representations by error propagation , 1986 .

[14]  King-Sun Fu,et al.  Learning Control Systems-Review and Outlook , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  W. Thomas Miller,et al.  Sensor-based control of robotic manipulators using a general learning algorithm , 1987, IEEE J. Robotics Autom..

[16]  M. J. D. Powell,et al.  Radial basis functions for multivariable interpolation: a review , 1987 .

[17]  Bernard Widrow,et al.  Adaptive switching circuits , 1988 .

[18]  Christopher G. Atkeson,et al.  Model-Based Control of a Robot Manipulator , 1988 .

[19]  Barak A. Pearlmutter,et al.  Using a neural network to learn the dynamics of the CMU Direct-Drive Arm II , 1988 .

[20]  R. Su,et al.  Learning control for a class of nonlinear systems , 1989, Proceedings. IEEE International Symposium on Intelligent Control 1989.

[21]  Klaus Schulten,et al.  Topology-conserving maps for learning visuo-motor-coordination , 1989, Neural Networks.

[22]  Mark J. Damborg,et al.  An adaptive controller for a one-legged mobile robot , 1989, IEEE Trans. Robotics Autom..

[23]  J. Selinsky,et al.  A learning/adaptive robot controller , 1989, Proceedings. IEEE International Symposium on Intelligent Control 1989.

[24]  C. Atkeson,et al.  Learning arm kinematics and dynamics. , 1989, Annual review of neuroscience.

[25]  Alessandro De Luca,et al.  A frequency-domain approach to learning control: implementation for a robot manipulator , 1989, Proceedings. IEEE International Symposium on Intelligent Control 1989.

[26]  Kumpati S. Narendra,et al.  Identification and control of dynamical systems using neural networks , 1990, IEEE Trans. Neural Networks.

[27]  Michael Kuperstein,et al.  Neural controller for adaptive movements with unforeseen payloads , 1990, IEEE Trans. Neural Networks.

[28]  Helge J. Ritter,et al.  Three-dimensional neural net for learning visuomotor coordination of a robot arm , 1990, IEEE Trans. Neural Networks.

[29]  Roberto Horowitz,et al.  A new adaptive learning rule , 1990, Proceedings., IEEE International Conference on Robotics and Automation.

[30]  Reza Shadmehr Learning Virtual Equilibrium Trajectories for Control of a Robot Arm , 1990, Neural Computation.

[31]  J. McIntyre Utilizing elastic system properties for the control of posture and movement , 1990 .

[32]  John B. Moore,et al.  Exponential convergence of a learning controller for robot manipulators , 1991 .

[33]  Takayuki Yamada,et al.  Learning control using neural networks , 1991, Proceedings. 1991 IEEE International Conference on Robotics and Automation.

[34]  Zhihua Qu,et al.  A new learning control scheme for robots , 1991, Proceedings. 1991 IEEE International Conference on Robotics and Automation.

[35]  F A Mussa-Ivaldi,et al.  Computations underlying the execution of movement: a biological perspective. , 1991, Science.

[36]  Suguru Arimoto,et al.  Selective learning with a forgetting factor for robotic motion control , 1991, Proceedings. 1991 IEEE International Conference on Robotics and Automation.

[37]  Viale Causa,et al.  IMPLEMENTATION OF LEARNING CONTROL TECHNIQUES USING DESCRIPTOR SYSTEMS METHODS , 1991 .

[38]  Alessandro De Luca,et al.  Learning control for redundant manipulators , 1991, Proceedings. 1991 IEEE International Conference on Robotics and Automation.

[39]  Terence D. Sanger,et al.  A tree-structured adaptive network for function approximation in high-dimensional spaces , 1991, IEEE Trans. Neural Networks.

[40]  Jean-Jacques E. Slotine,et al.  Stable adaptive control and recursive identification using radial Gaussian networks , 1991, [1991] Proceedings of the 30th IEEE Conference on Decision and Control.

[41]  Toshiharu Sugie,et al.  An iterative learning control law for dynamical systems , 1991, Autom..

[42]  P. Patrick Van Der Smagt,et al.  A real-time learning neural robot controller , 1991 .

[43]  Kenneth Kreutz-Delgado,et al.  Learning Global Direct Inverse Kinematics , 1991, NIPS.

[44]  Terence D. Sanger,et al.  A Tree-Structured Algorithm for Reducing Computation in Networks with Separable Basis Functions , 1991, Neural Computation.

[45]  Robert M. Sanner,et al.  Gaussian Networks for Direct Adaptive Control , 1991, 1991 American Control Conference.

[46]  Roberto Horowitz,et al.  A new adaptive learning rule , 1991 .

[47]  Yasuharu Koike,et al.  Human hand stiffness during discrete point-to-point multi-joint movement , 1992, 1992 14th Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[48]  Eli Tzirkel-Hancock,et al.  Stable control of nonlinear systems using neural networks , 1992 .

[49]  R. E. van Emmerick Kinematic adaptations to perturbations as a function of practice in rhythmic drawing movements. , 1992, Journal of motor behavior.