Bio-inspired adaptive feedback error learning architecture for motor control

This study proposes an adaptive control architecture based on an accurate regression method called Locally Weighted Projection Regression (LWPR) and on a bio-inspired module, such as a cerebellar-like engine. This hybrid architecture takes full advantage of the machine learning module (LWPR kernel) to abstract an optimized representation of the sensorimotor space while the cerebellar component integrates this to generate corrective terms in the framework of a control task. Furthermore, we illustrate how the use of a simple adaptive error feedback term allows to use the proposed architecture even in the absence of an accurate analytic reference model. The presented approach achieves an accurate control with low gain corrective terms (for compliant control schemes). We evaluate the contribution of the different components of the proposed scheme comparing the obtained performance with alternative approaches. Then, we show that the presented architecture can be used for accurate manipulation of different objects when their physical properties are not directly known by the controller. We evaluate how the scheme scales for simulated plants of high Degrees of Freedom (7-DOFs).

[1]  Sethu Vijayakumar,et al.  Implications of different classes of sensorimotor disturbance for cerebellar-based motor learning models , 2009, Biological Cybernetics.

[2]  Eduardo Ros,et al.  A real-time spiking cerebellum model for learning robot control , 2008, Biosyst..

[3]  Tadashi Yamazaki,et al.  A Possible Mechanism for Controlling Timing Representation in the Cerebellar Cortex , 2010, ISNN.

[4]  D. Wolpert Computational approaches to motor control , 1997, Trends in Cognitive Sciences.

[5]  Masao Ito Mechanisms of motor learning in the cerebellum 1 1 Published on the World Wide Web on 24 November 2000. , 2000, Brain Research.

[6]  Christopher H. Yeo,et al.  Cerebellar Function in Consolidation of a Motor Memory , 2002, Neuron.

[7]  Stefan Schaal,et al.  Locally Weighted Projection Regression: Incremental Real Time Learning in High Dimensional Space , 2000, ICML.

[8]  Stefan Schaal,et al.  Incremental Online Learning in High Dimensions , 2005, Neural Computation.

[9]  Eduardo Ros,et al.  Cerebellarlike Corrective Model Inference Engine for Manipulation Tasks , 2011, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[10]  Bernhard Schölkopf,et al.  A tutorial on support vector regression , 2004, Stat. Comput..

[11]  Mitsuo Kawato,et al.  Internal models for motor control and trajectory planning , 1999, Current Opinion in Neurobiology.

[12]  José Melo,et al.  Gaussian Processes for regression : a tutorial , 2012 .

[13]  D. Marr A theory of cerebellar cortex , 1969, The Journal of physiology.

[14]  James V. Stone,et al.  Recurrent cerebellar architecture solves the motor-error problem , 2004, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[15]  Duy Nguyen-Tuong,et al.  Learning Robot Dynamics for Computed Torque Control Using Local Gaussian Processes Regression , 2008, 2008 ECSIS Symposium on Learning and Adaptive Behaviors for Robotic Systems (LAB-RS).

[16]  Patrick van der Smagt,et al.  Analysis and control of a rubbertuator arm , 1996, Biological Cybernetics.

[17]  John Porrill,et al.  Recurrent Cerebellar Loops Simplify Adaptive Control of Redundant and Nonlinear Motor Systems , 2007, Neural Computation.

[18]  Alin Albu-Schäffer,et al.  Safe Physical Human-Robot Interaction: Measurements, Analysis and New Insights , 2007, ISRR.

[19]  Aiko Miyamura,et al.  Stability of feedback error learning scheme , 2002, Syst. Control. Lett..

[20]  Olivier J. M. D. Coenen,et al.  Model of granular layer encoding in the cerebellum , 2004, Neurocomputing.

[21]  Hermano Igo Krebs,et al.  An Internal Model for Acquisition and Retention of Motor Learning During Arm Reaching , 2009, Neural Computation.

[22]  Tadashi Yamazaki,et al.  The cerebellum as a liquid state machine , 2007, Neural Networks.

[23]  Mitsuo Kawato,et al.  Adaptive feedback control models of the vestibulocerebellum and spinocerebellum , 2004, Biological Cybernetics.

[24]  Peter I. Corke,et al.  A robotics toolbox for MATLAB , 1996, IEEE Robotics Autom. Mag..

[25]  J. Albus A Theory of Cerebellar Function , 1971 .

[26]  Jun Nakanishi,et al.  Feedback error learning and nonlinear adaptive control , 2004, Neural Networks.

[27]  Stefan Schaal,et al.  Scalable Techniques from Nonparametric Statistics for Real Time Robot Learning , 2002, Applied Intelligence.

[28]  K. Doya,et al.  Unsupervised learning of granule cell sparse codes enhances cerebellar adaptive control , 2001, Neuroscience.

[29]  Silvia Tolu,et al.  Adaptive cerebellar Spiking Model Embedded in the Control Loop: Context Switching and Robustness against noise , 2011, Int. J. Neural Syst..

[30]  Eduardo Ros,et al.  Cerebellar Input Configuration Toward Object Model Abstraction in Manipulation Tasks , 2011, IEEE Transactions on Neural Networks.

[31]  Shun-ichi Amari,et al.  A computational study of synaptic mechanisms of partial memory transfer in cerebellar vestibulo-ocular-reflex learning , 2008, Journal of Computational Neuroscience.

[32]  William Wisden,et al.  Synaptic inhibition of Purkinje cells mediates consolidation of vestibulo-cerebellar motor learning , 2009, Nature Neuroscience.

[33]  Stefan Schaal,et al.  Biomimetic gaze stabilization based on feedback-error-learning with nonparametric regression networks , 2001, Neural Networks.

[34]  Joshua G. Hale,et al.  Using Humanoid Robots to Study Human Behavior , 2000, IEEE Intell. Syst..

[35]  Masao Ito Control of mental activities by internal models in the cerebellum , 2008, Nature Reviews Neuroscience.

[36]  T. Sejnowski,et al.  Storing covariance with nonlinearly interacting neurons , 1977, Journal of mathematical biology.

[37]  D. Wolpert,et al.  Internal models in the cerebellum , 1998, Trends in Cognitive Sciences.

[38]  E. Boyden,et al.  Cerebellum-dependent learning: the role of multiple plasticity mechanisms. , 2004, Annual review of neuroscience.

[39]  Michael I. Jordan,et al.  Forward Models: Supervised Learning with a Distal Teacher , 1992, Cogn. Sci..

[40]  Patrick van der Smagt Cerebellar Control of Robot Arms , 1998, Connect. Sci..

[41]  James S. Albus,et al.  New Approach to Manipulator Control: The Cerebellar Model Articulation Controller (CMAC)1 , 1975 .

[42]  P. Dean,et al.  The cerebellar microcircuit as an adaptive filter: experimental and computational evidence , 2010, Nature Reviews Neuroscience.

[43]  M. Arbib,et al.  Role of the cerebellum in reaching movements in humans. II. A neural model of the intermediate cerebellum , 1998, The European journal of neuroscience.

[44]  Michael I. Jordan Computational aspects of motor control and motor learning , 2008 .

[45]  M. Fujita,et al.  Adaptive filter model of the cerebellum , 1982, Biological Cybernetics.

[46]  Mitsuo Kawato,et al.  Feedback-Error-Learning Neural Network for Supervised Motor Learning , 1990 .

[47]  Hong Liu,et al.  A mechatronics approach to the design of light-weight arms and multifingered hands , 2000, Proceedings 2000 ICRA. Millennium Conference. IEEE International Conference on Robotics and Automation. Symposia Proceedings (Cat. No.00CH37065).

[48]  John J. Craig Zhu,et al.  Introduction to robotics mechanics and control , 1991 .