Learning multisensory neural controllers for robot arm tracking

Humans learn multisensory eye-hand coordination starting from infancy without supervision. For an example, they learn to track their hands by exploiting various sensory modalities, such as vision and proprioception. This integration occurs as they learn to perceive the world around them and their relationship to it. Most prior work has focused on the role of vision, as it is a primary sensory source for humans. However, it is interesting to study how vision and proprioception interact. We propose a system which combines visual and proprioceptive information to learn the eye-hand coordination skills that enable a robot to fixate its camera gaze on the end effector of its arm. In our model, visual cues are part of the feedback control loop, whereas proprioceptive cues are part of a feedforward control loop. Both controllers, as well as the sensory transform from raw visual information to a neural sensory representation are learned as the robot performs motor babbling movements. Visual information is encoded by sparse coding. The basis functions that emerge are similar to the receptive fields in the human visual cortex. An actor-critic reinforcement learning algorithm is used to drive eye motor neurons fusing visual and proprioceptive cues. We model and test the system in the iCub simulation environment. Our results suggest that these sensory modalities are capable of jointly learning model parameters to perform the tracking task. The evolved policy has characteristics that are qualitatively similar to the human oculomotor plant.

[1]  Wolfram Burgard,et al.  Body schema learning for robotic manipulators from visual self-perception , 2009, Journal of Physiology - Paris.

[2]  David J. Field,et al.  Sparse coding with an overcomplete basis set: A strategy employed by V1? , 1997, Vision Research.

[3]  H. Hughes,et al.  Smooth pursuit of nonvisual motion. , 2006, Journal of neurophysiology.

[4]  Marco Antonelli,et al.  Implicit Sensorimotor Mapping of the Peripersonal Space by Gazing and Reaching , 2011, IEEE Transactions on Autonomous Mental Development.

[5]  François Chaumette,et al.  Visual servo control. I. Basic approaches , 2006, IEEE Robotics & Automation Magazine.

[6]  Mitsuo Kawato,et al.  A computational model of four regions of the cerebellum based on feedback-error learning , 2004, Biological Cybernetics.

[7]  Giulio Sandini,et al.  Own body perception based on visuomotor correlation , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[8]  Angelo Cangelosi,et al.  An open-source simulator for cognitive robotics research: the prototype of the iCub humanoid robot simulator , 2008, PerMIS.

[9]  Shalabh Bhatnagar,et al.  Natural actor-critic algorithms , 2009, Autom..

[10]  Pattie Maes,et al.  Self-Taught Visually-Guided Pointing for a Humanoid Robot , 1996 .

[11]  Yu Zhao,et al.  Intrinsically motivated learning of visual motion perception and smooth pursuit , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[12]  John Porrill,et al.  Recurrent Cerebellar Loops Simplify Adaptive Control of Redundant and Nonlinear Motor Systems , 2007, Neural Computation.

[13]  Amy S. Joh,et al.  Motor development: How infants get into the act , 2006 .

[14]  Manuela Chessa,et al.  A Hierarchical System for a Distributed Representation of the Peripersonal Space of a Humanoid Robot , 2014, IEEE Transactions on Autonomous Mental Development.

[15]  G M Gauthier,et al.  Dynamic analysis of human visuo-oculo-manual coordination control in target tracking tasks. , 1993, Aviation, space, and environmental medicine.

[16]  S Shimojo,et al.  Suppressive Effect of Multimodal Surface Representation on Ocular Smooth Pursuit of Invisible Hand , 1997, Perception.

[17]  Jean-Louis Vercher,et al.  Manuo-ocular coordination in target tracking. I. A model simulating human performance , 1997, Biological Cybernetics.

[18]  Michael I. Jordan,et al.  Forward Models: Supervised Learning with a Distal Teacher , 1992, Cogn. Sci..

[19]  P. Rochat Self-perception and action in infancy , 1998, Experimental Brain Research.

[20]  Stefan Schaal,et al.  Biomimetic Oculomotor Control , 2001, Adapt. Behav..

[21]  Helge J. Ritter,et al.  Three-dimensional neural net for learning visuomotor coordination of a robot arm , 1990, IEEE Trans. Neural Networks.

[22]  Florentin Wörgötter,et al.  Cognitive agents - a procedural perspective relying on the predictability of Object-Action-Complexes (OACs) , 2009, Robotics Auton. Syst..

[23]  Stéphane Mallat,et al.  Matching pursuits with time-frequency dictionaries , 1993, IEEE Trans. Signal Process..

[24]  Marco Antonelli,et al.  Learning the visual-oculomotor transformation: Effects on saccade control and space representation , 2015, Robotics Auton. Syst..

[25]  Kevin C. Dieter,et al.  Kinesthesis Can Make an Invisible Hand Visible , 2014, Psychological science.

[26]  David Vernon,et al.  Using neural networks to learn hand-eye co-ordination , 1994, Neural Computing & Applications.

[27]  Brian Scassellati,et al.  A Fast and Efficient Model for Learning to Reach , 2005, Int. J. Humanoid Robotics.

[28]  Mark H. Lee,et al.  Integration of Active Vision and Reaching From a Developmental Robotics Perspective , 2010, IEEE Transactions on Autonomous Mental Development.

[29]  Sergey Levine,et al.  Learning Hand-Eye Coordination for Robotic Grasping with Large-Scale Data Collection , 2016, ISER.

[30]  J R Lackner,et al.  Visual Tracking of Active and Passive Movements of the Hand , 1980, The Quarterly journal of experimental psychology.

[31]  Wolfram Schenck,et al.  Learning visuomotor transformations for gaze-control and grasping , 2005, Biological Cybernetics.

[32]  Alexander Maye,et al.  Extending sensorimotor contingency theory: prediction, planning, and action generation , 2013, Adapt. Behav..

[33]  A. Noë,et al.  A sensorimotor account of vision and visual consciousness. , 2001, The Behavioral and brain sciences.

[34]  Linda Lillakas,et al.  Children’s pursuit eye movements: a developmental study , 2003, Vision Research.

[35]  Tao Zhou,et al.  Learning Visuomotor Transformations and End Effector Appearance by Local Visual Consistency , 2016, IEEE Transactions on Cognitive and Developmental Systems.

[36]  Aude Billard,et al.  Online Learning of the Body Schema , 2008, Int. J. Humanoid Robotics.