A Hierarchical System for a Distributed Representation of the Peripersonal Space of a Humanoid Robot

Human beings easily reach for a target object in an unknown and unstructured environment. Designing a humanoid robot that executes the same task, however, requires implementing complex abilities, such as identifying the target in the visual field, estimating its spatial location, and precisely driving the arm motors to reach it. While research usually tackles these abilities individually, in this work we integrate a number of computational models into a unified framework and demonstrate, on a humanoid torso, the feasibility of an integrated working representation of its peripersonal space. To achieve this goal, we propose a cognitive architecture that connects several models inspired by neural circuits of the visual, frontal and posterior parietal cortices of the brain. The outcome of the integration process is a system that allows the robot to build its internal model and its representation of the surrounding space by interacting directly with the environment, through a mutual adaptation of perception and action. The robot is eventually capable of executing a set of tasks, such as recognizing, gazing at and reaching for target objects, which can work separately or cooperate to support more structured and effective behaviors.
