An Integrated System based on Binocular Learned Receptive Fields for Saccade-vergence on Visually Salient Targets

The human visual system uses saccadic and vergence eyes movements to foveate interesting objects with both eyes, and thus exploring the visual scene. To mimic this biological behavior in active vision, we proposed a bio-inspired integrated system able to learn a functional sensory representation of the environment, together with the motor commands for binocular eye coordination, directly by interacting with the environment itself. The proposed architecture, rather than sequentially combining different functionalities, is a robust integration of different modules that rely on a front-end of learned binocular receptive fields to specialize on different sub-tasks. The resulting modular architecture is able to detect salient targets in the scene and perform precise binocular saccadic and vergence movement on it. The performances of the proposed approach has been tested on the iCub Simulator, providing a quantitative evaluation of the computational potentiality of the learned sensory and motor resources.

[1]  G. Westheimer,et al.  Disjunctive eye movements , 1961, The Journal of physiology.

[2]  Paul B Hibbard,et al.  Distribution of independent components of binocular natural images. , 2015, Journal of vision.

[3]  John K. Tsotsos,et al.  Saliency Based on Information Maximization , 2005, NIPS.

[4]  Manuela Chessa,et al.  A Hierarchical System for a Distributed Representation of the Peripersonal Space of a Humanoid Robot , 2014, IEEE Transactions on Autonomous Mental Development.

[5]  R. Pfeifer,et al.  Self-Organization, Embodiment, and Biologically Inspired Robotics , 2007, Science.

[6]  Bertram E. Shi,et al.  Improved Binocular Vergence Control via a Neural Network That Maximizes an Internally Defined Reward , 2011, IEEE Transactions on Autonomous Mental Development.

[7]  David J. Field,et al.  Emergence of simple-cell receptive field properties by learning a sparse code for natural images , 1996, Nature.

[8]  Andrea Canessa,et al.  Vergence control learning through real V1 disparity tuning curves , 2015, 2015 7th International IEEE/EMBS Conference on Neural Engineering (NER).

[9]  David J. Field,et al.  Sparse coding with an overcomplete basis set: A strategy employed by V1? , 1997, Vision Research.

[10]  Majid Nili Ahmadabadi,et al.  Online learning of task-driven object-based visual attention control , 2010, Image Vis. Comput..

[11]  Andrea Canessa,et al.  A Portable Bio-Inspired Architecture for Efficient Robotic Vergence Control , 2016, International Journal of Computer Vision.

[12]  Matthias Bethge,et al.  Evaluating neuronal codes for inference using Fisher information , 2010, NIPS.

[13]  PoggioTomaso,et al.  Robust Object Recognition with Cortex-Like Mechanisms , 2007 .

[14]  Rajesh P. N. Rao,et al.  Embodiment is the foundation, not a level , 1996, Behavioral and Brain Sciences.

[15]  Casper J. Erkelens,et al.  The relationship between absolute disparity and ocular vergence , 2004, Biological Cybernetics.

[16]  Alexandre Bernardino,et al.  Multimodal saliency-based bottom-up attention a framework for the humanoid robot iCub , 2008, 2008 IEEE International Conference on Robotics and Automation.

[17]  Kenji Okajima,et al.  Binocular disparity encoding cells generated through an Infomax based learning algorithm , 2004, Neural Networks.

[18]  T. Sejnowski,et al.  Spatial Transformations in the Parietal Cortex Using Basis Functions , 1997, Journal of Cognitive Neuroscience.

[19]  Giorgio Metta,et al.  Design of the robot-cub (iCub) head , 2006, Proceedings 2006 IEEE International Conference on Robotics and Automation, 2006. ICRA 2006..

[20]  A. Parker,et al.  Quantitative analysis of the responses of V1 neurons to horizontal disparity in dynamic random-dot stereograms. , 2002, Journal of neurophysiology.

[21]  Yu Zhao,et al.  Autonomous learning of active multi-scale binocular vision , 2013, 2013 IEEE Third Joint International Conference on Development and Learning and Epigenetic Robotics (ICDL).

[22]  S. Tipper,et al.  A model of inhibitory mechanisms in selective attention. , 1994 .

[23]  George K. Hung,et al.  A Dual-Mode Dynamic Model of the Vergence Eye Movement System , 1986, IEEE Transactions on Biomedical Engineering.

[24]  Christof Koch,et al.  A Model of Saliency-Based Visual Attention for Rapid Scene Analysis , 2009 .

[25]  Xing Xie,et al.  Salient Region Detection Using Weighted Feature Maps Based on the Human Visual Attention Model , 2004, PCM.

[26]  C. Rovee-Collier,et al.  The development of visual search in infants and very young children. , 2002, Journal of experimental child psychology.

[27]  Angelo Cangelosi,et al.  An open-source simulator for cognitive robotics research: the prototype of the iCub humanoid robot simulator , 2008, PerMIS.

[28]  Michael W. Spratling,et al.  A neural model of binocular saccade planning and vergence control , 2015, Adapt. Behav..

[29]  Marco Antonelli,et al.  Implicit Sensorimotor Mapping of the Peripersonal Space by Gazing and Reaching , 2011, IEEE Transactions on Autonomous Mental Development.

[30]  Paul B. Hibbard,et al.  Ideal Binocular Disparity Detectors Learned Using Independent Subspace Analysis on Binocular Natural Image Pairs , 2016, PloS one.

[31]  Manuela Chessa,et al.  A cortical model for binocular vergence control without explicit calculation of disparity , 2010, Neurocomputing.

[32]  Aapo Hyvärinen,et al.  Emergence of Phase- and Shift-Invariant Features by Decomposition of Natural Images into Independent Feature Subspaces , 2000, Neural Computation.

[33]  J. Daugman Uncertainty relation for resolution in space, spatial frequency, and orientation optimized by two-dimensional visual cortical filters. , 1985, Journal of the Optical Society of America. A, Optics and image science.

[34]  Fabio Solari,et al.  Autonomous learning of disparity-vergence behavior through distributed coding and population reward: Basic mechanisms and real-world conditioning on a robot stereo head , 2015, Robotics Auton. Syst..

[35]  I. Ohzawa,et al.  Stereoscopic depth discrimination in the visual cortex: neurons ideally suited as disparity detectors. , 1990, Science.

[36]  Agostino Gibaldi,et al.  Emerging spatial competences: From machine perception to sensorimotor intelligence , 2015, Robotics Auton. Syst..

[37]  Dimitri Ognibene,et al.  Ecological Active Vision: Four Bioinspired Principles to Integrate Bottom–Up and Adaptive Top–Down Attention Tested With a Simple Camera-Arm Robot , 2015, IEEE Transactions on Autonomous Mental Development.

[38]  Rajesh P. N. Rao,et al.  Predictive coding in the visual cortex: a functional interpretation of some extra-classical receptive-field effects. , 1999 .

[39]  Ning Qian,et al.  Computing Stereo Disparity and Motion with Known Binocular Cell Properties , 1994, Neural Computation.

[40]  Aapo Hyvärinen,et al.  Natural Image Statistics - A Probabilistic Approach to Early Computational Vision , 2009, Computational Imaging and Vision.

[41]  Thomas Serre,et al.  A Theory of Object Recognition: Computations and Circuits in the Feedforward Path of the Ventral Stream in Primate Visual Cortex , 2005 .

[42]  D Guitton,et al.  Visual-motor transformations required for accurate and kinematically correct saccades. , 1997, Journal of neurophysiology.

[43]  HongJiang Zhang,et al.  Contrast-based image attention analysis by using fuzzy growing , 2003, MULTIMEDIA '03.

[44]  Jacob L. Orquin,et al.  Attention and choice: a review on eye movements in decision making. , 2013, Acta psychologica.