Joint Learning of Binocularly Driven Saccades and Vergence by Active Efficient Coding

This paper investigates two types of eye movements: vergence and saccades. Vergence eye movements are responsible for bringing the images of the two eyes into correspondence, whereas saccades drive gaze to interesting regions in the scene. Control of both vergence and saccades develops during early infancy. To date, these two types of eye movements have been studied separately. Here, we propose a computational model of an active vision system that integrates these two types of eye movements. We hypothesize that incorporating a saccade strategy driven by bottom-up attention will benefit the development of vergence control. The integrated system is based on the active efficient coding framework, which describes the joint development of sensory-processing and eye movement control to jointly optimize the coding efficiency of the sensory system. In the integrated system, we propose a binocular saliency model to drive saccades based on learned binocular feature extractors, which simultaneously encode both depth and texture information. Saliency in our model also depends on the current fixation point. This extends prior work, which focused on monocular images and saliency measures that are independent of the current fixation. Our results show that the proposed saliency-driven saccades lead to better vergence performance and faster learning in the overall system than random saccades. Faster learning is significant because it indicates that the system actively selects inputs for the most effective learning. This work suggests that saliency-driven saccades provide a scaffold for the development of vergence control during infancy.

[1]  Yu Zhao,et al.  Intrinsically motivated learning of visual motion perception and smooth pursuit , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[2]  Bertram E. Shi,et al.  The generative Adaptive Subspace Self-Organizing Map , 2014, 2014 International Joint Conference on Neural Networks (IJCNN).

[3]  Olivier A Coubard,et al.  Saccade and vergence eye movements: a review of motor and premotor commands , 2013, The European journal of neuroscience.

[4]  Zhong Liu,et al.  Salient region detection based on binocular vision , 2012, 2012 7th IEEE Conference on Industrial Electronics and Applications (ICIEA).

[5]  Yu Zhao,et al.  Self-calibrating smooth pursuit through active efficient coding , 2015, Robotics Auton. Syst..

[6]  Ming-Hsuan Yang,et al.  Top-down visual saliency via joint CRF and dictionary learning , 2012, CVPR.

[7]  Chen Wei-hai Wu Xing-ming Zou Yu-hua Wang Jian-hua Liu Zhong Salient region detection based on stereo vision , 2014 .

[8]  Nuno Vasconcelos,et al.  Discriminant Saliency, the Detection of Suspicious Coincidences, and Applications to Visual Recognition , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Jochen Triesch,et al.  Autonomous learning of smooth pursuit and vergence through active efficient coding , 2014, 4th International Conference on Development and Learning and on Epigenetic Robotics.

[10]  Sasa Kenjeres,et al.  Visualization of turbulence structures reorganization in thermal convection subjected to external magnetic field , 2004, J. Vis..

[11]  J. Colombo The development of visual attention in infancy. , 2001, Annual review of psychology.

[12]  Matthew H Tong,et al.  SUN: Top-down saliency using natural statistics , 2009, Visual cognition.

[13]  Shalabh Bhatnagar,et al.  Natural actor-critic algorithms , 2009, Autom..

[14]  Louise Hainline,et al.  Binocular alignment and vergence in early infancy , 1995, Vision Research.

[15]  G. Bronson,et al.  Infant differences in rate of visual encoding. , 1991, Child development.

[16]  A. L. Yarbus Eye Movements During Perception of Complex Objects , 1967 .

[17]  R. Aslin Development of binocular fixation in human infants. , 1977, Journal of experimental child psychology.

[18]  Jochen Triesch,et al.  Learning of Active Binocular Vision in a Biomechanical Model of the Oculomotor System , 2017, bioRxiv.

[19]  Jochen Triesch,et al.  An active-efficient-coding model of optokinetic nystagmus. , 2016, Journal of vision.

[20]  H. B. Barlow,et al.  Possible Principles Underlying the Transformations of Sensory Messages , 2012 .

[21]  Preeti Verghese,et al.  Where to look next? Eye movements reduce local uncertainty. , 2007, Journal of vision.

[22]  Hao Zhu,et al.  Bottom-up saliency based on weighted sparse coding residual , 2011, ACM Multimedia.

[23]  Andriana Olmos,et al.  A biologically inspired algorithm for the recovery of shading and reflectance images , 2004 .

[24]  Zhaoping Li A saliency map in primary visual cortex , 2002, Trends in Cognitive Sciences.

[25]  Fukui Kazuhiro,et al.  Realistic CG Stereo Image Dataset With Ground Truth Disparity Maps , 2012 .

[26]  John K. Tsotsos,et al.  Saliency, attention, and visual search: an information theoretic approach. , 2009, Journal of vision.

[27]  Christof Koch,et al.  A Model of Saliency-Based Visual Attention for Rapid Scene Analysis , 2009 .

[28]  I. Ohzawa,et al.  Stereoscopic depth discrimination in the visual cortex: neurons ideally suited as disparity detectors. , 1990, Science.

[29]  M. Posner,et al.  Components of visual orienting , 1984 .

[30]  Jochen Triesch,et al.  Imitation learning based on an intrinsic motivation mechanism for efficient coding , 2013, Front. Psychol..

[31]  Qing Yang,et al.  The latency of saccades, vergence, and combined eye movements in children and in adults. , 2002, Investigative ophthalmology & visual science.

[32]  Junle Wang,et al.  Computational Model of Stereoscopic 3D Visual Saliency , 2013, IEEE Transactions on Image Processing.

[33]  L M Optican,et al.  Saccade-vergence interactions in humans. , 1992, Journal of neurophysiology.

[34]  Tim K Marks,et al.  SUN: A Bayesian framework for saliency using natural statistics. , 2008, Journal of vision.

[35]  Angelo Cangelosi,et al.  An open-source simulator for cognitive robotics research: the prototype of the iCub humanoid robot simulator , 2008, PerMIS.

[36]  G. Poggio,et al.  Binocular interaction and depth sensitivity in striate and prestriate cortex of behaving rhesus monkey. , 1977, Journal of neurophysiology.

[37]  Yu Zhao,et al.  A unified model of the joint development of disparity selectivity and vergence control , 2012, 2012 IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL).

[38]  Kazuhiro Fukui,et al.  Realistic CG Stereo Image Dataset with Ground Truth Disparity Maps , 2012 .

[39]  J. Wolfe,et al.  What attributes guide the deployment of visual attention and how do they do it? , 2004, Nature Reviews Neuroscience.

[40]  Wesley E. Snyder,et al.  A Biologically Inspired Algorithm for Shape Recognition , 2006 .

[41]  R. Klein,et al.  Contribution of the Primate Superior Colliculus to Inhibition of Return , 2002, Journal of Cognitive Neuroscience.

[42]  G. Bronson,et al.  Changes in infants' visual scanning across the 2- to 14-week age period. , 1990, Journal of experimental child psychology.

[43]  Yu Zhao,et al.  Autonomous learning of active multi-scale binocular vision , 2013, 2013 IEEE Third Joint International Conference on Development and Learning and Epigenetic Robotics (ICDL).

[44]  Shalabh Bhatnagar,et al.  Natural actorcritic algorithms. , 2009 .

[45]  Liqing Zhang,et al.  Saliency Detection: A Spectral Residual Approach , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.