Autonomous learning of smooth pursuit and vergence through active efficient coding

We present a model for the autonomous and simultaneous learning of smooth pursuit and vergence eye movements based on principles of efficient coding. The model accounts for the joint development of visual encoding and eye movement control. Sparse coding models encode the incoming data and capture the statistics of the input in spatio-temporal basis functions while a reinforcement learner generates eye movements to optimise the efficiency of the encoding. We consider the embodiment of the approach in the iCub simulator and demonstrate the emergence of a self-calibrating smooth pursuit and vergence behaviour. Unlike standard computer vision approaches, it is driven by the interaction between sensory encoding and eye movements. Interestingly, our analysis shows that the emerging representations learned by this model are in line with results on velocity and disparity tuning properties of neurons in visual cortex.

[1]  Ning Qian,et al.  Computing Stereo Disparity and Motion with Known Binocular Cell Properties , 1994, Neural Computation.

[2]  H. Collewijn,et al.  Eye movements and stereopsis during dichoptic viewing of moving random-dot stereograms , 1985, Vision Research.

[3]  J. Henderson Human gaze control during real-world scene perception , 2003, Trends in Cognitive Sciences.

[4]  R. Freeman,et al.  Oblique effect: a neural basis in the visual cortex. , 2003, Journal of neurophysiology.

[5]  W. Prinz Perception and Action Planning , 1997 .

[6]  Néstor Parga,et al.  SENSORY CODING: INFORMATION MAXIMIZATION AND REDUNDANCY REDUCTION , 1999 .

[7]  D. Hubel,et al.  Receptive fields and functional architecture of monkey striate cortex , 1968, The Journal of physiology.

[8]  K. May,et al.  Perceived Direction of Motion Determined by Adaptation to Static Binocular Images , 2012, Current Biology.

[9]  J. P. Jones,et al.  An evaluation of the two-dimensional Gabor filter model of simple receptive fields in cat striate cortex. , 1987, Journal of neurophysiology.

[10]  Yu Zhao,et al.  Intrinsically motivated learning of visual motion perception and smooth pursuit , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[11]  David J. Field,et al.  Sparse coding with an overcomplete basis set: A strategy employed by V1? , 1997, Vision Research.

[12]  Terrence J. Sejnowski,et al.  The “independent components” of natural scenes are edge filters , 1997, Vision Research.

[13]  D. Hubel,et al.  Receptive fields, binocular interaction and functional architecture in the cat's visual cortex , 1962, The Journal of physiology.

[14]  Li Zhaoping,et al.  Theoretical understanding of the early visual processes by data compression and data selection , 2006, Network.

[15]  F. A. Miles,et al.  Vergence eye movements in response to binocular disparity without depth perception , 1997, Nature.

[16]  Stéphane Mallat,et al.  Matching pursuits with time-frequency dictionaries , 1993, IEEE Trans. Signal Process..

[17]  P. Lennie Receptive fields , 2003, Current Biology.

[18]  Terrence J. Sejnowski,et al.  Double-Gabor Filters Are Independent Components of Small Translation-Invariant Image Patches , 2013, Neural Computation.

[19]  Stefano Ramat,et al.  Interaural translational VOR: suppression, enhancement, and cognitive control. , 2005, Journal of neurophysiology.

[20]  Yu Zhao,et al.  A unified model of the joint development of disparity selectivity and vergence control , 2012, 2012 IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL).

[21]  Eero P. Simoncelli Vision and the statistics of the visual environment , 2003, Current Opinion in Neurobiology.

[22]  Yu Zhao,et al.  Robust active binocular vision through intrinsically motivated learning , 2013, Front. Neurorobot..