Learning of Active Binocular Vision in a Biomechanical Model of the Oculomotor System

We present a model for the autonomous learning of active binocular vision using a recently developed biome-chanical model of the human oculomotor system. The model is formulated in the Active Efficient Coding (AEC) framework, a recent generalization of classic efficient coding theories to active perception. The model simultaneously learns how to efficiently encode binocular images and how to generate accurate vergence eye movements that facilitate efficient encoding of the visual input. In order to resolve the redundancy problem arising from the actuation of the eyes through antagonistic muscle pairs, we consider the metabolic costs associated with eye movements. We show that the model successfully learns to trade off vergence accuracy against the associated metabolic costs, producing high fidelity vergence eye movements obeying Sherrington’s law of reciprocal innervation.

[1]  Yu Zhao,et al.  A unified model of the joint development of disparity selectivity and vergence control , 2012, 2012 IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL).

[2]  W. Swanson,et al.  Color matches in diseased eyes with good acuity: detection of deficits in cone optical density and in chromatic discrimination. , 1995, Journal of the Optical Society of America. A, Optics, image science, and vision.

[3]  Ayman Habib,et al.  OpenSim: Open-Source Software to Create and Analyze Dynamic Simulations of Movement , 2007, IEEE Transactions on Biomedical Engineering.

[4]  Joseph C. Besharse,et al.  The retina and its disorders , 2011 .

[5]  D. Maurer,et al.  Multiple sensitive periods in human visual development: evidence from visually deprived children. , 2005, Developmental psychobiology.

[6]  David J. Field,et al.  Emergence of simple-cell receptive field properties by learning a sparse code for natural images , 1996, Nature.

[7]  Yu Zhao,et al.  Robust active binocular vision through intrinsically motivated learning , 2013, Front. Neurorobot..

[8]  H. B. Barlow,et al.  Possible Principles Underlying the Transformations of Sensory Messages , 2012 .

[9]  Jochen Triesch,et al.  OpenEyeSim - A platform for biomechanical modeling of oculomotor control , 2014, 4th International Conference on Development and Learning and on Epigenetic Robotics.

[10]  C. Milleret,et al.  Origins of strabismus and loss of binocular vision , 2014, Front. Integr. Neurosci..

[11]  M.A. Wiering,et al.  Reinforcement Learning in Continuous Action Spaces , 2007, 2007 IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning.

[12]  E. Hartmann Perceptual Development: Visual, Auditory, and Speech Perception in Infancy , 1999 .

[13]  Andriana Olmos,et al.  A biologically inspired algorithm for the recovery of shading and reflectance images , 2004 .

[14]  Paul V McGraw,et al.  Understanding the Neural Basis of Amblyopia , 2004, The Neuroscientist : a review journal bringing neurobiology, neurology and psychiatry.

[15]  David C. Knill,et al.  Stereopsis and amblyopia: A mini-review , 2015, Vision Research.

[16]  Anne Moskowitz,et al.  Normal and Abnormal Visual Development , 2004 .

[17]  David J. Field,et al.  Sparse coding with an overcomplete basis set: A strategy employed by V1? , 1997, Vision Research.

[18]  Inhibition as a Coordinative Factor , 2006 .

[19]  Gerald Tesauro,et al.  Temporal Difference Learning and TD-Gammon , 1995, J. Int. Comput. Games Assoc..

[20]  Jochen Triesch,et al.  Autonomous learning of smooth pursuit and vergence through active efficient coding , 2014, 4th International Conference on Development and Learning and on Epigenetic Robotics.

[21]  L. Pinneo On noise in the nervous system. , 1966, Psychological review.

[22]  Yu Zhao,et al.  Autonomous learning of active multi-scale binocular vision , 2013, 2013 IEEE Third Joint International Conference on Development and Learning and Epigenetic Robotics (ICDL).

[23]  Jochen Triesch,et al.  OpenEyeSim: A biomechanical model for simulation of closed-loop visual perception. , 2016, Journal of vision.

[24]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[25]  Douglas S. Ramsay,et al.  The Emergence of Fear on the Visual Cliff , 1978 .

[26]  Stéphane Mallat,et al.  Matching pursuits with time-frequency dictionaries , 1993, IEEE Trans. Signal Process..

[27]  B. Welford Note on a Method for Calculating Corrected Sums of Squares and Products , 1962 .

[28]  Kelvin E. Jones,et al.  Sources of signal-dependent noise during isometric force production. , 2002, Journal of neurophysiology.

[29]  E. Tronick Stimulus control and the growth of the infant’s effective visual field* , 1972 .

[30]  S Paajanen,et al.  Crying, feeding and sleeping patterns in 1 to 12-month-old infants. , 1990, Child: care, health and development.

[31]  Yu Zhao,et al.  Self-calibrating smooth pursuit through active efficient coding , 2015, Robotics Auton. Syst..

[32]  Robert Babuska,et al.  A Survey of Actor-Critic Reinforcement Learning: Standard and Natural Policy Gradients , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[33]  Philip E. Martin,et al.  A Model of Human Muscle Energy Expenditure , 2003, Computer methods in biomechanics and biomedical engineering.