Autonomous Development of Active Binocular and Motion Vision Through Active Efficient Coding

We present a model for the autonomous and simultaneous learning of active binocular and motion vision. The model is based on the Active Efficient Coding (AEC) framework, a recent generalization of classic efficient coding theories to active perception. The model learns how to efficiently encode the incoming visual signals generated by an object moving in 3-D through sparse coding. Simultaneously, it learns how to produce eye movements that further improve the efficiency of the sensory coding. This learning is driven by an intrinsic motivation to maximize the system's coding efficiency. We test our approach on the humanoid robot iCub using simulations. The model demonstrates self-calibration of accurate object fixation and tracking of moving objects. Our results show that the model keeps improving until it hits physical constraints such as camera or motor resolution, or limits on its internal coding capacity. Furthermore, we show that the emerging sensory tuning properties are in line with results on disparity, motion, and motion-in-depth tuning in the visual cortex of mammals. The model suggests that vergence and tracking eye movements can be viewed as fundamentally having the same objective of maximizing the coding efficiency of the visual system and that they can be learned and calibrated jointly through AEC.

[1]  Ho Ko,et al.  Emergence of Feature-Specific Connectivity in Cortical Microcircuits in the Absence of Visual Experience , 2014, The Journal of Neuroscience.

[2]  Emily A. Cooper,et al.  Stereopsis is adaptive for the natural environment , 2015, Science Advances.

[3]  Jochen Triesch,et al.  A computational model for the joint development of accommodation and vergence control , 2017 .

[4]  Andriana Olmos,et al.  A biologically inspired algorithm for the recovery of shading and reflectance images , 2004 .

[5]  Paul B Hibbard,et al.  Distribution of independent components of binocular natural images. , 2015, Journal of vision.

[6]  G. Orban,et al.  Velocity sensitivity and direction selectivity of neurons in areas V1 and V2 of the monkey: influence of eccentricity. , 1986, Journal of neurophysiology.

[7]  D. J. Felleman,et al.  Receptive-field properties of neurons in middle temporal visual area (MT) of owl monkeys. , 1984, Journal of neurophysiology.

[8]  Yu Zhao,et al.  Self-calibrating smooth pursuit through active efficient coding , 2015, Robotics Auton. Syst..

[9]  H. Ogmen,et al.  Neural network model of short-term horizontal disparity vergence dynamics. , 1997, Vision research.

[10]  Yu Zhao,et al.  Autonomous learning of active multi-scale binocular vision , 2013, 2013 IEEE Third Joint International Conference on Development and Learning and Epigenetic Robotics (ICDL).

[11]  Yu Zhao,et al.  Intrinsically motivated learning of visual motion perception and smooth pursuit , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[12]  W. Geisler,et al.  Optimal disparity estimation in natural stereo images. , 2014, Journal of vision.

[13]  Douglas Tweed,et al.  PII: S0042-6989(97)00002-3 , 2003 .

[14]  Jochen Triesch,et al.  Autonomous learning of cyclovergence control based on Active Efficient Coding , 2018, 2018 Joint IEEE 8th International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob).

[15]  Fabio Solari,et al.  Autonomous learning of disparity-vergence behavior through distributed coding and population reward: Basic mechanisms and real-world conditioning on a robot stereo head , 2015, Robotics Auton. Syst..

[16]  Kenichi Ohki,et al.  Neuronal activity is not required for the initial formation and maturation of visual selectivity , 2015, Nature Neuroscience.

[17]  Manuela Chessa,et al.  A cortical model for binocular vergence control without explicit calculation of disparity , 2010, Neurocomputing.

[18]  Richard T Born,et al.  Joint tuning for direction of motion and binocular disparity in macaque MT is largely separable. , 2013, Journal of neurophysiology.

[19]  Roland Memisevic,et al.  A unified approach to learning depth and motion features , 2014, ICVGIP '14.

[20]  Jochen Triesch,et al.  An active-efficient-coding model of optokinetic nystagmus. , 2016, Journal of vision.

[21]  Jochen Triesch,et al.  Autonomous, self-calibrating binocular vision based on learned attention and active efficient coding , 2017, 2017 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob).

[22]  J. Douglas Crawford,et al.  The motor side of depth vision , 2001, Nature.

[23]  H. B. Barlow,et al.  Possible Principles Underlying the Transformations of Sensory Messages , 2012 .

[24]  Bruce G. Cumming,et al.  Understanding the Cortical Specialization for Horizontal Disparity , 2004, Neural Computation.

[25]  Gregory C DeAngelis,et al.  Neural Representation of Motion-In-Depth in Area MT , 2014, The Journal of Neuroscience.

[26]  R. Freeman,et al.  Oblique effect: a neural basis in the visual cortex. , 2003, Journal of neurophysiology.

[27]  Stefan Schaal,et al.  Natural Actor-Critic , 2003, Neurocomputing.

[28]  Michael C Crair,et al.  Activity-dependent development of visual receptive fields , 2017, Current Opinion in Neurobiology.

[29]  S. Appelle Perception and discrimination as a function of stimulus orientation: the "oblique effect" in man and animals. , 1972, Psychological bulletin.

[30]  Ning Qian,et al.  Computing Stereo Disparity and Motion with Known Binocular Cell Properties , 1994, Neural Computation.

[31]  R. Wong,et al.  Retinal waves and visual system development. , 1999, Annual review of neuroscience.

[32]  G. DeAngelis,et al.  Perceptual “Read-Out” of Conjoined Direction and Disparity Maps in Extrastriate Area MT , 2004, PLoS biology.

[33]  V. V. Krishnan,et al.  A Heuristic Model for the Human Vergence Eye Movement System , 1977, IEEE Transactions on Biomedical Engineering.

[34]  Yu Zhao,et al.  A unified model of the joint development of disparity selectivity and vergence control , 2012, 2012 IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL).

[35]  Jacob Cohen Statistical Power Analysis for the Behavioral Sciences , 1969, The SAGE Encyclopedia of Research Design.

[36]  Jochen Triesch,et al.  A model of the development of anisometropic amblyopia through recruitment of interocular suppression , 2018 .

[37]  Jochen Triesch,et al.  An Active Efficient Coding Model of Binocular Vision Development Under Normal and Abnormal Rearing Conditions , 2018, SAB.

[38]  Stéphane Mallat,et al.  Matching pursuits with time-frequency dictionaries , 1993, IEEE Trans. Signal Process..

[39]  Tushar Chauhan,et al.  Emergence of Binocular Disparity Selectivity through Hebbian Learning , 2017, The Journal of Neuroscience.

[40]  David J. Field,et al.  Emergence of simple-cell receptive field properties by learning a sparse code for natural images , 1996, Nature.

[41]  Yu Zhao,et al.  Robust active binocular vision through intrinsically motivated learning , 2013, Front. Neurorobot..

[42]  F. Attneave Some informational aspects of visual perception. , 1954, Psychological review.

[43]  P O Hoyer,et al.  Independent component analysis applied to feature extraction from colour and stereo images , 2000, Network.

[44]  G. Westheimer,et al.  Disjunctive eye movements , 1961, The Journal of physiology.

[45]  Thaddeus B. Czuba,et al.  Area MT Encodes Three-Dimensional Motion , 2014, The Journal of Neuroscience.

[46]  J. P. Jones,et al.  An evaluation of the two-dimensional Gabor filter model of simple receptive fields in cat striate cortex. , 1987, Journal of neurophysiology.

[47]  Jochen Triesch,et al.  Learning of Active Binocular Vision in a Biomechanical Model of the Oculomotor System , 2017, bioRxiv.

[48]  Yang Liu,et al.  Disparity statistics in natural scenes. , 2008, Journal of vision.

[49]  Nikil Dutt,et al.  3D Visual Response Properties of MSTd Emerge from an Efficient, Sparse Population Code , 2016, The Journal of Neuroscience.

[50]  M. Feller,et al.  Mechanisms underlying development of visual maps and receptive fields. , 2008, Annual review of neuroscience.

[51]  I. Ohzawa,et al.  On the neurophysiological organization of binocular vision , 1990, Vision Research.

[52]  Shalabh Bhatnagar,et al.  Natural actor-critic algorithms , 2009, Autom..

[53]  Yuguo Yu,et al.  Preference of sensory neural coding for 1/f signals. , 2005, Physical review letters.

[54]  A. Parker,et al.  Quantitative analysis of the responses of V1 neurons to horizontal disparity in dynamic random-dot stereograms. , 2002, Journal of neurophysiology.

[55]  Gregory C DeAngelis,et al.  Coding of horizontal disparity and velocity by MT neurons in the alert macaque. , 2003, Journal of neurophysiology.

[56]  Bruno A Olshausen,et al.  Sparse coding of sensory inputs , 2004, Current Opinion in Neurobiology.

[57]  Y. Chino,et al.  Postnatal Development of Binocular Disparity Sensitivity in Neurons of the Primate Visual Cortex , 1997, The Journal of Neuroscience.

[58]  Terrence J. Sejnowski,et al.  The “independent components” of natural scenes are edge filters , 1997, Vision Research.