Visualizing and Understanding Vision System

How the human vision system addresses the object identity-preserving recognition problem is largely unknown. Here, we use a vision recognition-reconstruction network (RRN) to investigate the development, recognition, learning and forgetting mechanisms, and achieve similar characteristics to electrophysiological measurements in monkeys. First, in network development study, the RRN also experiences critical developmental stages characterized by specificities in neuron types, synapse and activation patterns, and visual task performance from the early stage of coarse salience map recognition to mature stage of fine structure recognition. In digit recognition study, we witness that the RRN could maintain object invariance representation under various viewing conditions by coordinated adjustment of responses of population neurons. And such concerted population responses contained untangled object identity and properties information that could be accurately extracted via high-level cortices or even a simple weighted summation decoder. In the learning and forgetting study, novel structure recognition is implemented by adjusting entire synapses in low magnitude while pattern specificities of original synaptic connectivity are preserved, which guaranteed a learning process without disrupting the existing functionalities. This work benefits the understanding of the human visual processing mechanism and the development of human-like machine intelligence.

[1]  H. Bülthoff,et al.  Effects of temporal association on recognition memory , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[2]  Nobuko Mataga,et al.  Experience-Dependent Pruning of Dendritic Spines in Visual Cortex by Tissue Plasminogen Activator , 2004, Neuron.

[3]  Razvan Pascanu,et al.  Overcoming catastrophic forgetting in neural networks , 2016, Proceedings of the National Academy of Sciences.

[4]  Geoffrey E. Hinton,et al.  Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[5]  J. Maunsell,et al.  Anterior inferotemporal neurons of monkeys engaged in object recognition can be highly sensitive to object retinal position. , 2003, Journal of neurophysiology.

[6]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  David D. Cox,et al.  'Breaking' position-invariant object recognition , 2005, Nature Neuroscience.

[8]  C. Gilbert,et al.  Rapid Axonal Sprouting and Pruning Accompany Functional Reorganization in Primary Visual Cortex , 2009, Neuron.

[9]  Nicole C. Rust,et al.  Signals in inferotemporal and perirhinal cortex suggest an “untangling” of visual target information , 2013, Nature Neuroscience.

[10]  L. Nadel,et al.  Decay happens: the role of active forgetting in memory , 2013, Trends in Cognitive Sciences.

[11]  David D. Cox,et al.  Opinion TRENDS in Cognitive Sciences Vol.11 No.8 Untangling invariant object recognition , 2022 .

[12]  Jeffrey D Schall,et al.  The neural selection and control of saccades by the frontal eye field. , 2002, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[13]  Dwight J. Kravitz,et al.  How position dependent is visual object recognition? , 2008, Trends in Cognitive Sciences.

[14]  Feng Qi,et al.  Human-like general language processing , 2020, ArXiv.

[15]  Paul W. Frankland,et al.  Hippocampal neurogenesis and forgetting , 2013, Trends in Neurosciences.

[16]  J. Mellor,et al.  Coordinated activation of distinct Ca2+ sources and metabotropic glutamate receptors encodes Hebbian synaptic plasticity , 2016, Nature Communications.

[17]  David J. Freedman,et al.  Dynamic population coding of category information in inferior temporal and prefrontal cortex. , 2008, Journal of neurophysiology.

[18]  James J. DiCarlo,et al.  How Does the Brain Solve Visual Object Recognition? , 2012, Neuron.

[19]  Karl J. Friston The free-energy principle: a unified brain theory? , 2010, Nature Reviews Neuroscience.

[20]  David D. Cox,et al.  Does Learned Shape Selectivity in Inferior Temporal Cortex Automatically Generalize Across Retinal Position? , 2008, The Journal of Neuroscience.

[21]  Hannah Monyer,et al.  GABAergic Interneurons Shape the Functional Maturation of the Cortex , 2013, Neuron.

[22]  J L Gallant,et al.  Sparse coding and decorrelation in primary visual cortex during natural vision. , 2000, Science.

[23]  Luca Maria Gambardella,et al.  Max-pooling convolutional neural networks for vision-based hand gesture recognition , 2011, 2011 IEEE International Conference on Signal and Image Processing Applications (ICSIPA).

[24]  S. J. Martin,et al.  Synaptic plasticity and memory: an evaluation of the hypothesis. , 2000, Annual review of neuroscience.

[25]  Isaac Meilijson,et al.  Neuronal Regulation: A Mechanism for Synaptic Pruning During Brain Maturation , 1999, Neural Computation.

[26]  A. Dale,et al.  Functional analysis of primary visual cortex (V1) in humans. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[27]  Feng Qi,et al.  Human-like machine thinking: Language guided imagination , 2019, ArXiv.

[28]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[29]  M. Giustetto,et al.  Synaptic Pruning by Microglia Is Necessary for Normal Brain Development , 2011, Science.

[30]  Tomaso Poggio,et al.  Fast Readout of Object Identity from Macaque Inferior Temporal Cortex , 2005, Science.

[31]  Pinglei Bao,et al.  The representation of colored objects in macaque color patches , 2017, bioRxiv.

[32]  A. Pouget,et al.  Neural correlations, population coding and computation , 2006, Nature Reviews Neuroscience.

[33]  Doris Y. Tsao,et al.  Functional Compartmentalization and Viewpoint Generalization Within the Macaque Face-Processing System , 2010, Science.

[34]  Stefan Treue,et al.  Feature-based attention influences motion processing gain in macaque visual cortex , 1999, Nature.

[35]  Rob Fergus,et al.  Visualizing and Understanding Convolutional Networks , 2013, ECCV.

[36]  A. Ishai,et al.  Distributed and Overlapping Representations of Faces and Objects in Ventral Temporal Cortex , 2001, Science.

[37]  E. Bienenstock,et al.  Theory for the development of neuron selectivity: orientation specificity and binocular interaction in visual cortex , 1982, The Journal of neuroscience : the official journal of the Society for Neuroscience.