A Structured Model of Video Reproduces Primary Visual Cortical Organisation

The visual system must learn to infer the presence of objects and features in the world from the images it encounters, and as such it must, either implicitly or explicitly, model the way these elements interact to create the image. Do the response properties of cells in the mammalian visual system reflect this constraint? To address this question, we constructed a probabilistic model in which the identity and attributes of simple visual elements were represented explicitly and learnt the parameters of this model from unparsed, natural video sequences. After learning, the behaviour and grouping of variables in the probabilistic model corresponded closely to functional and anatomical properties of simple and complex cells in the primary visual cortex (V1). In particular, feature identity variables were activated in a way that resembled the activity of complex cells, while feature attribute variables responded much like simple cells. Furthermore, the grouping of the attributes within the model closely parallelled the reported anatomical grouping of simple cells in cat V1. Thus, this generative model makes explicit an interpretation of complex and simple cells as elements in the segmentation of a visual scene into basic independent features, along with a parametrisation of their moment-by-moment appearances. We speculate that such a segmentation may form the initial stage of a hierarchical system that progressively separates the identity and appearance of more articulated visual elements, culminating in view-invariant object recognition.

[1]  F. Attneave Some informational aspects of visual perception. , 1954, Psychological review.

[2]  V. Mountcastle Modality and topographic properties of single neurons of cat's somatic sensory cortex. , 1957, Journal of neurophysiology.

[3]  D. Hubel,et al.  RECEPTIVE FIELDS OF CELLS IN STRIATE CORTEX OF VERY YOUNG, VISUALLY INEXPERIENCED KITTENS. , 1963, Journal of neurophysiology.

[4]  D. N. Spinelli,et al.  Visual Experience Modifies Distribution of Horizontally and Vertically Oriented Receptive Fields in Cats , 1970, Science.

[5]  G. F. Cooper,et al.  Development of the Brain depends on the Visual Environment , 1970, Nature.

[6]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[7]  Roman Bek,et al.  Discourse on one way in which a quantum-mechanics language on the classical logical base can be built up , 1978, Kybernetika.

[8]  D. Pollen,et al.  Phase relationships between adjacent simple cells in the visual cortex. , 1981, Science.

[9]  E H Adelson,et al.  Spatiotemporal energy models for the perception of motion. , 1985, Journal of the Optical Society of America. A, Optics and image science.

[10]  R. Linsker From basic network principles to neural architecture (series) , 1986 .

[11]  R Linsker,et al.  From basic network principles to neural architecture: emergence of orientation-selective cells. , 1986, Proceedings of the National Academy of Sciences of the United States of America.

[12]  de Ruyter van Steveninck,et al.  Real-time performance of a movement-sensitive neuron in the blowfly visual system , 1986 .

[13]  J. Duysens,et al.  Response properties of area 17 neurons in cats reared in stroboscopic illumination. , 1987, Journal of neurophysiology.

[14]  I. Biederman Recognition-by-components: a theory of human image understanding. , 1987, Psychological review.

[15]  William Bialek,et al.  Real-time performance of a movement-sensitive neuron in the blowfly visual system: coding and information transfer in short spike sequences , 1988, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[16]  J. Duysens,et al.  Experimental myopia in cats reared in stroboscopic illumination , 1989, Vision Research.

[17]  李幼升,et al.  Ph , 1989 .

[18]  K. Miller,et al.  Ocular dominance column development: analysis and simulation. , 1989, Science.

[19]  David Willshaw,et al.  Application of the elastic net algorithm to the formation of ocular dominance stripes , 1990 .

[20]  Peter Földiák,et al.  Learning Invariance from Transformation Sequences , 1991, Neural Comput..

[21]  I. Ohzawa,et al.  Spatiotemporal organization of simple-cell receptive fields in the cat's striate cortex. I. General characteristics and postnatal development. , 1993, Journal of neurophysiology.

[22]  David J. Field,et al.  What Is the Goal of Sensory Coding? , 1994, Neural Computation.

[23]  David J. Field,et al.  Emergence of simple-cell receptive field properties by learning a sparse code for natural images , 1996, Nature.

[24]  N. Swindale The development of topography in the visual cortex: a review of models. , 1996, Network.

[25]  David J. Field,et al.  Sparse coding with an overcomplete basis set: A strategy employed by V1? , 1997, Vision Research.

[26]  Terrence J. Sejnowski,et al.  The “independent components” of natural scenes are edge filters , 1997, Vision Research.

[27]  J. V. van Hateren,et al.  Independent component filters of natural images compared with simple cells in primary visual cortex , 1998, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[28]  Geoffrey E. Hinton,et al.  A View of the Em Algorithm that Justifies Incremental, Sparse, and other Variants , 1998, Learning in Graphical Models.

[29]  D. Ruderman,et al.  Independent component analysis of natural image sequences yields spatio-temporal filters similar to simple cells in primary visual cortex , 1998, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[30]  Bartlett W. Mel,et al.  Translation-Invariant Orientation Tuning in Visual “Complex” Cells Could Derive from Intradendritic Computations , 1998, The Journal of Neuroscience.

[31]  A. L. Humphrey,et al.  Strobe rearing reduces direction selectivity in area 17 by altering spatiotemporal receptive-field structure. , 1998, Journal of neurophysiology.

[32]  Martin J. Wainwright,et al.  Scale Mixtures of Gaussians and the Statistics of Natural Images , 1999, NIPS.

[33]  K. Jarrod Millman,et al.  Learning Sparse Codes with a Mixture-of-Gaussians Prior , 1999, NIPS.

[34]  F. Sengpiel,et al.  Influence of experience on orientation maps in cat visual cortex , 1999, Nature Neuroscience.

[35]  K. Miller,et al.  Is the development of orientation selectivity instructed by activity? , 1999, Journal of neurobiology.

[36]  Charles M. Bishop Variational principal components , 1999 .

[37]  I. Ohzawa,et al.  Functional Micro-Organization of Primary Visual Cortex: Receptive Field Analysis of Nearby Neurons , 1999, The Journal of Neuroscience.

[38]  Frances S. Chance,et al.  Complex cells as cortically amplified simple cells , 1999, Nature Neuroscience.

[39]  P O Hoyer,et al.  Independent component analysis applied to feature extraction from colour and stereo images , 2000, Network.

[40]  Joshua B. Tenenbaum,et al.  Separating Style and Content with Bilinear Models , 2000, Neural Computation.

[41]  Aapo Hyvärinen,et al.  Emergence of Phase- and Shift-Invariant Features by Decomposition of Natural Images into Independent Feature Subspaces , 2000, Neural Computation.

[42]  Brendan J. Frey,et al.  Learning flexible sprites in video layers , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[43]  Eero P. Simoncelli,et al.  Natural signal statistics and sensory gain control , 2001, Nature Neuroscience.

[44]  H. Barlow The exploitation of regularities in the environment by the brain. , 2001, The Behavioral and brain sciences.

[45]  D. Long Probabilistic Models of the Brain. , 2002 .

[46]  BsnNr C. Srorn,et al.  CLASSIFYING SIMPLE AND COMPLEX CELLS ON THE BASIS OF RESPONSE MODULATION , 2002 .

[47]  D. Ringach Spatial structure and symmetry of simple-cell receptive fields in macaque primary visual cortex. , 2002, Journal of neurophysiology.

[48]  Terrence J. Sejnowski,et al.  Slow Feature Analysis: Unsupervised Learning of Invariances , 2002, Neural Computation.

[49]  Dmitri B. Chklovskii,et al.  Wiring Optimization in Cortical Circuits , 2002, Neuron.

[50]  K. Miller,et al.  Opponent Inhibition A Developmental Model of Layer 4 of the Neocortical Circuit , 2002, Neuron.

[51]  D. Ringach,et al.  On the classification of simple and complex cells , 2002, Vision Research.

[52]  Matthew J. Beal Variational algorithms for approximate Bayesian inference , 2003 .

[53]  Tai Sing Lee,et al.  Hierarchical Bayesian inference in the visual cortex. , 2003, Journal of the Optical Society of America. A, Optics, image science, and vision.

[54]  Aapo Hyvärinen,et al.  Simple-Cell-Like Receptive Fields Maximize Temporal Coherence in Natural Video , 2003, Neural Computation.

[55]  Konrad P. Körding,et al.  The world from a cat’s perspective – statistics of natural videos , 2003, Biological Cybernetics.

[56]  Konrad Paul Kording,et al.  How are complex cell properties adapted to the statistics of natural stimuli? , 2004, Journal of neurophysiology.

[57]  Christopher K. I. Williams,et al.  Greedy Learning of Multiple Objects in Images Using Robust Statistics and Factorial Learning , 2004, Neural Computation.

[58]  C. Malsburg Self-organization of orientation sensitive cells in the striate cortex , 2004, Kybernetik.

[59]  David J. C. MacKay,et al.  Information Theory, Inference, and Learning Algorithms , 2004, IEEE Transactions on Information Theory.

[60]  Rajesh P. N. Rao,et al.  Bilinear Sparse Coding for Invariant Vision , 2005, Neural Computation.

[61]  Long Zhu,et al.  A Hierarchical Compositional System for Rapid Object Detection , 2005, NIPS.

[62]  Laurenz Wiskott,et al.  Slow feature analysis yields a rich repertoire of complex cell properties. , 2005, Journal of vision.

[63]  J. Touryan,et al.  Spatial Structure of Complex Cell Receptive Fields Measured with Natural Images , 2005, Neuron.

[64]  Antonio Torralba,et al.  Describing Visual Scenes using Transformed Dirichlet Processes , 2005, NIPS.

[65]  Michael S. Lewicki,et al.  A Hierarchical Bayesian Model for Learning Nonlinear Statistical Regularities in Nonstationary Natural Signals , 2005, Neural Computation.

[66]  Eero P. Simoncelli,et al.  Spatiotemporal Elements of Macaque V1 Receptive Fields , 2005, Neuron.

[67]  Julian Eggert,et al.  Learning viewpoint invariant object representations using a temporal coherence principle , 2005, Biological Cybernetics.

[68]  Karl J. Friston,et al.  A theory of cortical responses , 2005, Philosophical Transactions of the Royal Society B: Biological Sciences.

[69]  Michael S. Lewicki,et al.  Is Early Vision Optimized for Extracting Higher-order Dependencies? , 2005, NIPS.

[70]  R. Baddeley,et al.  Is the early visual system optimised to be energy efficient? , 2005, Network.

[71]  Eero P. Simoncelli,et al.  Spike-triggered neural characterization. , 2006, Journal of vision.

[72]  Long Zhu,et al.  Unsupervised Learning of a Probabilistic Grammar for Object Detection and Parsing , 2006, NIPS.

[73]  Richard S. Zemel,et al.  Learning Parts-Based Representations of Data , 2006, J. Mach. Learn. Res..

[74]  A. Yuille,et al.  Opinion TRENDS in Cognitive Sciences Vol.10 No.7 July 2006 Special Issue: Probabilistic models of cognition Vision as Bayesian inference: analysis by synthesis? , 2022 .

[75]  Jérôme Ribot,et al.  Orientation-restricted continuous visual exposure induces marked reorganization of orientation maps in early life , 2006, NeuroImage.

[76]  Martin Rehn,et al.  A network that uses few active neurones to code visual input predicts the diverse shapes of cortical receptive fields , 2007, Journal of Computational Neuroscience.

[77]  Terrence J. Sejnowski,et al.  Soft Mixer Assignment in a Hierarchical Generative Model of Natural Scene Statistics , 2006, Neural Computation.

[78]  Richard E. Turner,et al.  On Sparsity and Overcompleteness in Image Models , 2007, NIPS.

[79]  Richard E. Turner,et al.  A Maximum-Likelihood Interpretation for Slow Feature Analysis , 2007, Neural Computation.

[80]  Antonio Torralba,et al.  Describing Visual Scenes Using Transformed Objects and Parts , 2008, International Journal of Computer Vision.

[81]  Jörg Lücke,et al.  A Dynamical Model for Receptive Field Self-organization in V1 Cortical Columns , 2007, ICANN.

[82]  Feng Qi Han,et al.  Excitatory and suppressive receptive field subunits in awake monkey primary visual cortex (V1) , 2007, Proceedings of the National Academy of Sciences.

[83]  Jörg Lücke,et al.  Maximal Causes for Non-linear Component Extraction , 2008, J. Mach. Learn. Res..

[84]  Edmund T. Rolls,et al.  Learning transform invariant object recognition in the visual system with multiple stimuli present during training , 2008, Neural Networks.

[85]  Niko Wilbert,et al.  Invariant Object Recognition with Slow Feature Analysis , 2008, ICANN.

[86]  P. Z. Marmarelis,et al.  Analysis of Physiological Systems: The White-Noise Approach , 2011 .

[87]  Richard E. Turner,et al.  Two problems with variational expectation maximisation for time-series models , 2011 .

[88]  RussLL L. Ds Vnlos,et al.  SPATIAL FREQUENCY SELECTIVITY OF CELLS IN MACAQUE VISUAL CORTEX , 2022 .