A new framework for understanding vision from the perspective of the primary visual cortex

Visual attention selects only a tiny fraction of visual input informationfor further processing. Selection starts in the primary visual cortex (V1), which creates abottom-up saliency map to guide the fovea to selected visual locations via gaze shifts.This motivates a new framework that views visionas consisting of encoding, selection, and decoding stages, placingselection on center stage. It suggests a massive loss of non-selectedinformation from V1 downstream along the visual pathway.Hence, feedback from downstream visual cortical areas to V1 for better decoding (recognition),through analysis-by-synthesis, should query for additional information and be mainly directed atthe foveal region. Accordingly, non-foveal vision is not only poorer in spatial resolution,but also more susceptible to many illusions.

[1]  Peter H. Schiller,et al.  Neural Control of Visually Guided Eye Movements , 2012 .

[2]  D. Hubel,et al.  Ferrier lecture - Functional architecture of macaque monkey visual cortex , 1977, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[3]  Li Zhaoping,et al.  Theoretical understanding of the early visual processes by data compression and data selection , 2006, Network.

[4]  J. Wolfe,et al.  Preattentive Object Files: Shapeless Bundles of Basic Features , 1997, Vision Research.

[5]  B. G. Cumming,et al.  Responses of primary visual cortical neurons to binocular disparity without depth perception , 1997, Nature.

[6]  I. Rentschler,et al.  Peripheral vision and pattern recognition: a review. , 2011, Journal of vision.

[7]  Richard H. Chen,et al.  Retinotopic patterns of functional connectivity between V1 and large-scale brain networks during resting fixation , 2017, NeuroImage.

[8]  Sheng He,et al.  Temporally flexible feedback signal to foveal cortex for peripheral object recognition , 2016, Proceedings of the National Academy of Sciences.

[9]  Leslie G. Ungerleider,et al.  Organization of visual inputs to the inferior temporal and posterior parietal cortex in macaques , 1991, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[10]  N. Logothetis,et al.  Supporting Online Material for Attention But Not Awareness Modulates the BOLD Signal in the Human V 1 During Binocular Suppression , 2011 .

[11]  Chang-Bing Huang,et al.  Transitions between Central and Peripheral Vision Create Spatial/Temporal Distortions: A Hypothesis Concerning the Perceived Break of the Curveball , 2010, PloS one.

[12]  R. Rosenholtz,et al.  A summary-statistic representation in peripheral vision explains visual crowding. , 2009, Journal of vision.

[13]  Zhaoping Li,et al.  Efficient stereo coding in the multiscale representation , 1994 .

[14]  P. H. Schiller,et al.  The Hermann Grid Illusion Revisited , 2005, Perception.

[15]  Byron M. Yu,et al.  Cortical Areas Interact through a Communication Subspace , 2019, Neuron.

[16]  Li Zhaoping,et al.  Gaze capture by eye-of-origin singletons: interdependence with awareness. , 2012, Journal of vision.

[17]  D. Mackay,et al.  Towards an information-flow model of human behaviour. , 1956, British journal of psychology.

[18]  Kazunori O’Hashi,et al.  Mechanisms for shaping receptive field in monkey area TE. , 2017, Journal of neurophysiology.

[19]  K. May,et al.  Perceived Direction of Motion Determined by Adaptation to Static Binocular Images , 2012, Current Biology.

[20]  Stuart Anstis,et al.  The furrow illusion: peripheral motion becomes aligned with stationary contours. , 2012, Journal of vision.

[21]  Zhaoping Li,et al.  Feature-specific interactions in salience from combined feature contrasts: evidence for a bottom-up saliency map in V1. , 2007, Journal of vision.

[22]  Li Zhaoping,et al.  Reversed Depth in Anticorrelated Random-Dot Stereograms and the Central-Peripheral Difference in Visual Inference , 2018, Perception.

[23]  L. Zhaoping Olfactory object recognition, segmentation, adaptation, target seeking, and discrimination by the network of the olfactory bulb and cortex: computational model and experimental data , 2016, Current Opinion in Behavioral Sciences.

[24]  Johannes J. Fahrenfort,et al.  Masking Disrupts Reentrant Processing in Human Visual Cortex , 2007, Journal of Cognitive Neuroscience.

[25]  Eero P. Simoncelli,et al.  Metamers of the ventral stream , 2011, Nature Neuroscience.

[26]  Ikuya Murakami,et al.  The effects of eccentricity and retinal illuminance on the illusory motion seen in a stationary luminance gradient , 2008, Vision Research.

[27]  D. V. van Essen,et al.  Processing of color, form and disparity information in visual areas VP and V2 of ventral extrastriate cortex in the macaque monkey , 1986, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[28]  L. Zhaoping Brains studying brains: look before you think in vision. , 2016, Physical biology.

[29]  M. Carrasco Visual attention: The past 25 years , 2011, Vision Research.

[30]  Li Zhaoping,et al.  Efficient Coding Theory Predicts a Tilt Aftereffect from Viewing Untilted Patterns , 2016, Current Biology.

[31]  David Whitney,et al.  Multi-level Crowding and the Paradox of Object Recognition in Clutter , 2018, Current Biology.

[32]  Joan Bruna,et al.  Intriguing properties of neural networks , 2013, ICLR.

[33]  Li Zhaoping,et al.  Feedback from higher to lower visual areas for visual recognition may be weaker in the periphery: Glimpses from the perception of brief dichoptic stimuli , 2017, Vision Research.

[34]  C. Koch,et al.  Are we aware of neural activity in primary visual cortex? , 1995, Nature.

[35]  R. Desimone,et al.  Neural mechanisms of selective visual attention. , 1995, Annual review of neuroscience.

[36]  A. Treisman,et al.  A feature-integration theory of attention , 1980, Cognitive Psychology.

[37]  A. Yuille,et al.  Opinion TRENDS in Cognitive Sciences Vol.10 No.7 July 2006 Special Issue: Probabilistic models of cognition Vision as Bayesian inference: analysis by synthesis? , 2022 .

[38]  R. Weale Vision. A Computational Investigation Into the Human Representation and Processing of Visual Information. David Marr , 1983 .

[39]  Mitsuo Kawato,et al.  A forward-inverse optics model of reciprocal connections between visual cortical areas , 1993 .

[40]  S. M. Axstis PHI MOVEMENT AS A SUBTRACTION PROCESS , 1970 .

[41]  Zhaoping Li A saliency map in primary visual cortex , 2002, Trends in Cognitive Sciences.

[42]  Hualou Liang,et al.  Incremental Integration of Global Contours through Interplay between Visual Cortical Areas , 2014, Neuron.

[43]  T. Isa,et al.  Saccade control after V1 lesion revisited , 2009, Current Opinion in Neurobiology.

[44]  C. Gross,et al.  Visual topography of V2 in the macaque , 1981, The Journal of comparative neurology.

[45]  Ohad Ben-Shahar,et al.  Pop-out in visual search of moving targets in the archer fish , 2015, Nature Communications.

[46]  D. Levi,et al.  Visual crowding: a fundamental limit on conscious perception and object recognition , 2011, Trends in Cognitive Sciences.

[47]  Nathalie Guyader,et al.  Interference with Bottom-Up Feature Detection by Higher-Level Object Recognition , 2007, Current Biology.

[48]  Z Li,et al.  Contextual influences in V1 as a basis for pop out and asymmetry in visual search. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[49]  Zhaoping Li,et al.  From the optic tectum to the primary visual cortex: migration through evolution of the saliency map for exogenous attentional guidance , 2016, Current Opinion in Neurobiology.

[50]  D. H. Kelly,et al.  Information capacity of a single retinal channel , 1962, IRE Trans. Inf. Theory.

[51]  Laurent Itti,et al.  Superior colliculus encodes visual saliency before the primary visual cortex , 2017, Proceedings of the National Academy of Sciences.

[52]  Zhaoping Li,et al.  Bottom-up saliency and top-down learning in the primary visual cortex of monkeys , 2018, Proceedings of the National Academy of Sciences.

[53]  G. C. Sziklai Some studies in the speed of visual perception , 1957 .

[54]  J. Allman,et al.  Stimulus specific responses from beyond the classical receptive field: neurophysiological mechanisms for local-global comparisons in visual neurons. , 1985, Annual review of neuroscience.

[55]  Thomas Serre,et al.  A feedforward architecture accounts for rapid categorization , 2007, Proceedings of the National Academy of Sciences.

[56]  N. Kanwisher,et al.  Feedback of pVisual Object Information to Foveal Retinotopic Cortex , 2008, Nature Neuroscience.

[57]  L. Zhaoping Attention capture by eye of origin singletons even without awareness--a hallmark of a bottom-up saliency map in the primary visual cortex. , 2008, Journal of vision.

[58]  Hualou Liang,et al.  Synergistic Processing of Visual Contours across Cortical Layers in V1 and V2 , 2017, Neuron.

[59]  J. Enns Object substitution and its relation to other forms of visual masking , 2004, Vision Research.

[60]  David Cox,et al.  Recurrent computations for visual pattern completion , 2017, Proceedings of the National Academy of Sciences.

[61]  Eero P. Simoncelli,et al.  Selectivity and tolerance for visual texture in macaque V2 , 2016, Proceedings of the National Academy of Sciences.

[62]  C. Chabris,et al.  Gorillas in Our Midst: Sustained Inattentional Blindness for Dynamic Events , 1999, Perception.