Compressive Temporal Summation in Human Visual Cortex

Combining sensory inputs over space and time is fundamental to vision. Population receptive field models have been successful in characterizing spatial encoding throughout the human visual pathways. A parallel question, how visual areas in the human brain process information distributed over time, has received less attention. One challenge is that the most widely used neuroimaging method, fMRI, has coarse temporal resolution compared with the time-scale of neural dynamics. Here, via carefully controlled temporally modulated stimuli, we show that information about temporal processing can be readily derived from fMRI signal amplitudes in male and female subjects. We find that all visual areas exhibit subadditive summation, whereby responses to longer stimuli are less than the linear prediction from briefer stimuli. We also find fMRI evidence that the neural response to two stimuli is reduced for brief interstimulus intervals (indicating adaptation). These effects are more pronounced in visual areas anterior to V1-V3. Finally, we develop a general model that shows how these effects can be captured with two simple operations: temporal summation followed by a compressive nonlinearity. This model operates for arbitrary temporal stimulation patterns and provides a simple and interpretable set of computations that can be used to characterize neural response properties across the visual hierarchy. Importantly, compressive temporal summation directly parallels earlier findings of compressive spatial summation in visual cortex describing responses to stimuli distributed across space. This indicates that, for space and time, cortex uses a similar processing strategy to achieve higher-level and increasingly invariant representations of the visual world. SIGNIFICANCE STATEMENT Combining sensory inputs over time is fundamental to seeing. Two important temporal phenomena are summation, the accumulation of sensory inputs over time, and adaptation, a response reduction for repeated or sustained stimuli. We investigated these phenomena in the human visual system using fMRI. We built predictive models that operate on arbitrary temporal patterns of stimulation using two simple computations: temporal summation followed by a compressive nonlinearity. Our new temporal compressive summation model captures (1) subadditive temporal summation, and (2) adaptation. We show that the model accounts for systematic differences in these phenomena across visual areas. Finally, we show that for space and time, the visual system uses a similar strategy to achieve increasingly invariant representations of the visual world.

[1]  Jonathan Winawer,et al.  A Two-Stage Cascade Model of BOLD Responses in Human Visual Cortex , 2013, PLoS Comput. Biol..

[2]  Jonathan Winawer,et al.  GLMdenoise: a fast, automated technique for denoising task-based fMRI data , 2013, Front. Neurosci..

[3]  D. Heeger,et al.  Linear Systems Analysis of Functional Magnetic Resonance Imaging in Human V1 , 1996, The Journal of Neuroscience.

[4]  O. Schwartz,et al.  Specificity and timescales of cortical adaptation as inferences about natural movie statistics , 2016, Journal of vision.

[5]  Stephen A. Engel,et al.  Linear systems analysis of the fMRI signal , 2012, NeuroImage.

[6]  M. Webster Visual Adaptation. , 2015, Annual review of vision science.

[7]  Genevieve M. Heckman,et al.  Nonlinearities in rapid event-related fMRI explained by stimulus scaling , 2007, NeuroImage.

[8]  K. Grill-Spector,et al.  fMRI-adaptation and category selectivity in human ventral temporal cortex: regional differences across time scales. , 2010, Journal of neurophysiology.

[9]  B. Wandell Foundations of vision , 1995 .

[10]  D. Tolhurst,et al.  Non-linearities of temporal summation in neurones in area 17 of the cat , 2004, Experimental Brain Research.

[11]  Edward H. Adelson,et al.  Motion illusions as optimal percepts , 2002, Nature Neuroscience.

[12]  D G Pelli,et al.  The VideoToolbox software for visual psychophysics: transforming numbers into movies. , 1997, Spatial vision.

[13]  Pinglei Bao,et al.  Using an achiasmic human visual system to quantify the relationship between the fMRI BOLD signal and neural response , 2015, eLife.

[14]  Kalanit Grill-Spector,et al.  Encoding model of temporal processing in human visual cortex , 2017, Proceedings of the National Academy of Sciences.

[15]  David J. Freedman,et al.  A hierarchy of intrinsic timescales across primate cortex , 2014, Nature Neuroscience.

[16]  Leslie G. Ungerleider,et al.  Modulation of sensory suppression: implications for receptive field sizes in the human visual cortex. , 2001, Journal of neurophysiology.

[17]  M. Tovée,et al.  The responses of single neurons in the temporal visual cortical areas of the macaque when more than one stimulus is present in the receptive field , 2004, Experimental Brain Research.

[18]  J. Winawer,et al.  Linking Electrical Stimulation of Human Primary Visual Cortex, Size of Affected Cortical Area, Neuronal Responses, and Subjective Experience , 2016, Neuron.

[19]  H. Barlow Vision: A computational investigation into the human representation and processing of visual information: David Marr. San Francisco: W. H. Freeman, 1982. pp. xvi + 397 , 1983 .

[20]  Jonathan Winawer,et al.  Modeling systematic differences in temporal summation and adaptation in human visual cortex: evidence from fMRI and intracranial EEG , 2017 .

[21]  A. Kohn Visual adaptation: physiology, mechanisms, and functional benefits. , 2007, Journal of neurophysiology.

[22]  John H. R. Maunsell,et al.  Visual processing in monkey extrastriate cortex. , 1987, Annual review of neuroscience.

[23]  Eero P. Simoncelli,et al.  Computational models of cortical visual processing. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[24]  Gordon T. Plant,et al.  Temporal frequency discrimination in human vision: Evidence for an additional mechanism in the low spatial and high temporal frequency region , 1985, Vision Research.

[25]  Brian A. Wandell,et al.  Population receptive field estimates in human visual cortex , 2008, NeuroImage.

[26]  Nicholas J. Priebe,et al.  Constraints on the source of short-term motion adaptation in macaque area MT. I. the role of input and intrinsic mechanisms. , 2002, Journal of neurophysiology.

[27]  J. Atick,et al.  STATISTICS OF NATURAL TIME-VARYING IMAGES , 1995 .

[28]  I. Gauthier,et al.  Expertise for cars and birds recruits brain areas involved in face recognition , 2000, Nature Neuroscience.

[29]  S P McKee,et al.  Discrimination of time: comparison of foveal and peripheral sensitivity. , 1984, Journal of the Optical Society of America. A, Optics and image science.

[30]  H. Helmholtz Helmholtz's Treatise on Physiological Optics , 1963 .

[31]  Kendrick N. Kay,et al.  Principles for models of neural information processing , 2017, NeuroImage.

[32]  James J. DiCarlo,et al.  Balanced Increases in Selectivity and Tolerance Produce Constant Sparseness along the Ventral Visual Stream , 2012, The Journal of Neuroscience.

[33]  Liang Wang,et al.  Probabilistic Maps of Visual Topography in Human Cortex. , 2015, Cerebral cortex.

[34]  D. Heeger,et al.  Slow Cortical Dynamics and the Accumulation of Information over Long Timescales , 2012, Neuron.

[35]  Jonathan Winawer,et al.  Computational neuroimaging and population receptive fields , 2015, Trends in Cognitive Sciences.

[36]  Eero P. Simoncelli,et al.  A model of neuronal responses in visual area MT , 1998, Vision Research.

[37]  R. W. Rodieck,et al.  Parasol and midget ganglion cells of the human retina , 1985, The Journal of comparative neurology.

[38]  N. Kanwisher,et al.  The lateral occipital complex and its role in object recognition , 2001, Vision Research.

[39]  S. Solomon,et al.  Moving Sensory Adaptation beyond Suppressive Effects in Single Neurons , 2014, Current Biology.

[40]  Jonathan Winawer,et al.  Asynchronous Broadband Signals Are the Principal Source of the BOLD Response in Human Visual Cortex , 2013, Current Biology.

[41]  David J. Field,et al.  Emergence of simple-cell receptive field properties by learning a sparse code for natural images , 1996, Nature.

[42]  E H Adelson,et al.  Spatiotemporal energy models for the perception of motion. , 1985, Journal of the Optical Society of America. A, Optics and image science.

[43]  Nicole C. Rust,et al.  Selectivity and Tolerance (“Invariance”) Both Increase as Visual Information Propagates from Cortical Area V4 to IT , 2010, The Journal of Neuroscience.

[44]  Kalanit Grill-Spector,et al.  Sparsely-distributed organization of face and limb activations in human ventral temporal cortex , 2010, NeuroImage.

[45]  B. Wandell,et al.  Compressive spatial summation in human visual cortex. , 2013, Journal of neurophysiology.

[46]  K. H. Britten,et al.  Spatial Summation in the Receptive Fields of MT Neurons , 1999, The Journal of Neuroscience.

[47]  D. Heeger,et al.  A Hierarchy of Temporal Receptive Windows in Human Cortex , 2008, The Journal of Neuroscience.

[48]  J. Robson,et al.  Discrimination at threshold: Labelled detectors in human vision , 1981, Vision Research.

[49]  David J Heeger,et al.  Theory of cortical function , 2017, Proceedings of the National Academy of Sciences.

[50]  J. Movshon,et al.  Time Course and Time-Distance Relationships for Surround Suppression in Macaque V1 Neurons , 2003, The Journal of Neuroscience.

[51]  B. A. Wandell,et al.  Two temporal channels in human V1 identified using fMRI , 2009, NeuroImage.

[52]  D H Brainard,et al.  The Psychophysics Toolbox. , 1997, Spatial vision.

[53]  A. Watson Derivation of the impulse response: comments on the method of Roufs and Blommaert , 1982, Vision Research.

[54]  A. Holcombe Seeing slow and seeing fast: two limits on perception , 2009, Trends in Cognitive Sciences.

[55]  Alan C. Evans,et al.  A General Statistical Analysis for fMRI Data , 2000, NeuroImage.

[56]  Zhigang Yang,et al.  Visualization of icing process of a water droplet impinging onto a frozen cold plate under free and forced convection , 2012, Journal of Visualization.

[57]  S. Thompson-Schill,et al.  Varying Timescales of Stimulus Integration Unite Neural Adaptation and Prototype Formation , 2016, Current Biology.

[58]  Eero P. Simoncelli,et al.  Natural signal statistics and sensory gain control , 2001, Nature Neuroscience.

[59]  M. Tovée,et al.  Translation invariance in the responses to faces of single neurons in the temporal visual cortical areas of the alert macaque. , 1994, Journal of neurophysiology.

[60]  James J. DiCarlo,et al.  How Does the Brain Solve Visual Object Recognition? , 2012, Neuron.

[61]  K. H. Britten,et al.  Contrast dependence of response normalization in area MT of the rhesus macaque. , 2002, Journal of neurophysiology.

[62]  M. Carandini,et al.  Normalization as a canonical neural computation , 2011, Nature Reviews Neuroscience.

[63]  M. A. Bouman,et al.  Opponent color coding: A mechanistic model and a new metric for color space , 1972, Kybernetik.

[64]  Harvey S. Smallman,et al.  Category effects in color memory , 1989 .

[65]  D. Heeger Normalization of cell responses in cat striate cortex , 1992, Visual Neuroscience.

[66]  Karl J. Friston,et al.  A direct quantitative relationship between the functional properties of human and macaque V5 , 2000, Nature Neuroscience.

[67]  Leila Reddy,et al.  The Effects of Context and Attention on Spiking Activity in Human Early Visual Cortex , 2016, PLoS biology.

[68]  D. G. Albrecht,et al.  Spikes versus BOLD: what does neuroimaging tell us about neuronal activity? , 2000, Nature Neuroscience.

[69]  Kendrick N. Kay,et al.  Attention Reduces Spatial Uncertainty in Human Ventral Temporal Cortex , 2015, Current Biology.

[70]  R. Turner,et al.  Event-Related fMRI: Characterizing Differential Responses , 1998, NeuroImage.

[71]  B. C. Motter,et al.  Modulation of Transient and Sustained Response Components of V4 Neurons by Temporal Crowding in Flashed Stimulus Sequences , 2006, The Journal of Neuroscience.