A Texture Statistics Encoding Model Reveals Hierarchical Feature Selectivity across Human Visual Cortex

Midlevel features, such as contour and texture, provide a computational link between low- and high-level visual representations. Although the nature of midlevel representations in the brain is not fully understood, past work has suggested a texture statistics model, called the P–S model (Portilla and Simoncelli, 2000), is a candidate for predicting neural responses in areas V1–V4 as well as human behavioral data. However, it is not currently known how well this model accounts for the responses of higher visual cortex to natural scene images. To examine this, we constructed single-voxel encoding models based on P–S statistics and fit the models to fMRI data from human subjects (both sexes) from the Natural Scenes Dataset (Allen et al., 2022). We demonstrate that the texture statistics encoding model can predict the held-out responses of individual voxels in early retinotopic areas and higher-level category-selective areas. The ability of the model to reliably predict signal in higher visual cortex suggests that the representation of texture statistics features is widespread throughout the brain. Furthermore, using variance partitioning analyses, we identify which features are most uniquely predictive of brain responses and show that the contributions of higher-order texture features increase from early areas to higher areas on the ventral and lateral surfaces. We also demonstrate that patterns of sensitivity to texture statistics can be used to recover broad organizational axes within visual cortex, including dimensions that capture semantic image content. These results provide a key step forward in characterizing how midlevel feature representations emerge hierarchically across the visual system.

SIGNIFICANCE STATEMENT Intermediate visual features, like texture, play an important role in cortical computations and may contribute to tasks like object and scene recognition. Here, we used a texture model proposed in past work to construct encoding models that predict the responses of neural populations in human visual cortex (measured with fMRI) to natural scene stimuli. We show that responses of neural populations at multiple levels of the visual system can be predicted by this model, and that the model is able to reveal an increase in the complexity of feature representations from early retinotopic cortex to higher areas of ventral and lateral visual cortex. These results support the idea that texture-like representations may play a broad underlying role in visual processing.
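
The variance partitioning idea mentioned in the abstract can be illustrated with a small sketch. The abstract does not specify the regression method or the feature groupings, so everything below is an assumption made for illustration: the encoding model is a closed-form ridge regression, the two feature groups (`lower_level`, `higher_order`) are hypothetical stand-ins for subsets of P–S statistics, and the "voxel" response is synthetic. The unique contribution of a feature group is taken to be the held-out R² of the full model minus the held-out R² of a model fit with that group left out.

```python
import numpy as np

def ridge_fit(X, y, lam):
    """Closed-form ridge weights (no intercept; features assumed roughly centered)."""
    n_feat = X.shape[1]
    return np.linalg.solve(X.T @ X + lam * np.eye(n_feat), X.T @ y)

def r2_score(y_true, y_pred):
    """Coefficient of determination on held-out data."""
    ss_res = np.sum((y_true - y_pred) ** 2)
    ss_tot = np.sum((y_true - y_true.mean()) ** 2)
    return 1.0 - ss_res / ss_tot

def heldout_r2(X_train, y_train, X_test, y_test, lam=1.0):
    """Fit on the training split, score on the held-out split."""
    w = ridge_fit(X_train, y_train, lam)
    return r2_score(y_test, X_test @ w)

def unique_variance(groups_train, groups_test, y_train, y_test, lam=1.0):
    """Unique R^2 per group: full-model R^2 minus R^2 with that group left out."""
    names = list(groups_train)
    X_tr = np.hstack([groups_train[n] for n in names])
    X_te = np.hstack([groups_test[n] for n in names])
    r2_full = heldout_r2(X_tr, y_train, X_te, y_test, lam)
    unique = {}
    for left_out in names:
        keep = [n for n in names if n != left_out]
        X_tr_sub = np.hstack([groups_train[n] for n in keep])
        X_te_sub = np.hstack([groups_test[n] for n in keep])
        r2_sub = heldout_r2(X_tr_sub, y_train, X_te_sub, y_test, lam)
        unique[left_out] = r2_full - r2_sub
    return r2_full, unique

# Synthetic demo: one simulated voxel driven mostly by the "higher_order" group.
rng = np.random.default_rng(0)
n_train, n_test = 800, 200
n_total = n_train + n_test
lower = rng.standard_normal((n_total, 5))    # hypothetical lower-level features
higher = rng.standard_normal((n_total, 5))   # hypothetical higher-order features
y = lower @ np.full(5, 0.2) + higher @ np.full(5, 0.6) + rng.standard_normal(n_total)

train = {"lower_level": lower[:n_train], "higher_order": higher[:n_train]}
test = {"lower_level": lower[n_train:], "higher_order": higher[n_train:]}
r2_full, unique = unique_variance(train, test, y[:n_train], y[n_train:])

print(f"full-model held-out R^2: {r2_full:.3f}")
for name, value in unique.items():
    print(f"unique R^2 for {name}: {value:.3f}")
```

In this toy setting the simulated voxel is built to depend more strongly on the `higher_order` group, so that group should receive the larger unique share. With real data the same computation would be repeated for every voxel, and the regularization strength `lam` would normally be chosen by cross-validation rather than fixed.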

[1]  Zvi N. Roth,et al.  Natural scene sampling reveals reliable coarse-scale orientation tuning in human V1 , 2022, Nature Communications.

[2]  M. Tarr,et al.  Low-level tuning biases in higher visual cortex reflect the semantic informativeness of visual features , 2022, bioRxiv.

[3]  John A. Pyles,et al.  GLMsingle: a toolbox for improving single-trial fMRI response estimates , 2022, bioRxiv.

[4]  I. Fujita,et al.  Processing of visual statistics of naturalistic videos in macaque visual areas V1 and V4 , 2021, Brain Structure and Function.

[5]  Emily J. Allen,et al.  A massive 7T fMRI dataset to bridge cognitive neuroscience and artificial intelligence , 2021, Nature Neuroscience.

[6]  Russell A. Epstein,et al.  Scene Perception in the Human Brain. , 2019, Annual review of vision science.

[7]  Jack L. Gallant,et al.  Human Scene-Selective Areas Represent 3D Configurations of Surfaces , 2019, Neuron.

[8]  Jack L. Gallant,et al.  Voxelwise encoding models with non-spherical multivariate normal priors , 2018, NeuroImage.

[9]  Jonathan Winawer,et al.  The Human Connectome Project 7 Tesla retinotopy dataset: Description and population receptive field analysis , 2018, Journal of vision.

[10]  Talia Konkle,et al.  Mid-level visual features underlie the high-level categorical organization of the ventral stream , 2018, Proceedings of the National Academy of Sciences.

[11]  Ghislain St-Yves,et al.  The feature-weighted receptive field: an interpretable encoding model for complex feature spaces , 2017, NeuroImage.

[12]  Stefania Bracci,et al.  On the partnership between neural representations of object categories and visual features in the ventral visual pathway , 2017, Neuropsychologia.

[13]  George A Alvarez,et al.  Mid-level perceptual features contain early cues to animacy. , 2017, Journal of vision.

[14]  Chris I Baker,et al.  Contributions of low- and high-level properties to neural processing of visual scenes in the human brain , 2017, Philosophical Transactions of the Royal Society B: Biological Sciences.

[15]  Till S. Hartmann,et al.  End-Stopping Predicts Curvature Tuning along the Ventral Stream , 2017, The Journal of Neuroscience.

[16]  Gouki Okazawa,et al.  Gradual Development of Visual Texture-Selective Properties Between Macaque Areas V2 and V4 , 2016, Cerebral cortex.

[17]  K. Gegenfurtner,et al.  Image Statistics and the Representation of Material Properties in the Visual Cortex , 2016, Front. Psychol..

[18]  Thomas L. Griffiths,et al.  Supplementary Information for Natural Speech Reveals the Semantic Maps That Tile Human Cerebral Cortex , 2016 .

[19]  A. Norcia,et al.  Representation of Maximally Regular Textures in Human Visual Cortex , 2016, The Journal of Neuroscience.

[20]  Michael A. Cohen,et al.  Mid-level perceptual features distinguish objects of different real-world sizes. , 2016, Journal of experimental psychology. General.

[21]  Jack L. Gallant,et al.  Fourier power, subjective distance, and object categories all provide plausible models of BOLD responses in scene-selective visual areas , 2015, Front. Comput. Neurosci..

[22]  Liang Wang,et al.  Probabilistic Maps of Visual Topography in Human Cortex. , 2015, Cerebral cortex.

[23]  Thomas Naselaris,et al.  Resolving Ambiguities of MVPA Using Explicit Models of Representation , 2015, Trends in Cognitive Sciences.

[24]  Jack L. Gallant,et al.  Pycortex: an interactive surface visualizer for fMRI , 2015, Front. Neuroinform..

[25]  Kalanit Grill-Spector,et al.  Temporal Processing Capacity in High-Level Visual Cortex Is Domain Specific , 2015, The Journal of Neuroscience.

[26]  J. Peirce Understanding mid-level representations in visual processing. , 2015, Journal of vision.

[27]  H. Komatsu,et al.  Image statistics underlying natural texture selectivity of neurons in macaque V4 , 2014, Proceedings of the National Academy of Sciences.

[28]  Brian Murphy,et al.  Simultaneously Uncovering the Patterns of Brain Regions Involved in Different Story Reading Subprocesses , 2014, PloS one.

[29]  Marcel van Gerven,et al.  Unsupervised Feature Learning Improves Prediction of Human Brain Activity in Response to Natural Images , 2014, PLoS Comput. Biol..

[30]  K. Grill-Spector,et al.  The functional architecture of the ventral temporal cortex and its role in categorization , 2014, Nature Reviews Neuroscience.

[31]  R. Tootell,et al.  Thinking Outside the Box: Rectilinear Shapes Selectively Activate Scene-Selective Cortex , 2014, The Journal of Neuroscience.

[32]  Pietro Perona,et al.  Microsoft COCO: Common Objects in Context , 2014, ECCV.

[33]  Alex Krizhevsky,et al.  One weird trick for parallelizing convolutional neural networks , 2014, ArXiv.

[34]  Dirk B Walther,et al.  Nonaccidental Properties Underlie Human Categorization of Complex Natural Scenes , 2014, Psychological science.

[35]  A. Caramazza,et al.  Tripartite Organization of the Ventral Stream by Animacy and Object Size , 2013, The Journal of Neuroscience.

[36]  Eero P. Simoncelli,et al.  A functional and perceptual signature of the second visual area in primates , 2013, Nature Neuroscience.

[37]  Roger B. H. Tootell,et al.  A Cardinal Orientation Bias in Scene-Selective Visual Cortex , 2012, The Journal of Neuroscience.

[38]  Victor A. F. Lamme,et al.  Spatially Pooled Contrast Responses Predict Neural and Perceptual Similarity of Naturalistic Image Categories , 2012, PLoS Comput. Biol..

[39]  John T. Serences,et al.  Computational advances towards linking BOLD and behavior , 2012, Neuropsychologia.

[40]  R. Rosenholtz,et al.  A summary statistic representation in peripheral vision explains visual search. , 2012, Journal of vision.

[41]  Eero P. Simoncelli,et al.  Metamers of the ventral stream , 2011, Nature Neuroscience.

[42]  Jeremy Freeman,et al.  Orientation Decoding Depends on Maps, Not Columns , 2011, The Journal of Neuroscience.

[43]  Nicole C. Rust,et al.  Selectivity and Tolerance (“Invariance”) Both Increase as Visual Information Propagates from Cortical Area V4 to IT , 2010, The Journal of Neuroscience.

[44]  R. Rosenholtz,et al.  A summary-statistic representation in peripheral vision explains visual crowding. , 2009, Journal of vision.

[45]  Brian A. Wandell,et al.  Population receptive field estimates in human visual cortex , 2008, NeuroImage.

[46]  Tai Sing Lee,et al.  Contextual Influences in Visual Processing , 2008 .

[47]  Anitha Pasupathy,et al.  Transformation of shape information in the ventral pathway , 2007, Current Opinion in Neurobiology.

[48]  J. Gallant,et al.  Complete functional characterization of sensory neurons by system identification. , 2006, Annual review of neuroscience.

[49]  D. Kersten,et al.  The representation of perceived angular size in human primary visual cortex , 2006, Nature Neuroscience.

[50]  Benjamin J. Balas,et al.  Texture synthesis and perception: Using computational models to study texture representations in the human visual system , 2006, Vision Research.

[51]  Nicole C. Rust,et al.  Do We Know What the Early Visual System Does? , 2005, The Journal of Neuroscience.

[52]  Antonio Torralba,et al.  Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope , 2001, International Journal of Computer Vision.

[53]  Eero P. Simoncelli,et al.  A Parametric Texture Model Based on Joint Statistics of Complex Wavelet Coefficients , 2000, International Journal of Computer Vision.

[54]  Robert O. Duncan,et al.  Cortical Magnification within Human Primary Visual Cortex Correlates with Acuity Thresholds , 2003, Neuron.

[55]  Michel Vidal-Naquet,et al.  Visual features of intermediate complexity and their use in classification , 2002, Nature Neuroscience.

[56]  L. Optican,et al.  Cortical regions involved in visual texture perception: a fMRI study. , 1998, Brain research. Cognitive brain research.

[57]  William T. Freeman,et al.  Presented at: 2nd Annual IEEE International Conference on Image , 1995 .

[58]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[59]  J D Victor,et al.  Striate cortex extracts higher-order spatial correlations from visual textures. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[60]  J. Bergen,et al.  Computational Modeling of Visual Texture Segregation , 1991 .

[61]  R. Desimone,et al.  Stimulus-selective properties of inferior temporal neurons in the macaque , 1984, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[62]  S. Holm A Simple Sequentially Rejective Multiple Test Procedure , 1979 .

[63]  D. Hubel,et al.  Receptive fields, binocular interaction and functional architecture in the cat's visual cortex , 1962, The Journal of physiology.