Higher visual areas act like domain-general filters with strong selectivity and functional specialization

Neuroscientific studies rely heavily on a-priori hypotheses, which can bias results toward existing theories. Here, we use a hypothesis-neutral approach to study category selectivity in higher visual cortex. Using only stimulus images and their associated fMRI activity, we constrain randomly initialized neural networks to predict voxel activity. Despite no category-level supervision, the units in the trained networks act as detectors for semantic concepts like ‘faces’ or ‘words’, providing solid empirical support for categorical selectivity. Importantly, this selectivity is maintained when training the networks without images that contain the preferred category, strongly suggesting that selectivity is not domain-specific machinery, but sensitivity to generic patterns that characterize preferred categories. The ability of the models’ representations to transfer to perceptual tasks further reveals the functional role of their selective responses. Finally, our models show selectivity only for a limited number of categories, all previously identified, suggesting that the essential categories are already known. Teaser Models trained solely to predict fMRI activity from images reveal strong category selectivity in higher visual areas, even without exposure to these categories in training.

[1]  Jacob S. Prince,et al.  Selectivity for food in human ventral visual cortex , 2023, bioRxiv.

[2]  Emily J. Allen,et al.  Color-biased regions in the ventral visual pathway are food selective , 2022, Current Biology.

[3]  N. Kanwisher,et al.  A highly selective response to food in human visual cortex revealed by hypothesis-free voxel decomposition , 2022, Current Biology.

[4]  Christoph Dann,et al.  Same Cause; Different Effects in the Brain , 2022, CLeaR.

[5]  Emily J. Allen,et al.  Brain-optimized neural networks learn non-hierarchical models of representation in human visual cortex , 2022, bioRxiv.

[6]  Mert R. Sabuncu,et al.  NeuroGen: Activation optimized image synthesis for discovery neuroscience , 2021, NeuroImage.

[7]  Emily J. Allen,et al.  A massive 7T fMRI dataset to bridge cognitive neuroscience and artificial intelligence , 2021, Nature Neuroscience.

[8]  N. Kanwisher,et al.  Computational models of category-selective brain regions enable high-throughput tests of selectivity , 2021, Nature Communications.

[9]  Nikolaus Kriegeskorte,et al.  Diverse Deep Neural Networks All Predict Human Inferior Temporal Cortex Well, After Training and Fitting , 2021, Journal of Cognitive Neuroscience.

[10]  Emily J. Allen,et al.  A massive 7T fMRI dataset to bridge cognitive and computational neuroscience , 2021, bioRxiv.

[11]  Mert R. Sabuncu,et al.  Cortical response to naturalistic stimuli is largely predictable with deep neural networks , 2020, Science Advances.

[12]  Doris Y. Tsao,et al.  Explaining face representation in the primate brain using different computational models , 2020, Current Biology.

[13]  Grace W. Lindsay Convolutional Neural Networks as a Model of the Visual System: Past, Present, and Future , 2020, Journal of Cognitive Neuroscience.

[14]  Tom M. Mitchell,et al.  Combining computational controls with natural text reveals new aspects of meaning composition , 2020, bioRxiv.

[15]  Jonas Kubilius,et al.  Integrative Benchmarking to Advance Neurally Mechanistic Models of Human Intelligence , 2020, Neuron.

[16]  Bolei Zhou,et al.  Understanding the role of individual units in a deep neural network , 2020, Proceedings of the National Academy of Sciences.

[17]  Maurice Weiler,et al.  General E(2)-Equivariant Steerable CNNs , 2019, NeurIPS.

[18]  Michal Irani,et al.  Convergent evolution of face spaces across human face-selective neuronal groups and deep convolutional networks , 2019, Nature Communications.

[19]  Alexandros Iosifidis,et al.  PyGOP: A Python library for Generalized Operational Perceptron algorithms , 2019, Knowl. Based Syst..

[20]  Alexander S. Ecker,et al.  Inception loops discover what excites neurons most using deep predictive models , 2019, Nature Neuroscience.

[21]  Michael J. Tarr,et al.  Neural Taskonomy: Inferring the Similarity of Task-Derived Representations from Brain Activity , 2019, bioRxiv.

[22]  Carlos R. Ponce,et al.  Evolving Images for Visual Neurons Using a Deep Generative Network Reveals Coding Principles and Neuronal Preferences , 2019, Cell.

[23]  K. Seeliger,et al.  End-to-end neural system identification with neural information flow , 2019, bioRxiv.

[24]  Jack L. Gallant,et al.  Human Scene-Selective Areas Represent 3D Configurations of Surfaces , 2019, Neuron.

[25]  James J DiCarlo,et al.  Neural population control via deep image synthesis , 2018, Science.

[26]  Alexander S. Ecker,et al.  A rotation-equivariant convolutional neural network model of primary visual cortex , 2018, ICLR.

[27]  Jonas Kubilius,et al.  Brain-Score: Which Artificial Neural Network for Object Recognition is most Brain-Like? , 2018, bioRxiv.

[28]  Yuning Jiang,et al.  Unified Perceptual Parsing for Scene Understanding , 2018, ECCV.

[29]  Andrea Vedaldi,et al.  Net2Vec: Quantifying and Explaining How Concepts are Encoded by Filters in Deep Neural Networks , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[30]  Maurice Weiler,et al.  Learning Steerable Filters for Rotation Equivariant CNNs , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[31]  Omkar M. Parkhi,et al.  VGGFace2: A Dataset for Recognising Faces across Pose and Age , 2017, 2018 13th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2018).

[32]  Yizhen Zhang,et al.  Neural Encoding and Decoding with Deep Learning for Dynamic Natural Vision , 2016, Cerebral cortex.

[33]  Nikolaus Kriegeskorte,et al.  Deep convolutional neural networks, features, and categories perform similarly at explaining primate high-level visual representations , 2018 .

[34]  Kate Jeffery,et al.  Retrosplenial cortex and its role in spatial cognition , 2017, bioRxiv.

[35]  Alexander S. Ecker,et al.  Neural system identification for large populations separating "what" and "where" , 2017, NIPS.

[36]  Alberto Testolin,et al.  Letter perception emerges from unsupervised deep learning and recycling of natural image features , 2017, Nature Human Behaviour.

[37]  Bolei Zhou,et al.  Scene Parsing through ADE20K Dataset , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[38]  Michael Eickenberg,et al.  Seeing it all: Convolutional network layers map the function of the human visual system , 2017, NeuroImage.

[39]  Bolei Zhou,et al.  Network Dissection: Quantifying Interpretability of Deep Visual Representations , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[40]  Russell A. Epstein,et al.  Coding of navigational affordances in the human visual system , 2017, Proceedings of the National Academy of Sciences.

[41]  Jack L. Gallant,et al.  A deep convolutional energy model of V4 responses to natural movies , 2016 .

[42]  Alexei A. Efros,et al.  What makes ImageNet good for transfer learning? , 2016, ArXiv.

[43]  Timothy Edward John Behrens,et al.  Task-free MRI predicts individual differences in brain activity during task performance , 2016, Science.

[44]  J. S. Guntupalli,et al.  A Model of Representational Spaces in Human Cortex , 2016, Cerebral cortex.

[45]  Hod Lipson,et al.  Understanding Neural Networks Through Deep Visualization , 2015, ArXiv.

[46]  Jeremy Freeman,et al.  Representing “stuff” in visual cortex , 2015, Proceedings of the National Academy of Sciences.

[47]  Andrea Vedaldi,et al.  Understanding deep image representations by inverting them , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[48]  Andrew Zisserman,et al.  Deep Face Recognition , 2015, BMVC.

[49]  Leslie G. Ungerleider,et al.  Curvature-processing network in macaque visual cortex , 2014, Proceedings of the National Academy of Sciences.

[50]  K. Grill-Spector,et al.  The functional architecture of the ventral temporal cortex and its role in categorization , 2014, Nature Reviews Neuroscience.

[51]  R. Tootell,et al.  Thinking Outside the Box: Rectilinear Shapes Selectively Activate Scene-Selective Cortex , 2014, The Journal of Neuroscience.

[52]  Ha Hong,et al.  Performance-optimized hierarchical models predict neural responses in higher visual cortex , 2014, Proceedings of the National Academy of Sciences.

[53]  Pietro Perona,et al.  Microsoft COCO: Common Objects in Context , 2014, ECCV.

[54]  S. Dehaene,et al.  Characterizing the dynamics of mental representations: the temporal generalization method , 2014, Trends in Cognitive Sciences.

[55]  Jack L. Gallant,et al.  A Continuous Semantic Space Describes the Representation of Thousands of Object and Action Categories across the Human Brain , 2012, Neuron.

[56]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[57]  K. Grill-Spector,et al.  Electrical Stimulation of Human Fusiform Face-Selective Regions Distorts Face Perception , 2012, The Journal of Neuroscience.

[58]  M. Livingstone,et al.  Behavioral and Anatomical Consequences of Early versus Late Symbol Training in Macaques , 2012, Neuron.

[59]  Leslie G. Ungerleider,et al.  Scene-Selective Cortical Regions in Human and Nonhuman Primates , 2011, The Journal of Neuroscience.

[60]  Leslie G. Ungerleider,et al.  Relationship between Functional Magnetic Resonance Imaging-Identified Regions and Neuronal Category Selectivity , 2011, The Journal of Neuroscience.

[61]  P. Downing,et al.  The role of occipitotemporal body-selective regions in person perception , 2011, Cognitive neuroscience.

[62]  S. Dehaene,et al.  The unique role of the visual word form area in reading , 2011, Trends in Cognitive Sciences.

[63]  Jack L. Gallant,et al.  Encoding and decoding in fMRI , 2011, NeuroImage.

[64]  Kalanit Grill-Spector,et al.  Sparsely-distributed organization of face and limb activations in human ventral temporal cortex , 2010, NeuroImage.

[65]  Krista A. Ehinger,et al.  SUN database: Large-scale scene recognition from abbey to zoo , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[66]  Fei-Fei Li,et al.  ImageNet: A large-scale hierarchical image database , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[67]  Pascal Vincent,et al.  Visualizing Higher-Layer Features of a Deep Network , 2009 .

[68]  Y. Sugita Face perception in monkeys reared with no exposure to faces , 2008, Proceedings of the National Academy of Sciences.

[69]  J. Gallant,et al.  Complete functional characterization of sensory neurons by system identification. , 2006, Annual review of neuroscience.

[70]  Doris Y. Tsao,et al.  A Cortical Region Consisting Entirely of Face-Selective Cells , 2006, Science.

[71]  C. Gross,et al.  Representations of faces and body parts in macaque temporal cortex: a functional MRI study. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[72]  J. Gallant,et al.  Predicting neuronal responses during natural vision , 2005, Network.

[73]  P. Downing,et al.  Selectivity for the human body in the fusiform gyrus. , 2005, Journal of neurophysiology.

[74]  N. Kanwisher,et al.  The fusiform face area subserves face perception, not generic within-category identification , 2004, Nature Neuroscience.

[75]  Bruce D. McCandliss,et al.  The visual word form area: expertise for reading in the fusiform gyrus , 2003, Trends in Cognitive Sciences.

[76]  N. Kanwisher,et al.  The Human Body , 2001 .

[77]  Ravi S. Menon,et al.  An fMRI study of the selective activation of human extrastriate form vision areas by radial and concentric gratings , 2000, Current Biology.

[78]  S Lehéricy,et al.  The visual word form area: spatial and temporal characterization of an initial stage of reading in normal subjects and posterior split-brain patients. , 2000, Brain : a journal of neurology.

[79]  J. Haxby,et al.  Attribute-based neural substrates in temporal cortex for perceiving and knowing about objects , 1999, Nature Neuroscience.

[80]  Nancy Kanwisher,et al.  A cortical representation of the local visual environment , 1998, Nature.

[81]  N. Kanwisher,et al.  The Fusiform Face Area: A Module in Human Extrastriate Cortex Specialized for Face Perception , 1997, The Journal of Neuroscience.

[82]  D. V. van Essen,et al.  Selectivity for polar, hyperbolic, and Cartesian gratings in macaque visual cortex. , 1993, Science.

[83]  R. Tootell,et al.  Functional anatomy of the second visual area (V2) in the macaque , 1989, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[84]  K. Brodmann Vergleichende Lokalisationslehre der Großhirnrinde : in ihren Prinzipien dargestellt auf Grund des Zellenbaues , 1985 .

[85]  D. Hubel,et al.  Anatomical demonstration of orientation columns in macaque monkey , 1978, The Journal of comparative neurology.

[86]  D. Hubel,et al.  Receptive fields, binocular interaction and functional architecture in the cat's visual cortex , 1962, The Journal of physiology.