Convergent evolution of face spaces across human face-selective neuronal groups and deep convolutional networks

The discovery that deep convolutional neural networks (DCNNs) achieve human performance in realistic tasks offers fresh opportunities for linking neuronal tuning properties to such tasks. Here we show that the face-space geometry, revealed through pair-wise activation similarities of face-selective neuronal groups recorded intracranially in 33 patients, significantly matches that of a DCNN having human-level face recognition capabilities. This convergent evolution of pattern similarities across biological and artificial networks highlights the significance of face-space geometry in face perception. Furthermore, the nature of the neuronal to DCNN match suggests a role of human face areas in pictorial aspects of face perception. First, the match was confined to intermediate DCNN layers. Second, presenting identity-preserving image manipulations to the DCNN abolished its correlation to neuronal responses. Finally, DCNN units matching human neuronal group tuning displayed view-point selective receptive fields. Our results demonstrate the importance of face-space geometry in the pictorial aspects of human face perception. Deep convolutional neural networks (DCNNs) are able to identify faces on par with humans. Here, the authors record neuronal activity from higher visual areas in humans and show that face-selective responses in the brain show similarity to those in the intermediate layers of the DCNN.

[1]  I. Fried,et al.  Neural “Ignition”: Enhanced Activation Linked to Perceptual Awareness in Human Ventral Stream Visual Cortex , 2009, Neuron.

[2]  K. Grill-Spector,et al.  The functional architecture of the ventral temporal cortex and its role in categorization , 2014, Nature Reviews Neuroscience.

[3]  J. DiCarlo,et al.  Using goal-driven deep learning models to understand sensory cortex , 2016, Nature Neuroscience.

[4]  Stephen M. Smith,et al.  A global optimisation method for robust affine registration of brain images , 2001, Medical Image Anal..

[5]  J. Haxby,et al.  The distributed human neural system for face perception , 2000, Trends in Cognitive Sciences.

[6]  Luc Van Gool,et al.  European conference on computer vision (ECCV) , 2006, eccv 2006.

[7]  Pascal Vincent,et al.  Visualizing Higher-Layer Features of a Deep Network , 2009 .

[8]  Anders M. Dale,et al.  Automatic parcellation of human cortical gyri and sulci using standard anatomical nomenclature , 2010, NeuroImage.

[9]  John B. Shoven,et al.  I , Edinburgh Medical and Surgical Journal.

[10]  Rob Fergus,et al.  Visualizing and Understanding Convolutional Networks , 2013, ECCV.

[11]  Yehezkel Yeshurun,et al.  Enhanced Category Tuning Revealed by Intracranial Electroencephalograms in High-Order Human Visual Areas , 2007, The Journal of Neuroscience.

[12]  B. Argall,et al.  Simplified intersubject averaging on the cortical surface using SUMA , 2006, Human brain mapping.

[13]  Talma Hendler,et al.  Vase or face? A neural correlate of shape-selective grouping processes in the human brain , 2001, NeuroImage.

[14]  Lionel Naccache,et al.  Face-selective neurons in the vicinity of the human fusiform face area , 2019, Neurology.

[15]  Keiji Tanaka,et al.  Coding visual images of objects in the inferotemporal cortex of the macaque monkey. , 1991, Journal of neurophysiology.

[16]  Doris Y. Tsao,et al.  A Cortical Region Consisting Entirely of Face-Selective Cells , 2006, Science.

[17]  Philippe Kahane,et al.  Activations of deep convolutional neural networks are aligned with gamma band activity of human visual cortex , 2017, Communications Biology.

[18]  M. Giese,et al.  Norm-based face encoding by single neurons in the monkey inferotemporal cortex , 2006, Nature.

[19]  David Pitcher,et al.  Facial Expression Recognition Takes Longer in the Posterior Superior Temporal Sulcus than in the Occipital Face Area , 2014, The Journal of Neuroscience.

[20]  C. Koch,et al.  Invariant visual representation by single neurons in the human brain , 2005, Nature.

[21]  Liang Wang,et al.  Probabilistic Maps of Visual Topography in Human Cortex. , 2015, Cerebral cortex.

[22]  Josef Parvizi,et al.  Corresponding ECoG and fMRI category-selective signals in human ventral temporal cortex , 2016, Neuropsychologia.

[23]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[24]  Bruno Rossion,et al.  A face identity hallucination (palinopsia) generated by intracerebral stimulation of the face-selective right lateral fusiform cortex , 2018, Cortex.

[25]  Mark J. Hilsenroth,et al.  A primer on meta-analysis of correlation coefficients: The relationship between patient-reported therapeutic alliance and adult attachment style as an illustration , 2009, Psychotherapy research : journal of the Society for Psychotherapy Research.

[26]  N. Kanwisher,et al.  How face perception unfolds over time , 2018, Nature Communications.

[27]  Denise C. Park,et al.  A lifespan database of adult facial stimuli , 2004, Behavior research methods, instruments, & computers : a journal of the Psychonomic Society, Inc.

[28]  K. Grill-Spector,et al.  Electrical Stimulation of Human Fusiform Face-Selective Regions Distorts Face Perception , 2012, The Journal of Neuroscience.

[29]  J. Wagemans,et al.  Is neuroimaging measuring information in the brain? , 2016, Psychonomic Bulletin & Review.

[30]  Lucia Melloni,et al.  Human intracranial recordings link suppressed transients rather than 'filling-in' to perceptual continuity across blinks , 2016, eLife.

[31]  Jeremy R. Manning,et al.  Broadband Shifts in Local Field Potential Power Spectra Are Correlated with Single-Neuron Spiking in Humans , 2009, The Journal of Neuroscience.

[32]  S. Edelman,et al.  Toward direct visualization of the internal shape representation space by fMRI , 1998, Psychobiology.

[33]  K. Nakayama,et al.  Binocular Rivalry and Visual Awareness in Human Extrastriate Cortex , 1998, Neuron.

[34]  Shimon Ullman,et al.  Class Information Predicts Activation by Object Fragments in Human Object Areas , 2008, Journal of Cognitive Neuroscience.

[35]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[36]  Andrew Zisserman,et al.  Deep Face Recognition , 2015, BMVC.

[37]  Christopher J. Honey,et al.  iELVis: An open source MATLAB toolbox for localizing and visualizing human intracranial electrode data , 2016, Journal of Neuroscience Methods.

[38]  Ashesh D. Mehta,et al.  Tuning face perception with electrical stimulation of the fusiform gyrus , 2017, Human brain mapping.

[39]  Anders M. Dale,et al.  Cortical Surface-Based Analysis I. Segmentation and Surface Reconstruction , 1999, NeuroImage.

[40]  T. Allison,et al.  Face recognition in human extrastriate cortex. , 1994, Journal of neurophysiology.

[41]  Nikolaus Kriegeskorte,et al.  Deep Supervised, but Not Unsupervised, Models May Explain IT Cortical Representation , 2014, PLoS Comput. Biol..

[42]  R. VanRullen Perception Science in the Age of Deep Neural Networks , 2017, Front. Psychol..

[43]  Bruno Rossion,et al.  Understanding face perception by means of prosopagnosia and neuroimaging. , 2014, Frontiers in bioscience.

[44]  Gilles Pourtois,et al.  View-independent coding of face identity in frontal and temporal cortices is modulated by familiarity: an event-related fMRI study , 2005, NeuroImage.

[45]  Uri Hasson,et al.  Altered topology of neural circuits in congenital prosopagnosia , 2017, bioRxiv.

[46]  I. Fried,et al.  Coupling between Neuronal Firing Rate, Gamma LFP, and BOLD fMRI Is Related to Interneuronal Correlations , 2007, Current Biology.

[47]  Frank S. Werblin,et al.  Mechanisms and circuitry underlying directional selectivity in the retina , 2002, Nature.

[48]  G. Yovel,et al.  Hierarchical Processing of Face Viewpoint in Human Visual Cortex , 2012, The Journal of Neuroscience.

[49]  Swami Sankaranarayanan,et al.  Face recognition accuracy of forensic examiners, superrecognizers, and face recognition algorithms , 2018, Proceedings of the National Academy of Sciences.

[50]  David M. Groppe,et al.  Exemplar selectivity reflects perceptual similarities in the human fusiform cortex. , 2014, Cerebral cortex.

[51]  Frank Tong,et al.  Prevalence of Selectivity for Mirror-Symmetric Views of Faces in the Ventral and Dorsal Visual Pathways , 2012, The Journal of Neuroscience.

[52]  Timothy J. Andrews,et al.  Differential sensitivity for viewpoint between familiar and unfamiliar faces in human visual cortex , 2008, NeuroImage.

[53]  Nancy Kanwisher,et al.  Facephenes and rainbows: Causal evidence for functional and anatomical specificity of face and color processing in the human brain , 2017, Proceedings of the National Academy of Sciences.

[54]  S. Edelman,et al.  Differential Processing of Objects under Various Viewing Conditions in the Human Lateral Occipital Complex , 1999, Neuron.

[55]  Galit Yovel,et al.  A Revised Neural Framework for Face Processing. , 2015, Annual review of vision science.

[56]  James J DiCarlo,et al.  Large-Scale, High-Resolution Comparison of the Core Visual Object Recognition Behavior of Humans, Monkeys, and State-of-the-Art Deep Artificial Neural Networks , 2018, The Journal of Neuroscience.

[57]  Antonio Torralba,et al.  Comparison of deep neural networks to spatio-temporal cortical dynamics of human visual object recognition reveals hierarchical correspondence , 2016, Scientific Reports.

[58]  Ronen Basri,et al.  Perceptual Dominance in Brief Presentations of Mixed Images: Human Perception vs. Deep Neural Networks , 2018, Front. Comput. Neurosci..

[59]  James J. DiCarlo,et al.  Evidence that recurrent circuits are critical to the ventral stream’s execution of core object recognition behavior , 2018, Nature Neuroscience.

[60]  Keiji Tanaka,et al.  Matching Categorical Object Representations in Inferior Temporal Cortex of Man and Monkey , 2008, Neuron.

[61]  Arnaud Delorme,et al.  EEGLAB: an open source toolbox for analysis of single-trial EEG dynamics including independent component analysis , 2004, Journal of Neuroscience Methods.

[62]  Jonas Kubilius,et al.  Evidence that recurrent circuits are critical to the ventral stream’s execution of core object recognition behavior , 2019, Nature Neuroscience.

[63]  N. Kanwisher,et al.  The fusiform face area subserves face perception, not generic within-category identification , 2004, Nature Neuroscience.

[64]  Jia Deng,et al.  A large-scale hierarchical image database , 2009, CVPR 2009.

[65]  T. Allison,et al.  Electrophysiological studies of human face perception. I: Potentials generated in occipitotemporal cortex by face and non-face stimuli. , 1999, Cerebral cortex.

[66]  K. Grill-Spector,et al.  Electrical Stimulation of the Left and Right Human Fusiform Gyrus Causes Different Effects in Conscious Face Perception , 2014, The Journal of Neuroscience.

[67]  Marcel A. J. van Gerven,et al.  Deep Neural Networks Reveal a Gradient in the Complexity of Neural Representations across the Ventral Stream , 2014, The Journal of Neuroscience.

[68]  Alexander Borst,et al.  How does Nature Program Neuron Types? , 2008, Front. Neurosci..

[69]  Ha Hong,et al.  Performance-optimized hierarchical models predict neural responses in higher visual cortex , 2014, Proceedings of the National Academy of Sciences.

[70]  Doris Y. Tsao,et al.  The Code for Facial Identity in the Primate Brain , 2017, Cell.

[71]  Xenophon Papademetris,et al.  BioImage Suite: An integrated medical image analysis suite: An update. , 2006, The insight journal.

[72]  Ming Yang,et al.  DeepFace: Closing the Gap to Human-Level Performance in Face Verification , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[73]  Nikolaus Kriegeskorte,et al.  Representational Similarity Analysis – Connecting the Branches of Systems Neuroscience , 2008, Frontiers in systems neuroscience.