The face module emerges from domain-general visual experience: a deprivation study on deep convolutional neural network

Can faces be accurately recognized with zero experience on faces? The answer to this question is critical because it examines the role of experiences in the formation of domain-specific modules in the brain. However, thorough investigation with human and non-human animals on this issue cannot easily dissociate the effect of the visual experience from that of genetic inheritance, i.e., the hardwired domain-specificity. The present study addressed this problem by building a model of selective deprivation of the experience on faces with a representative deep convolutional neural network (DCNN), AlexNet. We trained a new AlexNet with the same image dataset, except that all images containing faces of human and nonhuman primates were removed. We found that the experience-deprived AlexNet (d-AlexNet) did not show significant deficits in face categorization and discrimination, and face-selective modules also automatically emerged. However, the deprivation made the d-AlexNet to process faces in a more parts-based fashion, similar to the way of processing objects. In addition, the face representation of the face-selective module in the d-AlexNet was more distributed and the empirical receptive field was larger, resulting in less degree of selectivity of the module. In sum, our study provides undisputable evidence on the role of nature versus nurture in developing the domain-specific modules that domain-specificity may evolve from non-specific stimuli and processes without genetic predisposition, which is further fine-tuned by domain-specific experience.

[1]  Xu Wang,et al.  Quantifying interindividual variability and asymmetry of face-selective regions: A probabilistic functional atlas , 2015, NeuroImage.

[2]  Nancy Kanwisher,et al.  Heritability of the Specific Cognitive Ability of Face Perception , 2010, Current Biology.

[3]  Anitha Pasupathy,et al.  'Artiphysiology' reveals V4-like shape tuning in a deep network trained for image classification , 2018, eLife.

[4]  F. Quimby What's in a picture? , 1993, Laboratory animal science.

[5]  N. Kanwisher,et al.  The fusiform face area: a cortical region specialized for the perception of faces , 2006, Philosophical Transactions of the Royal Society B: Biological Sciences.

[6]  Andrew Zisserman,et al.  Deep Face Recognition , 2015, BMVC.

[7]  Ramakant Nevatia,et al.  Face recognition using deep multi-pose representations , 2016, 2016 IEEE Winter Conference on Applications of Computer Vision (WACV).

[8]  R. Yin Looking at Upside-down Faces , 1969 .

[9]  K. Nakayama,et al.  The effect of face inversion on the human fusiform face area , 1998, Cognition.

[10]  Lisa R. Betts,et al.  Decoding of Faces and Face Components in Face-Sensitive Human Visual Cortex , 2010, Front. Psychology.

[11]  Mark H. Johnson,et al.  CONSPEC and CONLERN: a two-process theory of infant face recognition. , 1991, Psychological review.

[12]  L. Spillmann Receptive Fields of Visual Neurons: The Early Years , 2014, Perception.

[13]  John B. Troy,et al.  Non-Centered Spike-Triggered Covariance Analysis Reveals Neurotrophin-3 as a Developmental Regulator of Receptive Field Properties of ON-OFF Retinal Ganglion Cells , 2010, PLoS Comput. Biol..

[14]  Alex Martin,et al.  Properties and mechanisms of perceptual priming , 1998, Current Opinion in Neurobiology.

[15]  R. Reid,et al.  Diverse receptive fields in the lateral geniculate nucleus during thalamocortical development , 2000, Nature Neuroscience.

[16]  K. Nakayama,et al.  Human face recognition ability is specific and highly heritable , 2010, Proceedings of the National Academy of Sciences.

[17]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[18]  K. Grill-Spector,et al.  Repetition and the brain: neural models of stimulus-specific effects , 2006, Trends in Cognitive Sciences.

[19]  R. C. Rentería,et al.  Receptive field center size decreases and firing properties mature in ON and OFF retinal ganglion cells after eye opening in the mouse. , 2011, Journal of neurophysiology.

[20]  I. W. R. Bushneil,et al.  Neonatal recognition of the mother's face , 1989 .

[21]  Sing-Hang Cheung,et al.  Processing of configural and componential information in face-selective cortical areas , 2014, Cognitive neuroscience.

[22]  Leila Reddy,et al.  Reconstructing faces from fMRI patterns using deep generative neural networks , 2018, Communications Biology.

[23]  P. Heggelund,et al.  Development of spatial receptive-field organization and orientation selectivity in kitten striate cortex. , 1985, Journal of neurophysiology.

[24]  Jeff Donahue,et al.  Large Scale GAN Training for High Fidelity Natural Image Synthesis , 2018, ICLR.

[25]  Kalanit Grill-Spector,et al.  Extensive childhood experience with Pokémon suggests eccentricity drives organization of visual cortex , 2019, Nature Human Behaviour.

[26]  Jaeson Jang,et al.  Spontaneous generation of face recognition in untrained deep neural networks , 2019, bioRxiv.

[27]  C. C. Goren,et al.  Visual following and pattern discrimination of face-like stimuli by newborn infants. , 1975, Pediatrics.

[28]  G. Griffin,et al.  Caltech-256 Object Category Dataset , 2007 .

[29]  F. Simion,et al.  The origins of face perception: specific versus non‐specific mechanisms , 2001 .

[30]  R. Desimone,et al.  Neural mechanisms for visual memory and their role in attention. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[31]  Marcel A. J. van Gerven,et al.  Deep Neural Networks Reveal a Gradient in the Complexity of Neural Representations across the Ventral Stream , 2014, The Journal of Neuroscience.

[32]  Francesca Simion,et al.  Face perception and processing in early infancy: inborn predispositions and developmental changes , 2015, Front. Psychol..

[33]  Timo Aila,et al.  A Style-Based Generator Architecture for Generative Adversarial Networks , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[34]  Nikolaus Kriegeskorte,et al.  Deep neural networks: a new framework for modelling biological vision and brain information processing , 2015, bioRxiv.

[35]  C. Umilta,et al.  Face preference at birth. , 1996, Journal of experimental psychology. Human perception and performance.

[36]  I. Gauthier,et al.  How does the brain process upright and inverted faces? , 2002, Behavioral and cognitive neuroscience reviews.

[37]  Y. Sugita Face perception in monkeys reared with no exposure to faces , 2008, Proceedings of the National Academy of Sciences.

[38]  Kalle Åström,et al.  Transferring and Compressing Convolutional Neural Networks for Face Representations , 2016, ICIAR.

[39]  Shan Xu,et al.  Implementation-Independent Representation for Deep Convolutional Neural Networks and Humans in Processing Faces , 2020, bioRxiv.

[40]  Drew H. Abney,et al.  Journal of Experimental Psychology : Human Perception and Performance Influence of Musical Groove on Postural Sway , 2015 .

[41]  Nikolaus Kriegeskorte,et al.  Deep Supervised, but Not Unsupervised, Models May Explain IT Cortical Representation , 2014, PLoS Comput. Biol..

[42]  F. Simion,et al.  Can a Nonspecific Bias Toward Top-Heavy Patterns Explain Newborns' Face Preference? , 2004, Psychological science.

[43]  R. O’Reilly,et al.  Modeling hippocampal and neocortical contributions to recognition memory: a complementary-learning-systems approach. , 2003, Psychological review.

[44]  Jia Liu,et al.  Perception of Face Parts and Face Configurations: An fMRI Study , 2010, Journal of Cognitive Neuroscience.

[45]  Daniel D. Dilks,et al.  A critical review of the development of face recognition: Experience is less important than previously believed , 2012, Cognitive neuropsychology.

[46]  Margaret S. Livingstone,et al.  Seeing faces is necessary for face-patch formation , 2017, Nature Neuroscience.

[47]  Yizhen Zhang,et al.  Neural Encoding and Decoding with Deep Learning for Dynamic Natural Vision , 2016, Cerebral cortex.

[48]  J. Movshon,et al.  Adaptation changes the direction tuning of macaque MT neurons , 2004, Nature Neuroscience.

[49]  W. Freiwald,et al.  Face Processing Systems: From Neurons to Real-World Social Perception. , 2016, Annual review of neuroscience.

[50]  Shengcai Liao,et al.  Learning Face Representation from Scratch , 2014, ArXiv.

[51]  Ha Hong,et al.  Performance-optimized hierarchical models predict neural responses in higher visual cortex , 2014, Proceedings of the National Academy of Sciences.

[52]  R. Stevenson A CRITICAL REVIEW OF THE DEVELOPMENT OF HPCE INSTRUMENTATION , 1994 .

[53]  Ming Zhou,et al.  DNNBrain: A Unifying Toolbox for Mapping Deep Neural Networks and Brains , 2020, bioRxiv.

[54]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.