Reconstructing faces from fMRI patterns using deep generative neural networks

Although distinct categories are reliably decoded from fMRI brain responses, it has proved more difficult to distinguish visually similar inputs, such as different faces. Here, we apply a recently developed deep learning system to reconstruct face images from human fMRI. We trained a variational auto-encoder (VAE) neural network using a GAN (Generative Adversarial Network) unsupervised procedure over a large data set of celebrity faces. The auto-encoder latent space provides a meaningful, topologically organized 1024-dimensional description of each image. We then presented several thousand faces to human subjects, and learned a simple linear mapping between the multi-voxel fMRI activation patterns and the 1024 latent dimensions. Finally, we applied this mapping to novel test images, translating fMRI patterns into VAE latent codes, and codes into face reconstructions. The system not only performed robust pairwise decoding (>95% correct), but also accurate gender classification, and even decoded which face was imagined, rather than seen.VanRullen and Reddy apply a state-of-the-art AI technique to brain decoding. After learning to translate multi-voxel fMRI activity patterns into the activation space of a deep generative neural network, each particular face viewed, or even imagined, by a human subject in the scanner can be visualized with unprecedented accuracy.

[1]  Marcel van Gerven,et al.  Reconstructing perceived faces from brain activations with deep adversarial neural decoding , 2017, NIPS.

[2]  Jack L. Gallant,et al.  A voxel-wise encoding model for early visual areas decodes mental images of remembered scenes , 2015, NeuroImage.

[3]  Luca Ambrogioni,et al.  Generative adversarial networks for reconstructing natural images from brain activity , 2017, NeuroImage.

[4]  J. DiCarlo,et al.  Using goal-driven deep learning models to understand sensory cortex , 2016, Nature Neuroscience.

[5]  M. Seghier,et al.  A network of occipito-temporal face-sensitive areas besides the right middle fusiform gyrus is necessary for normal face processing. , 2003, Brain : a journal of neurology.

[6]  J. Gallant,et al.  Identifying natural images from human brain activity , 2008, Nature.

[7]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[8]  R. VanRullen Perception Science in the Age of Deep Neural Networks , 2017, Front. Psychol..

[9]  Thomas Serre,et al.  Reading the mind's eye: Decoding category information during mental imagery , 2010, NeuroImage.

[10]  G. Yovel,et al.  Successful Decoding of Famous Faces in the Fusiform Face Area , 2015, PloS one.

[11]  S. Kosslyn,et al.  Neural foundations of imagery , 2001, Nature Reviews Neuroscience.

[12]  Yoshua Bengio,et al.  Generative Adversarial Networks , 2014, ArXiv.

[13]  J. Duncan,et al.  Top-Down Activation of Shape-Specific Population Codes in Visual Cortex during Mental Imagery , 2009, The Journal of Neuroscience.

[14]  Marcia K. Johnson,et al.  Decoding individual natural scene representations during perception and imagery , 2010 .

[15]  Brice A. Kuhl,et al.  Reconstructing Perceived and Retrieved Faces from Activity Patterns in Lateral Parietal Cortex , 2016, The Journal of Neuroscience.

[16]  Guohua Shen,et al.  Deep image reconstruction from human brain activity , 2017, bioRxiv.

[17]  Philippe G Schyns,et al.  Decoding face categories in diagnostic subregions of primary visual cortex , 2013, The European journal of neuroscience.

[18]  Andrew Zisserman,et al.  Deep Face Recognition , 2015, BMVC.

[19]  N. Kanwisher,et al.  Mental Imagery of Faces and Places Activates Corresponding Stimulus-Specific Brain Regions , 2000, Journal of Cognitive Neuroscience.

[20]  Jaakko Lehtinen,et al.  Progressive Growing of GANs for Improved Quality, Stability, and Variation , 2017, ICLR.

[21]  Eero P. Simoncelli,et al.  Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[22]  A. Ishai,et al.  The Gender of Face Stimuli is Represented in Multiple Regions in the Human Brain , 2011, Front. Hum. Neurosci..

[23]  Keiji Tanaka,et al.  Matching Categorical Object Representations in Inferior Temporal Cortex of Man and Monkey , 2008, Neuron.

[24]  David D. Cox,et al.  Opinion TRENDS in Cognitive Sciences Vol.11 No.8 Untangling invariant object recognition , 2022 .

[25]  F. Tong,et al.  Decoding the visual and subjective contents of the human brain , 2005, Nature Neuroscience.

[26]  Doris Y. Tsao,et al.  The Code for Facial Identity in the Primate Brain , 2017, Cell.

[27]  Yizhen Zhang,et al.  Variational Autoencoder: An Unsupervised Model for Modeling and Decoding fMRI Activity in Visual Cortex , 2017, bioRxiv.

[28]  Lars Muckli,et al.  Decoding Sound and Imagery Content in Early Visual Cortex , 2014, Current Biology.

[29]  T. Carlson,et al.  Patterns of Activity in the Categorical Representations of Objects , 2003, Journal of Cognitive Neuroscience.

[30]  W. K. Simmons,et al.  Circular analysis in systems neuroscience: the dangers of double dipping , 2009, Nature Neuroscience.

[31]  Ole Winther,et al.  Autoencoding beyond pixels using a learned similarity metric , 2015, ICML.

[32]  Y Kamitani,et al.  Neural Decoding of Visual Imagery During Sleep , 2013, Science.

[33]  Kshitij Dwivedi,et al.  End-to-End Deep Image Reconstruction From Human Brain Activity , 2018, bioRxiv.

[34]  Brice A. Kuhl,et al.  Neural portraits of perception: Reconstructing face images from evoked brain activity , 2014, NeuroImage.

[35]  R. Goebel,et al.  Individual faces elicit distinct response patterns in human anterior temporal cortex , 2007, Proceedings of the National Academy of Sciences.

[36]  Prafulla Dhariwal,et al.  Glow: Generative Flow with Invertible 1x1 Convolutions , 2018, NeurIPS.

[37]  Marcia K. Johnson,et al.  Decoding individual natural scene representations during perception and imagery , 2010, Front. Hum. Neurosci..

[38]  Juan Manuel Contreras,et al.  Multivoxel Patterns in Fusiform Face Area Differentiate Faces by Sex and Race , 2013, PloS one.

[39]  Jesper Andersson,et al.  A multi-modal parcellation of human cerebral cortex , 2016, Nature.

[40]  N. Kanwisher,et al.  The Fusiform Face Area: A Module in Human Extrastriate Cortex Specialized for Face Perception , 1997, The Journal of Neuroscience.

[41]  Ghislain St-Yves,et al.  Generative Adversarial Networks Conditioned on Brain Activity Reconstruct Seen Images , 2018, bioRxiv.

[42]  A. Ishai,et al.  Distributed and Overlapping Representations of Faces and Objects in Ventral Temporal Cortex , 2001, Science.

[43]  Max Welling,et al.  Auto-Encoding Variational Bayes , 2013, ICLR.

[44]  Xiaogang Wang,et al.  Deep Learning Face Attributes in the Wild , 2014, 2015 IEEE International Conference on Computer Vision (ICCV).