Paper information - Deep image reconstruction from human brain activity

Deep image reconstruction from human brain activity

Machine learning-based analysis of human functional magnetic resonance imaging (fMRI) patterns has enabled the visualization of perceptual content. However, it has been limited to the reconstruction with low-level image bases (Miyawaki et al., 2008; Wen et al., 2016) or to the matching to exemplars (Naselaris et al., 2009; Nishimoto et al., 2011). Recent work showed that visual cortical activity can be decoded (translated) into hierarchical features of a deep neural network (DNN) for the same input image, providing a way to make use of the information from hierarchical visual features (Horikawa & Kamitani, 2017). Here, we present a novel image reconstruction method, in which the pixel values of an image are optimized to make its DNN features similar to those decoded from human brain activity at multiple layers. We found that the generated images resembled the stimulus images (both natural images and artificial shapes) and the subjective visual content during imagery. While our model was solely trained with natural images, our method successfully generalized the reconstruction to artificial shapes, indicating that our model indeed ‘reconstructs’ or ‘generates’ images from brain activity, not simply matches to exemplars. A natural image prior introduced by another deep neural network effectively rendered semantically meaningful details to reconstructions by constraining reconstructed images to be similar to natural images. Furthermore, human judgment of reconstructions suggests the effectiveness of combining multiple DNN layers to enhance visual quality of generated images. The results suggest that hierarchical visual information in the brain can be effectively combined to reconstruct perceptual and subjective images.
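The optimization the abstract describes, adjusting pixel values until the image's features match target features at several layers, can be illustrated with a toy sketch. Everything here is an assumption made for illustration and not the authors' implementation: the "DNN" is a small random two-layer ReLU network, the "decoded" features are simulated from a known vector instead of being decoded from fMRI, the natural image prior is left out, and plain gradient descent stands in for whatever optimizer the paper uses. All function and variable names are hypothetical.

```python
import random

def matvec(W, v):
    """Multiply matrix W (a list of rows) by vector v."""
    return [sum(w * x for w, x in zip(row, v)) for row in W]

def matTvec(W, v):
    """Multiply the transpose of W by vector v."""
    return [sum(W[i][j] * v[i] for i in range(len(W))) for j in range(len(W[0]))]

def forward(x, weights):
    """Return pre-activations and ReLU activations of every layer."""
    pre, act, h = [], [], x
    for W in weights:
        z = matvec(W, h)
        h = [max(0.0, v) for v in z]
        pre.append(z)
        act.append(h)
    return pre, act

def loss_and_grad(x, weights, targets, betas):
    """Weighted squared feature error summed over layers, and its gradient w.r.t. x."""
    pre, act = forward(x, weights)
    loss = sum(b * sum((a - t) ** 2 for a, t in zip(al, tl))
               for b, al, tl in zip(betas, act, targets))
    grad = [0.0] * len(act[-1])
    for l in reversed(range(len(weights))):
        # Add this layer's own error term to the gradient arriving from above.
        grad = [g + 2.0 * betas[l] * (a - t)
                for g, a, t in zip(grad, act[l], targets[l])]
        # Backpropagate through the ReLU, then through the linear map.
        grad = [g if z > 0 else 0.0 for g, z in zip(grad, pre[l])]
        grad = matTvec(weights[l], grad)
    return loss, grad

def reconstruct(weights, targets, betas, n_pixels, steps=500, lr=0.01):
    """Gradient descent on the pixel vector, starting from a uniform gray image."""
    x = [0.5] * n_pixels
    history = []
    for _ in range(steps):
        loss, grad = loss_and_grad(x, weights, targets, betas)
        history.append(loss)
        x = [xi - lr * gi for xi, gi in zip(x, grad)]
    return x, history

# Toy setup (assumed): 8 "pixels", two random layers with 16 and 8 units.
rng = random.Random(0)
n_pixels, sizes = 8, [16, 8]
weights, fan_in = [], n_pixels
for n_out in sizes:
    weights.append([[rng.gauss(0.0, 1.0) / fan_in ** 0.5 for _ in range(fan_in)]
                    for _ in range(n_out)])
    fan_in = n_out

# Simulated stand-in for features decoded from brain activity.
x_true = [rng.random() for _ in range(n_pixels)]
_, targets = forward(x_true, weights)
betas = [1.0, 1.0]  # per-layer weights; equal weighting is an arbitrary choice

x_hat, history = reconstruct(weights, targets, betas, n_pixels)
print(f"feature loss: {history[0]:.4f} -> {history[-1]:.4f}")
```

Setting one entry of `betas` to zero drops that layer from the objective, which gives a simple way to see how matching a single layer differs from matching several layers at once. The sketch makes no claim about how close `x_hat` ends up to `x_true`, since the objective is non-convex.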

Guohua Shen | Tomoyasu Horikawa | Kei Majima | Yukiyasu Kamitani

[1] Andrea Vedaldi, et al. Understanding deep image representations by inverting them, 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[2] Ryan J. Prenger, et al. Bayesian Reconstruction of Natural Images from Human Brain Activity, 2009, Neuron.

[3] Tom Heskes, et al. Linear reconstruction of perceived images from human brain activity, 2013, NeuroImage.

[4] Nancy Kanwisher, et al. A cortical representation of the local visual environment, 1998, Nature.

[5] Leon A. Gatys, et al. Image Style Transfer Using Convolutional Neural Networks, 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[6] Luca Ambrogioni, et al. Generative adversarial networks for reconstructing natural images from brain activity, 2017, NeuroImage.

[7] J. DiCarlo, et al. Using goal-driven deep learning models to understand sensory cortex, 2016, Nature Neuroscience.

[8] Yizhen Zhang, et al. Neural Encoding and Decoding with Deep Learning for Dynamic Natural Vision, 2016, Cerebral Cortex.

[9] Tomoyasu Horikawa, et al. Generic decoding of seen and imagined objects using hierarchical visual features, 2015, Nature Communications.

[10] Quoc V. Le, et al. On optimization methods for deep learning, 2011, ICML.

[11] Soumith Chintala, et al. Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks, 2015, ICLR.

[12] Andrew Zisserman, et al. Very Deep Convolutional Networks for Large-Scale Image Recognition, 2014, ICLR.

[13] N. Kanwisher, et al. Cortical Regions Involved in Perceiving Object Shape, 2000, The Journal of Neuroscience.

[14] Ning Qian, et al. On the momentum term in gradient descent learning algorithms, 1999, Neural Networks.

[15] Yizhen Zhang, et al. Variational Autoencoder: An Unsupervised Model for Modeling and Decoding fMRI Activity in Visual Cortex, 2017, bioRxiv.

[16] R. Lotto, et al. Responses of human visual cortex to uniform surfaces, 2004, Proceedings of the National Academy of Sciences of the United States of America.

[17] J W Belliveau, et al. Borders of multiple visual areas in humans revealed by functional magnetic resonance imaging, 1995, Science.

[18] Li Fei-Fei, et al. ImageNet: A large-scale hierarchical image database, 2009, CVPR.

[19] J. Gallant, et al. Reconstructing Visual Experiences from Brain Activity Evoked by Natural Movies, 2011, Current Biology.

[20] Adrian T. Lee, et al. fMRI of human visual cortex, 1994, Nature.

[21] Thomas Brox, et al. Synthesizing the preferred inputs for neurons in neural networks via deep generator networks, 2016, NIPS.

[22] Thomas Brox, et al. Generating Images with Perceptual Similarity Metrics based on Deep Networks, 2016, NIPS.

[23] Jorge Nocedal, et al. On the limited memory BFGS method for large scale optimization, 1989, Math. Program.

[24] Masa-aki Sato, et al. Visual Image Reconstruction from Human Brain Activity using a Combination of Multiscale Local Image Decoders, 2008, Neuron.

[25] Yoshua Bengio, et al. Generative Adversarial Nets, 2014, NIPS.

[26] N. Kanwisher, et al. The Fusiform Face Area: A Module in Human Extrastriate Cortex Specialized for Face Perception, 1997, The Journal of Neuroscience.