More than meets the eye: Self-supervised depth reconstruction from brain activity

In the past few years, significant advancements were made in reconstruction of observed natural images from fMRI brain recordings using deep-learning tools. Here, for the first time, we show that dense 3D depth maps of observed 2D natural images can also be recovered directly from fMRI brain recordings. We use an off-the-shelf method to estimate the unknown depth maps of natural images. This is applied to both: (i) the small number of images presented to subjects in an fMRI scanner (images for which we have fMRI recordings – referred to as “paired” data), and (ii) a very large number of natural images with no fMRI recordings (“unpaired data”). The estimated depth maps are then used as an auxiliary reconstruction criterion to train for depth reconstruction directly from fMRI. We propose two main approaches: Depth-only recovery and joint image-depth RGBD recovery. Because the number of available “paired” training data (images with fMRI) is small, we enrich the training data via self-supervised cycle-consistent training on many “unpaired” data (natural images & depth maps without fMRI). This is achieved using our newly defined and trained Depth-based Perceptual Similarity metric as a reconstruction criterion. We show that predicting the depth map directly from fMRI outperforms its indirect sequential recovery from the reconstructed images. We further show that activations from early cortical visual areas dominate our depth reconstruction results, and propose means to characterize fMRI voxels by their degree of depth-information tuning. This work adds an important layer of decoded information, extending the current envelope of visual brain decoding capabilities.

[1]  Yizhen Zhang,et al.  Variational autoencoder: An unsupervised model for encoding and decoding fMRI activity in visual cortex , 2019, NeuroImage.

[2]  J. Gallant,et al.  Reconstructing Visual Experiences from Brain Activity Evoked by Natural Movies , 2011, Current Biology.

[3]  G. Rees,et al.  Neuroimaging: Decoding mental states from brain activity in humans , 2006, Nature Reviews Neuroscience.

[4]  Marcel A. J. van Gerven,et al.  Deep Neural Networks Reveal a Gradient in the Complexity of Neural Representations across the Ventral Stream , 2014, The Journal of Neuroscience.

[5]  J. Gallant,et al.  Identifying natural images from human brain activity , 2008, Nature.

[6]  Tomoyasu Horikawa,et al.  Generic decoding of seen and imagined objects using hierarchical visual features , 2015, Nature Communications.

[7]  Li Tong,et al.  BigGAN-based Bayesian Reconstruction of Natural Images from Human Brain Activity , 2020, Neuroscience.

[8]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[9]  Yunfeng Lin,et al.  DCNN-GAN: Reconstructing Realistic Image from fMRI , 2019, 2019 16th International Conference on Machine Vision Applications (MVA).

[10]  Ryan J. Prenger,et al.  Bayesian Reconstruction of Natural Images from Human Brain Activity , 2009, Neuron.

[11]  Kendrick N Kay,et al.  I can see what you see , 2009, Nature Neuroscience.

[12]  Guohua Shen,et al.  Deep image reconstruction from human brain activity , 2017, bioRxiv.

[13]  W. Vanduffel,et al.  Areal differences in depth cue integration between monkey and human , 2019, PLoS biology.

[14]  Michal Irani,et al.  From voxels to pixels and back: Self-supervision in natural-image reconstruction from fMRI , 2019, NeurIPS.

[15]  Ghislain St-Yves,et al.  Generative Adversarial Networks Conditioned on Brain Activity Reconstruct Seen Images , 2018, bioRxiv.

[16]  F. Tong,et al.  Decoding the visual and subjective contents of the human brain , 2005, Nature Neuroscience.

[17]  Michal Irani,et al.  Self-supervised Natural Image Reconstruction and Large-scale Semantic Classification from Brain Activity , 2020, NeuroImage.

[18]  Johannes Burge,et al.  Depth variation and stereo processing tasks in natural scenes , 2018, Journal of vision.

[19]  Chi Zhang,et al.  Constraint-Free Natural Image Reconstruction From fMRI Signals Based on Convolutional Neural Network , 2018, Front. Hum. Neurosci..

[20]  Yizhen Zhang,et al.  Neural Encoding and Decoding with Deep Learning for Dynamic Natural Vision , 2016, Cerebral cortex.

[21]  Konrad Schindler,et al.  Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-Shot Cross-Dataset Transfer , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  Alexei A. Efros,et al.  The Unreasonable Effectiveness of Deep Features as a Perceptual Metric , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[23]  Andrew E Welchman,et al.  The Human Brain in Depth: How We See in 3D. , 2016, Annual review of vision science.

[24]  Luca Ambrogioni,et al.  Generative adversarial networks for reconstructing natural images from brain activity , 2017, NeuroImage.

[25]  L. Reddy,et al.  Reconstructing Natural Scenes from fMRI Patterns using BigBiGAN , 2020, 2020 International Joint Conference on Neural Networks (IJCNN).

[26]  Xinbo Gao,et al.  Reconstructing seen image from brain activity by visually-guided cognitive representation and adversarial learning , 2021, NeuroImage.