
Generalization in data-driven models of primary visual cortex

Deep neural networks (DNNs) have set new standards in predicting the responses of neural populations to visual input. Most such DNNs consist of a convolutional network (core) shared across all neurons, which learns a representation of neural computation in visual cortex, and a neuron-specific readout that linearly combines the relevant features in this representation. The goal of this paper is to test whether such a representation is indeed generally characteristic of visual cortex, i.e., generalizes between animals of a species, and what factors contribute to obtaining such a generalizing core. To push all non-linear computations into the core, where the generalizing cortical features should be learned, we devise a novel readout that reduces the number of parameters per neuron in the readout by up to two orders of magnitude compared to the previous state of the art. It does so by taking advantage of retinotopy and learning a Gaussian distribution over the neuron’s receptive field position. With this new readout, we train our network on neural responses from mouse primary visual cortex (V1) and obtain a gain in performance of 7% compared to the previous state-of-the-art network. We then investigate whether the convolutional core indeed captures general cortical features by using the core in transfer learning to a different animal. When transferring a core trained on thousands of neurons from various animals and scans, we exceed the performance of training directly on that animal by 12% and outperform a commonly used VGG16 core pre-trained on ImageNet by 33%. In addition, transfer learning with our data-driven core is more data-efficient than direct training, achieving the same performance with only 40% of the data.
Our model with its novel readout thus sets a new state of the art for neural response prediction in mouse visual cortex from natural images, generalizes between animals, and captures characteristic cortical features better than current task-driven pre-training approaches such as VGG16.
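
The readout idea in the abstract (one receptive field position per neuron, described by a Gaussian, followed by a linear combination of the core features at that position) can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation: the function names `bilinear_sample` and `gaussian_readout`, the normalized coordinate range [-1, 1], the axis-aligned (diagonal) Gaussian, and the choice to sample a position during training and use the mean otherwise are all assumptions made for illustration.

```python
import math
import random


def bilinear_sample(feature_map, x, y):
    """Interpolate a C x H x W feature map (nested lists) at a continuous
    position (x, y) given in normalized coordinates in [-1, 1]."""
    height = len(feature_map[0])
    width = len(feature_map[0][0])
    # Map normalized coordinates to pixel coordinates and clamp to the borders.
    px = min(max((x + 1) / 2 * (width - 1), 0.0), width - 1)
    py = min(max((y + 1) / 2 * (height - 1), 0.0), height - 1)
    x0, y0 = int(math.floor(px)), int(math.floor(py))
    x1, y1 = min(x0 + 1, width - 1), min(y0 + 1, height - 1)
    wx, wy = px - x0, py - y0
    sampled = []
    for channel in feature_map:
        top = (1 - wx) * channel[y0][x0] + wx * channel[y0][x1]
        bottom = (1 - wx) * channel[y1][x0] + wx * channel[y1][x1]
        sampled.append((1 - wy) * top + wy * bottom)
    return sampled


def gaussian_readout(feature_map, mu, sigma, weights, bias,
                     training=False, rng=random):
    """Predict one neuron's response from the core's feature map.

    mu, sigma: assumed per-neuron mean and standard deviation (x, y) of the
    receptive field position; weights, bias: assumed per-neuron linear readout.
    """
    x, y = mu
    if training:
        # Assumption: draw a position from the Gaussian while training.
        x = rng.gauss(mu[0], sigma[0])
        y = rng.gauss(mu[1], sigma[1])
    features = bilinear_sample(feature_map, x, y)
    return sum(w * f for w, f in zip(weights, features)) + bias


# Toy core output: 2 channels on a 3 x 3 grid.
# Channel 0 encodes the column index, channel 1 the row index.
toy_features = [
    [[0, 1, 2], [0, 1, 2], [0, 1, 2]],
    [[0, 0, 0], [1, 1, 1], [2, 2, 2]],
]
response = gaussian_readout(toy_features, mu=(0.5, 0.0), sigma=(0.1, 0.1),
                            weights=[2.0, 3.0], bias=0.5)
```

In this sketch each neuron stores one weight per feature channel, two mean coordinates, two standard deviations, and a bias, so the per-neuron parameter count does not depend on the spatial size of the feature map.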

Alexander S. Ecker | Andreas S. Tolias | Fabian H. Sinz | Konstantin-Klemens Lurz | Edgar Y. Walker | Erick Cobos | Taliah Muhammad | Santiago A. Cadena | Konstantin Willeke | Akshay K. Jagadish | Mohammad Bashiri | Eric Wang
