Stimulus domain transfer in recurrent models for large scale cortical population prediction on video

To better understand the representations in visual cortex, we need to generate better predictions of neural activity in awake animals presented with their ecological input: natural video. Despite recent advances in models for static images, models for predicting responses to natural video are scarce and standard linear-nonlinear models perform poorly. We developed a new deep recurrent network architecture that predicts inferred spiking activity of thousands of mouse V1 neurons simultaneously recorded with two-photon microscopy, while accounting for confounding factors such as the animal’s gaze position and brain state changes related to running state and pupil dilation. Powerful system identification models provide an opportunity to gain insight into cortical functions through in silico experiments that can subsequently be tested in the brain. However, in many cases this approach requires that the model is able to generalize to stimulus statistics that it was not trained on, such as band-limited noise and other parameterized stimuli. We investigated these domain transfer properties in our model and find that our model trained on natural images is able to correctly predict the orientation tuning of neurons in responses to artificial noise stimuli. Finally, we show that we can fully generalize from movies to noise and maintain high predictive performance on both stimulus domains by fine-tuning only the final layer’s weights on a network otherwise trained on natural movies. The converse, however, is not true.

[1]  M. Stryker,et al.  Modulation of Visual Responses by Behavioral State in Mouse Visual Cortex , 2010, Neuron.

[2]  D. McCormick,et al.  Waking State: Rapid Variations Modulate Neural and Behavioral Responses , 2015, Neuron.

[3]  Yves Frégnac,et al.  Adaptation of the simple or complex nature of V1 receptive fields to visual statistics , 2011, Nature Neuroscience.

[4]  George H. Denfield,et al.  Pupil Fluctuations Track Fast Switching of Cortical States during Quiet Wakefulness , 2014, Neuron.

[5]  Leon A. Gatys,et al.  Deep convolutional models improve predictions of macaque V1 responses to natural images , 2017, bioRxiv.

[6]  Tai Sing Lee,et al.  Convolutional neural network models of V1 responses to complex patterns , 2018 .

[7]  Kilian Q. Weinberger,et al.  Densely Connected Convolutional Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[8]  Alexander S. Ecker,et al.  State Dependence of Noise Correlations in Macaque Primary Visual Cortex , 2014, Neuron.

[9]  Gaël Varoquaux,et al.  The NumPy Array: A Structure for Efficient Numerical Computation , 2011, Computing in Science & Engineering.

[10]  Alexander S. Ecker,et al.  Neural system identification for large populations separating "what" and "where" , 2017, NIPS.

[11]  J. Gallant,et al.  Predicting neuronal responses during natural vision , 2005, Network.

[12]  J. S Stahl,et al.  A comparison of video and magnetic search coil recordings of mouse eye movements , 2000, Journal of Neuroscience Methods.

[13]  James A. Bednar,et al.  Model Constrained by Visual Hierarchy Improves Prediction of Neural Responses to Natural Scenes , 2016, PLoS Comput. Biol..

[14]  Andrew Zisserman,et al.  Spatial Transformer Networks , 2015, NIPS.

[15]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[16]  J. Gallant,et al.  Natural Stimulus Statistics Alter the Receptive Field Structure of V1 Neurons , 2004, The Journal of Neuroscience.

[17]  Yoshua Bengio,et al.  Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.

[18]  et al.,et al.  Jupyter Notebooks - a publishing format for reproducible computational workflows , 2016, ELPUB.

[19]  John D. Hunter,et al.  Matplotlib: A 2D Graphics Environment , 2007, Computing in Science & Engineering.

[20]  J. Gallant,et al.  A Three-Dimensional Spatiotemporal Receptive Field Model Explains Responses of Area MT Neurons to Naturalistic Movies , 2011, The Journal of Neuroscience.

[21]  Alexander S. Ecker,et al.  DataJoint: managing big scientific data using MATLAB or Python , 2015, bioRxiv.

[22]  K. Svoboda,et al.  A large field of view two-photon mesoscope with subcellular resolution for in vivo imaging , 2016, bioRxiv.

[23]  Chethan Pandarinath,et al.  LFADS - Latent Factor Analysis via Dynamical Systems , 2016, ArXiv.

[24]  J. Poulet,et al.  Internal brain state regulates membrane potential synchrony in barrel cortex of behaving mice , 2008, Nature.

[25]  Liam Paninski,et al.  Multilayer Recurrent Network Models of Primate Retinal Ganglion Cell Responses , 2016, ICLR.

[26]  Ming Li,et al.  Convolutional neural network models of V1 responses to complex patterns , 2018, Journal of Computational Neuroscience.

[27]  M. Frens,et al.  Three-dimensional optokinetic eye movements in the C57BL/6J mouse. , 2010, Investigative ophthalmology & visual science.

[28]  Richard A. Andersen,et al.  A back-propagation programmed network that simulates response properties of a subset of posterior parietal neurons , 1988, Nature.

[29]  Brett J. Graham,et al.  A Self-Calibrating, Camera-Based Eye Tracker for the Recording of Rodent Eye Movements , 2010, Front. Neurosci..

[30]  P. Dayan,et al.  A mathematical model explains saturating axon guidance responses to molecular gradients , 2016, eLife.

[31]  David Pfau,et al.  Simultaneous Denoising, Deconvolution, and Demixing of Calcium Imaging Data , 2016, Neuron.

[32]  Sepp Hochreiter,et al.  Fast and Accurate Deep Network Learning by Exponential Linear Units (ELUs) , 2015, ICLR.

[33]  R. Desimone,et al.  Predicting responses of nonlinear neurons in monkey striate cortex to complex patterns , 1992, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[34]  Brian Lau,et al.  Computational subunits of visual cortical neurons revealed by artificial neural networks , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[35]  William F. Kindel,et al.  Using deep learning to reveal the neural code for images in primary visual cortex , 2017, ArXiv.

[36]  Ryan J. Prenger,et al.  Nonlinear V1 responses to natural scenes revealed by neural network analysis , 2004, Neural Networks.

[37]  Fei-Fei Li,et al.  Large-Scale Video Classification with Convolutional Neural Networks , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[38]  Dirk Merkel,et al.  Docker: lightweight Linux containers for consistent development and deployment , 2014 .

[39]  M. Stryker,et al.  A Cortical Circuit for Gain Control by Behavioral State , 2014, Cell.

[40]  Nicholas A. Steinmetz,et al.  Spontaneous behaviors drive multidimensional, brain-wide activity , 2018, bioRxiv.

[41]  Maneesh Sahani,et al.  Learning visual motion in recurrent neural networks , 2012, NIPS.

[42]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[43]  Eero P. Simoncelli,et al.  A Convolutional Subunit Model for Neuronal Responses in Macaque V1 , 2015, The Journal of Neuroscience.

[44]  Luca Antiga,et al.  Automatic differentiation in PyTorch , 2017 .