Enabling hyperparameter optimization in sequential autoencoders for spiking neural data

Continuing advances in neural interfaces have enabled simultaneous monitoring of spiking activity from hundreds to thousands of neurons. To interpret these large-scale data, several methods have been proposed to infer latent dynamic structure from high-dimensional datasets. One recent line of work uses recurrent neural networks in a sequential autoencoder (SAE) framework to uncover dynamics. SAEs are an appealing option for modeling nonlinear dynamical systems, and they enable a precise link between neural activity and behavior on a single-trial basis. However, the very large parameter count and complexity of SAEs relative to other models have raised concern that SAEs may only perform well on very large training sets. We hypothesized that, with a method to systematically optimize hyperparameters (HPs), SAEs might perform well even with limited training data. Such a breakthrough would greatly extend their applicability. However, we find that SAEs applied to spiking neural data are prone to a particular form of overfitting that cannot be detected using standard validation metrics, which prevents standard HP searches. We develop and test two potential solutions: an alternate validation method ("sample validation") and a novel regularization method ("coordinated dropout"). These innovations prevent overfitting effectively and allow us to test whether SAEs can achieve good performance on limited data through large-scale HP optimization. On data recorded from motor cortex while monkeys made reaches in various directions, large-scale HP optimization allowed SAEs to maintain performance better as dataset size shrank. Our results should greatly extend the applicability of SAEs in extracting latent dynamics from sparse, multidimensional data, such as neural population spiking activity.
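To make the masking idea concrete, the short NumPy sketch below illustrates coordinated dropout as described above. The function names, the keep probability of 0.7, the dropout-style rescaling, and the placeholder model callable are illustrative assumptions rather than the authors' implementation; the essential point is that each data element is randomly assigned either to the network's input or to the reconstruction loss, never both, so the model must predict withheld elements from the remaining ones rather than copying them through.

import numpy as np


def coordinated_dropout_masks(shape, keep_prob, rng):
    # Randomly split data elements into an "input" set and a "loss" set.
    # Elements with mask_in == 1 are fed to the network but excluded from
    # the reconstruction loss; the complementary elements (mask_loss == 1)
    # are hidden from the input and must be predicted from the rest.
    mask_in = rng.random(shape) < keep_prob
    mask_loss = ~mask_in
    return mask_in.astype(float), mask_loss.astype(float)


def coordinated_dropout_loss(spikes, model, keep_prob=0.7, rng=None):
    # One training-step loss under coordinated dropout (illustrative only).
    #   spikes : binned spike counts, shape (trials, time, neurons)
    #   model  : placeholder callable mapping masked spikes to inferred rates
    rng = rng or np.random.default_rng()
    mask_in, mask_loss = coordinated_dropout_masks(spikes.shape, keep_prob, rng)

    # Rescale surviving elements, as in standard dropout, so the expected
    # input magnitude is unchanged (an assumption of this sketch).
    masked_input = spikes * mask_in / keep_prob

    rates = model(masked_input)  # inferred firing rates, same shape as spikes

    # Poisson negative log-likelihood (up to a constant), evaluated only on
    # the elements withheld from the input -- the network cannot score well
    # here by simply passing single-neuron spiking through to its output.
    nll = rates - spikes * np.log(rates + 1e-8)
    return (nll * mask_loss).sum() / np.maximum(mask_loss.sum(), 1.0)

Sample validation follows a similar element-wise logic: rather than re-drawing the mask at every training step, a fixed random subset of data elements is withheld throughout training and its reconstruction error serves purely as a validation metric, exposing the overfitting mode that trial-level validation misses.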