Automated Curation of CNMF-E-Extracted ROI Spatial Footprints and Calcium Traces Using Open-Source AutoML Tools

In vivo 1-photon (1p) calcium imaging is an increasingly prevalent method in behavioral neuroscience. Numerous analysis pipelines have been developed to improve the reliability and scalability of pre-processing and ROI extraction for these large calcium imaging datasets. Despite these advancements in pre-processing methods, manual curation of the extracted spatial footprints and calcium traces of neurons remains important for quality control. Here, we propose an additional semi-automated curation step for sorting spatial footprints and calcium traces from putative neurons extracted using the popular constrained non-negative matrixfactorization for microendoscopic data (CNMF-E) algorithm. We used the automated machine learning (AutoML) tools TPOT and AutoSklearn to generate classifiers to curate the extracted ROIs trained on a subset of human-labeled data. AutoSklearn produced the best performing classifier, achieving an F1 score >92% on the ground truth test dataset. This automated approach is a useful strategy for filtering ROIs with relatively few labeled data points and can be easily added to pre-existing pipelines currently using CNMF-E for ROI extraction.

[1]  S. Sawilowsky New Effect Size Rules of Thumb , 2009 .

[2]  James M. Otis,et al.  Visualization of cortical, subcortical and deep brain neural circuit dynamics during naturalistic mammalian behavior with head-mounted microscopes and chronically implanted lenses , 2016, Nature Protocols.

[3]  Alcino J. Silva,et al.  A shared neural ensemble links distinct contextual memories encoded close in time , 2016, Nature.

[4]  Randal S. Olson,et al.  Evaluation of a Tree-based Pipeline Optimization Tool for Automating Data Science , 2016, GECCO.

[5]  K. Svoboda,et al.  Principles of Two-Photon Excitation Microscopy and Its Applications to Neuroscience , 2006, Neuron.

[6]  Liam Paninski,et al.  Efficient and accurate extraction of in vivo calcium signals from microendoscopic video data , 2016, eLife.

[7]  Samuel S-H Wang,et al.  Fast calcium sensor proteins for monitoring neural activity , 2014, Neurophotonics.

[8]  Benjamin F. Grewe,et al.  Cellular Level Brain Imaging in Behaving Mammals: An Engineering Approach , 2015, Neuron.

[9]  Anna Harutyunyan,et al.  Persistence of neuronal representations through time and damage in the hippocampus , 2019 .

[10]  Reza Farivar,et al.  Towards Automated Machine Learning: Evaluation and Comparison of AutoML Approaches and Tools , 2019, 2019 IEEE 31st International Conference on Tools with Artificial Intelligence (ICTAI).

[11]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[12]  Aaron Klein,et al.  Auto-sklearn: Efficient and Robust Automated Machine Learning , 2019, Automated Machine Learning.

[13]  Lina M. Tran,et al.  A compact head-mounted endoscope for in vivo calcium imaging in freely-behaving mice , 2018, bioRxiv.

[14]  Jason Lines,et al.  HIVE-COTE: The Hierarchical Vote Collective of Transformation-Based Ensembles for Time Series Classification , 2016, 2016 IEEE 16th International Conference on Data Mining (ICDM).

[15]  Fan Wang,et al.  MIN1PIPE: A Miniscope 1-Photon-Based Calcium Imaging Signal Extraction Pipeline. , 2018, Cell reports.

[16]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[17]  Eftychios A Pnevmatikakis,et al.  Analysis pipelines for calcium imaging data , 2019, Current Opinion in Neurobiology.

[18]  A. Gamal,et al.  Miniaturized integration of a fluorescence microscope , 2011, Nature Methods.

[19]  P. Fearnhead,et al.  Optimal detection of changepoints with a linear computational cost , 2011, 1101.1438.

[20]  Randal S. Olson,et al.  TPOT: A Tree-based Pipeline Optimization Tool for Automating Machine Learning , 2016, AutoML@ICML.

[21]  Kenneth D. Harris,et al.  High-dimensional geometry of population responses in visual cortex , 2019, Nat..

[22]  Mark J. Schnitzer,et al.  Automated Analysis of Cellular Signals from Large-Scale Calcium Imaging Data , 2009, Neuron.

[23]  Yaniv Ziv,et al.  Hippocampal ensemble dynamics timestamp events in long-term memory , 2015, eLife.

[24]  Jacob Cohen Statistical Power Analysis for the Behavioral Sciences , 1969, The SAGE Encyclopedia of Research Design.