HNCcorr: combinatorial optimization for neuron identification

We present a combinatorial algorithm for cell detection in two-photon calcium imaging. Calcium imaging is a modern technique used by neuroscientists for recording movies of in-vivo neuronal activity at cellular resolution. The proposed algorithm, named HNCcorr, builds on the combinatorial clustering problem Hochbaum’s Normalized Cut (HNC). HNC is a model that trades off two goals: One goal is that the cluster has low similarity to the remaining objects. The second goal is that the cluster is highly similar to itself. The HNC model is closely related to the Normalized Cut problem of Shi and Malik, a well-known problem in image segmentation. However, whereas Normalized Cut is an NP-hard problem, HNC is solvable in polynomial time. The neuronal cell detection in calcium imaging movies is viewed here as a clustering problem. HNCcorr utilizes HNC to detect cells in these movies as coherent clusters of pixels that are highly distinct from the remaining pixels. HNCcorr guarantees, unlike existing methodologies for cell identification, a globally optimal solution to the underlying optimization problem. Of independent interest is a novel method, named similarity-squared , that is devised here for measuring similarity. In an experimental study on data from the Neurofinder cell identification benchmark, HNCcorr is a top performer. In particular, it achieves a higher average score than two frequently used matrix factorization algorithms. The Python and Matlab implementations of HNCcorr used here are publicly available. The use of HNCcorr demonstrates that combinatorial optimization is a valuable tool for neuroscience and other biomedical disciplines.

[1]  Dorit S. Hochbaum,et al.  Ranking of multidimensional drug profiling data by fractional-adjusted bi-partitional scores , 2012, Bioinform..

[2]  Ronen Basri,et al.  Hierarchy and adaptivity in segmenting visual scenes , 2006, Nature.

[3]  Wojciech Kotlowski,et al.  Learning Rule Ensembles for Ordinal Classification with Monotonicity Constraints , 2009, Fundam. Informaticae.

[4]  J. Rownd,et al.  Characteristics of sensitometric curves of radiographic films. , 2003, Medical physics.

[5]  Liam Paninski,et al.  Bayesian spike inference from calcium imaging data , 2013, 2013 Asilomar Conference on Signals, Systems and Computers.

[6]  Liam Paninski,et al.  Sparse nonnegative deconvolution for compressive calcium imaging: algorithms and phase transitions , 2013, NIPS.

[7]  Dorit S. Hochbaum,et al.  A Polynomial Time Algorithm for Rayleigh Ratio on Discrete Variables: Replacing Spectral Techniques for Expander Ratio, Normalized Cut, and Cheeger Constant , 2013, Oper. Res..

[8]  Dorit S. Hochbaum,et al.  Real-time robust target tracking in videos via graph-cuts , 2013, Electronic Imaging.

[9]  Daniela Witten,et al.  EXACT SPIKE TRAIN INFERENCE VIA ℓ0 OPTIMIZATION. , 2017, The annals of applied statistics.

[10]  Rafael Yuste,et al.  Fast nonnegative deconvolution for spike train inference from population calcium imaging. , 2009, Journal of neurophysiology.

[11]  Dorit S. Hochbaum,et al.  Nuclear threat detection with mobile distributed sensor networks , 2011, Ann. Oper. Res..

[12]  Liam Paninski,et al.  OnACID: Online Analysis of Calcium Imaging Data in Real Time , 2017, bioRxiv.

[13]  Selmaan N. Chettih,et al.  Dynamic Reorganization of Neuronal Activity Patterns in Parietal Cortex , 2017, Cell.

[14]  Kenneth D. Harris,et al.  Community-based benchmarking improves spike rate inference from two-photon calcium imaging data , 2018, PLoS Comput. Biol..

[15]  Dorit S. Hochbaum,et al.  Sparse Computation for Large-Scale Data Mining , 2016, IEEE Trans. Big Data.

[16]  Dorit S. Hochbaum,et al.  The Supervised Normalized Cut Method for Detecting, Classifying, and Identifying Special Nuclear Materials , 2014, INFORMS J. Comput..

[17]  Dorit S. Hochbaum,et al.  A comparative study of the leading machine learning techniques and two new optimization algorithms , 2019, Eur. J. Oper. Res..

[18]  Mario Dipoppa,et al.  Suite2p: beyond 10,000 neurons with standard two-photon microscopy , 2016, bioRxiv.

[19]  Dorit S. Hochbaum,et al.  Evaluating performance of image segmentation criteria and techniques , 2013, EURO J. Comput. Optim..

[20]  Dorit S. Hochbaum Polynomial Time Algorithms for Ratio Regions and a Variant of Normalized Cut , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  Shawn D. Burton,et al.  NeuroElectro: a window to the world's neuron electrophysiology data , 2014, Front. Neuroinform..

[22]  Tülay Adali,et al.  Parameter-free automated extraction of neuronal signals from calcium imaging data , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[23]  Attila Losonczy,et al.  SIMA: Python software for analysis of dynamic fluorescence imaging data , 2014, Front. Neuroinform..

[24]  David J. Slate,et al.  Letter Recognition Using Holland-Style Adaptive Classifiers , 1991, Machine Learning.

[25]  David Pfau,et al.  Simultaneous Denoising, Deconvolution, and Demixing of Calcium Imaging Data , 2016, Neuron.

[26]  James M. Otis,et al.  Visualization of cortical, subcortical and deep brain neural circuit dynamics during naturalistic mammalian behavior with head-mounted microscopes and chronically implanted lenses , 2016, Nature Protocols.

[27]  Benjamin F. Grewe,et al.  High-speed in vivo calcium imaging reveals neuronal network activity with near-millisecond precision , 2010, Nature Methods.

[28]  H. Sebastian Seung,et al.  Automatic Neuron Detection in Calcium Imaging Data Using Convolutional Networks , 2016, NIPS.

[29]  Jitendra Malik,et al.  Normalized Cuts and Image Segmentation , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[30]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[31]  Varghese S. Jacob,et al.  Prognosis Using an Isotonic Prediction Technique , 2004, Manag. Sci..

[32]  C. Stosiek,et al.  In vivo two-photon calcium imaging of neuronal networks , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[33]  Quico Spaen,et al.  Sparse-Reduced Computation - Enabling Mining of Massively-large Data Sets , 2016, ICPRAM.

[34]  Quico Spaen,et al.  High-performance geometric algorithms for sparse computation in big data analytics , 2017, 2017 IEEE International Conference on Big Data (Big Data).

[35]  Matthew Eicholtz,et al.  Fast, Simple Calcium Imaging Segmentation with Fully Convolutional Networks , 2017, DLMIA/ML-CDS@MICCAI.

[36]  Petros Drineas,et al.  FAST MONTE CARLO ALGORITHMS FOR MATRICES II: COMPUTING A LOW-RANK APPROXIMATION TO A MATRIX∗ , 2004 .

[37]  Adam M. Packer,et al.  Extracting regions of interest from biological images with convolutional sparse block coding , 2013, NIPS.

[38]  Matthias Bethge,et al.  Benchmarking Spike Rate Inference in Population Calcium Imaging , 2016, Neuron.

[39]  Dorit S. Hochbaum,et al.  The Pseudoflow Algorithm: A New Algorithm for the Maximum-Flow Problem , 2008, Oper. Res..

[40]  Toru Aonishi,et al.  Detecting cells using non-negative matrix factorization on calcium imaging data , 2014, Neural Networks.

[41]  Mark J. Schnitzer,et al.  Automated Analysis of Cellular Signals from Large-Scale Calcium Imaging Data , 2009, Neuron.

[42]  Fred A. Hamprecht,et al.  Sparse Space-Time Deconvolution for Calcium Image Analysis , 2014, NIPS.

[43]  Robert E. Tarjan,et al.  A Fast Parametric Maximum Flow Algorithm and Applications , 1989, SIAM J. Comput..

[44]  Hongbo Jia,et al.  In vivo two-photon imaging of sensory-evoked dendritic calcium signals in cortical neurons , 2011, Nature Protocols.

[45]  Dorit S. Hochbaum,et al.  Solving integer programs over monotone inequalities in three variables: A framework for half integrality and good approximations , 2002, Eur. J. Oper. Res..