Probabilistic Joint Segmentation and Labeling of C. elegans Neurons

Automatic identification and segmentation of the neurons of C. elegans enables evaluating nervous system mutations, positional variability, and allows us to conduct high-throughput population studies employing many animals. A recently introduced transgene of C. elegans, named “NeuroPAL” has enabled the efficient annotation of neurons and the construction of a statistical atlas of their positions. Previous atlas-based segmentation approaches have modeled images of cells as a mixture model. The expectation-maximization (EM) algorithm and its variants are used to find the (local) maximum likelihood parameters for this class of models. We present a variation of the EM algorithm called Sinkhorn-EM (sEM) that uses regularized optimal transport Sinkhorn iterations to enforce constraints on the marginals of the joint distribution of observed variables and latent assignments in order to incorporate our prior information about cell sizes into the cluster-data assignment proportions. We apply our method to the problem of segmenting and labeling neurons in fluorescent microscopy images of C. elegans specimens. We show empirically that sEM outperforms vanilla EM and a recently proposed 3-step (filter, detect, identify) labeling approach. Open source code implementing this method is available at https://github.com/amin-nejat/SinkhornEM.

[1]  Liam Paninski,et al.  NeuroPAL: A Neuronal Polychromatic Atlas of Landmarks for Whole-Brain Imaging in C. elegans , 2019, bioRxiv.

[2]  Sol Ah Lee,et al.  Graphical-model framework for automated annotation of cell identities in dense cellular images , 2020, bioRxiv.

[3]  Jonathan Niles-Weed,et al.  Sinkhorn EM: An Expectation-Maximization algorithm based on entropic optimal transport , 2020, ArXiv.

[4]  Takayuki Teramoto,et al.  Automated detection and tracking of many cells by using 4D live-cell imaging data , 2014, Bioinform..

[5]  Matthew D. DiFranco,et al.  A probabilistic atlas for cell identification. , 2019 .

[6]  Paul J. Besl,et al.  Method for registration of 3-D shapes , 1992, Other Conferences.

[7]  Theodore H. Lindsay,et al.  Global Brain Dynamics Embed the Motor Command Sequence of Caenorhabditis elegans , 2015, Cell.

[8]  Takeshi Ishihara,et al.  SPF-CellTracker: Tracking Multiple Cells with Strongly-Correlated Moves Using a Spatial Particle Filter , 2015, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[9]  Yuko Murakami,et al.  An annotation dataset facilitates automatic annotation of whole-brain activity imaging of C. elegans , 2019 .

[10]  Michel Verleysen,et al.  On Convergence Problems of the EM Algorithm for Finite Gaussian Mixtures , 2003, ESANN.

[11]  R. Prevedel,et al.  Brain-wide 3D imaging of neuronal activity in Caenorhabditis elegans with sculpted light , 2013, Nature Methods.

[12]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[13]  Xiao Liu,et al.  Automated cellular annotation for high-resolution images of adult Caenorhabditis elegans , 2013, Bioinform..

[14]  Richard Sinkhorn,et al.  Concerning nonnegative matrices and doubly stochastic matrices , 1967 .

[15]  Xiao Liu,et al.  Simultaneous recognition and segmentation of cells: application in C.elegans , 2011, Bioinform..

[16]  Joshua W Shaevitz,et al.  Whole-brain calcium imaging with cellular resolution in freely behaving Caenorhabditis elegans , 2015, Proceedings of the National Academy of Sciences.