Bayesian detection of embryonic gene expression onset in C. elegans

To study how a zygote develops into an embryo with different tissues, large-scale 4D confocal movies of C. elegans embryos have been produced recently by experimental biologists. However, the lack of principled statistical methods for the highly noisy data has hindered the comprehensive analysis of these data sets. We introduced a probabilistic change point model on the cell lineage tree to estimate the embryonic gene expression onset time. A Bayesian approach is used to fit the 4D confocal movies data to the model. Subsequent classification methods are used to decide a model selection threshold and further refine the expression onset time from the branch level to the specific cell time level. Extensive simulations have shown the high accuracy of our method. Its application on real data yields both previously known results and new findings.

[1]  Bin Yan,et al.  DDGni: Dynamic delay gene-network inference from high-temporal data using gapped local alignment , 2014, Bioinform..

[2]  R. Waterston,et al.  Multidimensional regulation of gene expression in the C. elegans embryo , 2012, Genome research.

[3]  Vipin T. Sreedharan,et al.  A spatial and temporal map of C. elegans gene expression. , 2011, Genome research.

[4]  Eugene W. Myers,et al.  Analysis of Cell Fate from Single-Cell Gene Expression Profiles in C. elegans , 2009, Cell.

[5]  E. Myers,et al.  A 3D Digital Atlas of C. elegans and Its Application To Single-Cell Analyses , 2009, Nature Methods.

[6]  Thomas J. Nicholas,et al.  Automated analysis of embryonic gene expression with cellular resolution in C. elegans , 2008, Nature Methods.

[7]  Jon M. Kleinberg,et al.  Tracing information flow on a global scale using Internet chain-letter data , 2008, Proceedings of the National Academy of Sciences.

[8]  Wouter Houthoofd,et al.  The embryonic cell lineage of the nematode Halicephalobus gingivalis (Nematoda: Cephalobina: Panagrolaimoidea) , 2007 .

[9]  H. Horvitz,et al.  C. elegans ISWI and NURF301 antagonize an Rb-like pathway in the determination of multiple cell fates , 2006, Development.

[10]  R. Waterston,et al.  Automated cell lineage tracing in Caenorhabditis elegans. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[11]  Morris F. Maduro,et al.  Genetic redundancy in endoderm specification within the genus Caenorhabditis. , 2005, Developmental biology.

[12]  R. J. Hill,et al.  The T-box transcription factors TBX-37 and TBX-38 link GLP-1/Notch signaling to mesoderm induction in C. elegans embryos , 2004, Development.

[13]  Bernard Bobée,et al.  Bayesian change-point analysis in hydrometeorological time series. Part 2. Comparison of change-point models and forecasting , 2000 .

[14]  J. Ahnn,et al.  Analysis of the , 2000 .

[15]  Jaideep Srivastava,et al.  Event detection from time series data , 1999, KDD '99.

[16]  Alexander J. Smola,et al.  Support Vector Regression Machines , 1996, NIPS.

[17]  M. Krause MyoD and myogenesis in C. elegans , 1995, BioEssays : news and reviews in molecular, cellular and developmental biology.

[18]  D. Rubin,et al.  Inference from Iterative Simulation Using Multiple Sequences , 1992 .

[19]  D. Picard Testing and estimating change-points in time series , 1985, Advances in Applied Probability.

[20]  J. Sulston,et al.  The embryonic cell lineage of the nematode Caenorhabditis elegans. , 1983, Developmental biology.