TASIC: determining branching models from time series single cell data

Motivation: Single cell RNA‐Seq analysis holds great promise for elucidating the networks and pathways controlling cellular differentiation and disease. However, the analysis of time series single cell RNA‐Seq data raises several new computational challenges. Cells at each time point are often sampled from a mixture of cell types, each of which may be a progenitor of one, or several, specific fates making it hard to determine which cells should be used to reconstruct temporal trajectories. In addition, cells, even from the same time point, may be unsynchronized making it hard to rely on the measured time for determining these trajectories. Results: We present TASIC a new method for determining temporal trajectories, branching and cell assignments in single cell time series experiments. Unlike prior approaches TASIC uses on a probabilistic graphical model to integrate expression and time information making it more robust to noise and stochastic variations. Applying TASIC to in vitro myoblast differentiation and in‐vivo lung development data we show that it accurately reconstructs developmental trajectories from single cell experiments. The reconstructed models enabled us to identify key genes involved in cell fate determination and to obtain new insights about a specific type of lung cells and its role in development. Availability and Implementation: The TASIC software package is posted in the supporting website. The datasets used in the paper are publicly available. Contact: zivbj@cs.cmu.edu Supplementary information: Supplementary data are available at Bioinformatics online.

[1]  N. Neff,et al.  Reconstructing lineage hierarchies of the distal lung epithelium using single cell RNA-seq , 2014, Nature.

[2]  Robert H Singer,et al.  Single-Cell Gene Expression Profiling , 2002, Science.

[3]  N. Neff,et al.  Quantitative assessment of single-cell RNA-sequencing methods , 2013, Nature Methods.

[4]  Jeffrey A Whitsett,et al.  Foxa2 is required for transition to air breathing at birth. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[5]  M. Krasnow,et al.  Alveolar progenitor and stem cells in lung development, renewal and cancer , 2014, Nature.

[6]  G. Pavlath,et al.  Myoblast fusion: lessons from flies and mice , 2012, Development.

[7]  E. Shapiro,et al.  Single-cell sequencing-based technologies will revolutionize whole-organism science , 2013, Nature Reviews Genetics.

[8]  S. Tsui,et al.  Calcyclin binding protein promotes DNA synthesis and differentiation in rat neonatal cardiomyocytes , 2006, Journal of cellular biochemistry.

[9]  Pradeep S Rajendran,et al.  Single-cell dissection of transcriptional heterogeneity in human colon tumors , 2011, Nature Biotechnology.

[10]  E. Marco,et al.  Bifurcation analysis of single-cell gene expression data reveals epigenetic landscape , 2014, Proceedings of the National Academy of Sciences.

[11]  T. Jaakkola,et al.  Comparing the continuous representation of time-series expression profiles to identify differentially expressed genes , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[12]  S. Teichmann,et al.  Computational and analytical challenges in single-cell transcriptomics , 2015, Nature Reviews Genetics.

[13]  Richard Bonneau,et al.  The Inferelator: an algorithm for learning parsimonious regulatory networks from systems-biology data sets de novo , 2006, Genome Biology.

[14]  Sean C. Bendall,et al.  Single-Cell Trajectory Detection Uncovers Progression and Regulatory Coordination in Human B Cell Development , 2014, Cell.

[15]  Cole Trapnell,et al.  The dynamics and regulators of cell fate decisions are revealed by pseudotemporal ordering of single cells , 2014, Nature Biotechnology.

[16]  Ashwini Patil TimeXNet: Identifying active gene sub-networks using time-course gene expression profiles , 2015 .

[17]  E. Morrisey,et al.  Lung regeneration: mechanisms, applications and emerging stem cell populations , 2014, Nature Medicine.

[18]  I. Simon,et al.  Reconstructing dynamic regulatory maps , 2007, Molecular systems biology.

[19]  Adrian E. Raftery,et al.  Fast Bayesian inference for gene regulatory networks using ScanBMA , 2014, BMC Systems Biology.

[20]  I. Simon,et al.  Studying and modelling dynamic biological processes using time-series gene expression data , 2012, Nature Reviews Genetics.

[21]  Rona S. Gertner,et al.  Single-cell transcriptomics reveals bimodality in expression and splicing in immune cells , 2013, Nature.

[22]  D H Bowden,et al.  Derivation of type 1 epithelium from type 2 cells in the developing rat lung. , 1975, Laboratory investigation; a journal of technical methods and pathology.

[23]  Lorenz Wernisch,et al.  Pseudotime estimation: deconfounding single cell time series , 2015, bioRxiv.

[24]  L. Fritsch,et al.  The Core Binding Factor CBF Negatively Regulates Skeletal Muscle Terminal Differentiation , 2010, PloS one.

[25]  H. Zoghbi,et al.  ATXN1 protein family and CIC regulate extracellular matrix remodeling and lung alveolarization. , 2011, Developmental cell.

[26]  Steven D Shapiro,et al.  Expression profiling of the developing mouse lung: insights into the establishment of the extracellular matrix. , 2002, American journal of respiratory cell and molecular biology.

[27]  Ziv Bar-Joseph,et al.  DREM 2.0: Improved reconstruction of dynamic regulatory networks from time-series expression data , 2012, BMC Systems Biology.

[28]  S. Yamanaka,et al.  Induction of Pluripotent Stem Cells from Mouse Embryonic and Adult Fibroblast Cultures by Defined Factors , 2006, Cell.

[29]  S. Tapscott,et al.  The circuitry of a master switch: Myod and the regulation of skeletal muscle gene transcription , 2005, Development.

[30]  Sean C. Bendall,et al.  Wishbone identifies bifurcating developmental trajectories from single-cell data , 2016, Nature Biotechnology.

[31]  Chen Xu,et al.  Identification of cell types from single-cell transcriptomes using a novel clustering method , 2015, Bioinform..

[32]  Johan T. den Dunnen,et al.  Large-scale gene expression analysis of human skeletal myoblast differentiation , 2004, Neuromuscular Disorders.

[33]  Fabian J Theis,et al.  Computational analysis of cell-to-cell heterogeneity in single-cell RNA-sequencing data reveals hidden subpopulations of cells , 2015, Nature Biotechnology.