Inferring TF activation order in time series scRNA-Seq studies

Methods for the analysis of time series single cell expression data (scRNA-Seq) either do not utilize information about transcription factors (TFs) and their targets or only study these as a post-processing step. Using such information can both improve the accuracy of the reconstructed model and cell assignments and provide information on how and when the process is regulated. We developed the Continuous-State Hidden Markov Models TF (CSHMM-TF) method, which integrates probabilistic modeling of scRNA-Seq data with the ability to assign TFs to specific activation points in the model. TFs are assumed to influence the emission probabilities for cells assigned to later time points, allowing us to identify not just the TFs controlling each path but also their order of activation. We tested CSHMM-TF on several mouse and human datasets. As we show, the method identified known and novel TFs for all processes, the assigned times of activation agree with both expression information and prior knowledge, and combinatorial predictions are supported by known interactions. We also show that CSHMM-TF improves upon prior methods that do not utilize TF-gene interactions.
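To make the idea of an activation point concrete, the following is a minimal toy sketch, not the CSHMM-TF implementation. It assumes Gaussian emissions with a fixed variance, a single path with cell times in [0, 1], and a TF whose targets receive a constant mean shift once the cell's assigned time passes the activation time. All function names, parameters, and data values here are hypothetical and chosen only for illustration; the actual model and its learning procedure are not specified in this abstract.

```python
import math


def gaussian_loglik(x, mean, sigma=1.0):
    """Log density of x under a normal distribution N(mean, sigma^2)."""
    return -0.5 * math.log(2 * math.pi * sigma ** 2) - (x - mean) ** 2 / (2 * sigma ** 2)


def path_loglik(cells, base_mean, targets, effect, t_act, sigma=1.0):
    """Log-likelihood of cells on one path (toy assumption: step-function TF effect).

    cells     -- list of (time, expression_vector) pairs
    base_mean -- per-gene emission mean before the TF is active
    targets   -- set of gene indices regulated by the TF
    effect    -- mean shift applied to target genes once the TF is active
    t_act     -- candidate activation time of the TF on this path
    """
    total = 0.0
    for t, expr in cells:
        for g, x in enumerate(expr):
            # Only target genes in cells at or after t_act are shifted.
            shift = effect if (g in targets and t >= t_act) else 0.0
            total += gaussian_loglik(x, base_mean[g] + shift, sigma)
    return total


def best_activation_time(cells, base_mean, targets, effect, grid, sigma=1.0):
    """Pick the candidate activation time with the highest log-likelihood."""
    return max(grid, key=lambda t_act: path_loglik(cells, base_mean, targets, effect, t_act, sigma))


# Hypothetical toy data: gene 0 is a TF target that switches on mid-path,
# gene 1 is not a target and stays flat.
cells = [
    (0.1, [0.0, 1.0]),
    (0.3, [0.0, 1.0]),
    (0.5, [2.0, 1.0]),
    (0.7, [2.0, 1.0]),
    (0.9, [2.0, 1.0]),
]
base_mean = [0.0, 1.0]
targets = {0}
grid = [0.0, 0.2, 0.4, 0.6, 0.8]

t_hat = best_activation_time(cells, base_mean, targets, effect=2.0, grid=grid)
print(t_hat)  # 0.4 on this toy data
```

Running the same search for several TFs on one path and sorting them by their selected times would give an activation order in this toy setting. The grid search stands in for whatever inference the full method uses.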
