Towards inferring causal gene regulatory networks from single cell expression measurements

Single-cell transcriptome sequencing now routinely samples thousands of cells, potentially providing enough data to reconstruct causal gene regulatory networks from observational data. Here, we present Scribe, a toolkit for detecting and visualizing causal regulatory interactions between genes, and explore the potential for single-cell experiments to power network reconstruction. Scribe employs Restricted Directed Information to determine causality by estimating the strength of information transferred from a potential regulator to its downstream target. We apply Scribe and other leading approaches for causal network reconstruction to several types of single-cell measurements and show that performance drops dramatically for “pseudotime”-ordered single-cell data compared with true time-series data. We demonstrate that causal inference requires temporal coupling between measurements. Through an analysis of chromaffin cell fate commitment, we show that methods such as “RNA velocity” restore some degree of coupling. These analyses therefore highlight an important shortcoming in experimental and computational methods for analyzing gene regulation at single-cell resolution and point the way towards overcoming it.
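The idea of measuring information transferred from a regulator to its target can be illustrated with a minimal sketch. It assumes that Restricted Directed Information takes the form of a conditional mutual information, I(x_{t-d}; y_t | y_{t-1}), between the regulator's past state and the target's current state given the target's previous state. The Gaussian (covariance-based) estimator below is a simple stand-in chosen for brevity and should not be read as Scribe's own estimator. The function names `gaussian_cmi`, `rdi`, and `simulate`, and the toy two-gene system, are hypothetical and introduced only for illustration.

```python
import numpy as np


def gaussian_cmi(x, y, z):
    """Estimate I(x; y | z) in nats, assuming (x, y, z) are jointly Gaussian."""
    def logdet(*cols):
        # Log-determinant of the sample covariance of the stacked columns.
        cov = np.atleast_2d(np.cov(np.column_stack(cols), rowvar=False))
        return np.linalg.slogdet(cov)[1]

    return 0.5 * (logdet(x, z) + logdet(y, z) - logdet(z) - logdet(x, y, z))


def rdi(regulator, target, delay=1):
    """Assumed RDI form: I(regulator[t - delay]; target[t] | target[t - 1])."""
    if delay < 1:
        raise ValueError("delay must be at least 1")
    n = len(target)
    x_past = regulator[: n - delay]        # x_{t-d}
    y_now = target[delay:]                 # y_t
    y_prev = target[delay - 1 : n - 1]     # y_{t-1}
    return gaussian_cmi(x_past, y_now, y_prev)


def simulate(n_steps=2000, seed=0):
    """Toy time series in which x drives y with a one-step lag (illustrative only)."""
    rng = np.random.default_rng(seed)
    x = np.zeros(n_steps)
    y = np.zeros(n_steps)
    for t in range(1, n_steps):
        x[t] = 0.7 * x[t - 1] + rng.normal()
        y[t] = 0.5 * y[t - 1] + 0.8 * x[t - 1] + 0.3 * rng.normal()
    return x, y


x, y = simulate()
forward = rdi(x, y)    # regulator -> target
backward = rdi(y, x)   # target -> regulator (no true influence in this toy system)
print(f"RDI x->y: {forward:.3f} nats, RDI y->x: {backward:.3f} nats")
```

In this toy system the score is asymmetric: it is large in the direction of the simulated influence and close to zero in the reverse direction. The sketch relies on consecutive time points being measured from the same trajectory, which is the temporal coupling the abstract identifies as necessary for causal inference.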
