SCODE: an efficient regulatory network inference algorithm from single-cell RNA-Seq during differentiation

Motivation: The analysis of RNA‐Seq data from individual differentiating cells enables us to reconstruct the differentiation process and the degree of differentiation (in pseudo‐time) of each cell. Such analyses can reveal detailed expression dynamics and functional relationships for differentiation. To further elucidate differentiation processes, more insight into gene regulatory networks is required. The pseudo‐time can be regarded as time information and, therefore, single‐cell RNA‐Seq data are time‐course data with high time resolution. Although time‐course data are useful for inferring networks, conventional inference algorithms for such data suffer from high time complexity when the number of samples and genes is large. Therefore, a novel algorithm is necessary to infer networks from single‐cell RNA‐Seq during differentiation. Results: In this study, we developed the novel and efficient algorithm SCODE to infer regulatory networks, based on ordinary differential equations. We applied SCODE to three single‐cell RNA‐Seq datasets and confirmed that SCODE can reconstruct observed expression dynamics. We evaluated SCODE by comparing its inferred networks with use of a DNaseI‐footprint based network. The performance of SCODE was best for two of the datasets and nearly best for the remaining dataset. We also compared the runtimes and showed that the runtimes for SCODE are significantly shorter than for alternatives. Thus, our algorithm provides a promising approach for further single‐cell differentiation analyses. Availability and Implementation: The R source code of SCODE is available at https://github.com/hmatsu1226/SCODE Contact: hirotaka.matsumoto@riken.jp Supplementary information: Supplementary data are available at Bioinformatics online.

[1]  R. Stewart,et al.  Single-cell RNA-seq reveals novel regulators of human embryonic stem cell differentiation to definitive endoderm , 2016, Genome Biology.

[2]  J. Kawai,et al.  A genome-wide and nonredundant mouse transcription factor database. , 2004, Biochemical and biophysical research communications.

[3]  Lars Martin Jakt,et al.  DNA Methylation Restricts Lineage-specific Functions of Transcription Factor Gata4 during Embryonic Stem Cell Differentiation , 2013, PLoS genetics.

[4]  Jing Guo,et al.  Single-cell transcriptional analysis to uncover regulatory circuits driving cell fate decisions in early mouse development , 2015, Bioinform..

[5]  J. Collins,et al.  Inferring Genetic Networks and Identifying Compound Mode of Action via Expression Profiling , 2003, Science.

[6]  Hisanori Kiryu,et al.  SCOUP: a probabilistic model based on the Ornstein–Uhlenbeck process to analyze single-cell expression data during differentiation , 2016, BMC Bioinformatics.

[7]  Cole Trapnell,et al.  The dynamics and regulators of cell fate decisions are revealed by pseudotemporal ordering of single cells , 2014, Nature Biotechnology.

[8]  Hans Clevers,et al.  Single-cell messenger RNA sequencing reveals rare intestinal cell types , 2015, Nature.

[9]  Shane J. Neph,et al.  Circuitry and Dynamics of Human Transcription Factor Regulatory Networks , 2012, Cell.

[10]  Hitoshi Niwa,et al.  Extra-embryonic endoderm cells derived from ES cells induced by GATA Factors acquire the character of XEN cells , 2007, BMC Developmental Biology.

[11]  Siyu Zhu,et al.  Systematic Reconstruction of Molecular Cascades Regulating GP Development Using Single-Cell RNA-Seq. , 2016, Cell reports.

[12]  V. Anne Smith,et al.  Relationship between differentially expressed mRNA and mRNA-protein correlations in a xenograft model system , 2015 .

[13]  Hongkai Ji,et al.  TSCAN: Pseudo-time reconstruction and evaluation in single-cell RNA-seq analysis , 2016, Nucleic acids research.

[14]  P. Geurts,et al.  Inferring Regulatory Networks from Expression Data Using Tree-Based Methods , 2010, PloS one.

[15]  H. Lehrach,et al.  Analysis of Oct4‐Dependent Transcriptional Networks Regulating Self‐Renewal and Pluripotency in Human Embryonic Stem Cells , 2007, Stem cells.

[16]  M. Mann,et al.  Defining the transcriptome and proteome in three functionally different human cell lines , 2010, Molecular systems biology.

[17]  Ricardo J. Miragaia,et al.  MERVL/Zscan4 Network Activation Results in Transient Genome-wide DNA Demethylation of mESCs , 2016, Cell reports.

[18]  Jung Eun Shim,et al.  TRRUST: a reference database of human transcriptional regulatory interactions , 2015, Scientific Reports.

[19]  Fabian J. Theis,et al.  Reconstructing gene regulatory dynamics from high-dimensional single-cell snapshot data , 2015, Bioinform..

[20]  Tomohiro Hayakawa,et al.  Maintenance of self‐renewal ability of mouse embryonic stem cells in the absence of DNA methyltransferases Dnmt1, Dnmt3a and Dnmt3b , 2006, Genes to cells : devoted to molecular & cellular mechanisms.

[21]  Shuang Yang,et al.  Identification of DeltaEF1 as a novel target that is negatively regulated by LMO2 in T‐cell leukemia , 2010, European journal of haematology.

[22]  Cole Trapnell,et al.  Defining cell types and states with single-cell genomics , 2015, Genome research.

[23]  Diego di Bernardo,et al.  Inference of gene regulatory networks and compound mode of action from time course gene expression profiles , 2006, Bioinform..

[24]  Guocheng Yuan,et al.  GiniClust: detecting rare cell types from single-cell gene expression data with Gini index , 2016, Genome Biology.

[25]  Luis Serrano,et al.  Correlation of mRNA and protein in complex biological samples , 2009, FEBS letters.

[26]  Diego di Bernardo,et al.  Robust Identification of Large Genetic Networks , 2003, Pacific Symposium on Biocomputing.

[27]  Berthold Göttgens,et al.  BTR: training asynchronous Boolean models using single-cell expression data , 2016, BMC Bioinformatics.

[28]  Elhanan Borenstein,et al.  Conservation of trans-acting circuitry during mammalian regulatory evolution , 2014, Nature.

[29]  Diogo M. Camacho,et al.  Wisdom of crowds for robust gene network inference , 2012, Nature Methods.

[30]  J. Berg,et al.  Dnmt3a is essential for hematopoietic stem cell differentiation , 2011, Nature Genetics.

[31]  Carsten Peterson,et al.  Single-Cell Network Analysis Identifies DDIT3 as a Nodal Lineage Regulator in Hematopoiesis , 2015, Cell reports.

[32]  Berthold Göttgens,et al.  Preview: Published ahead of advance online publication Processing, visualising and reconstructing network models from single cell data , 2015 .

[33]  Fabian J Theis,et al.  Computational analysis of cell-to-cell heterogeneity in single-cell RNA-sequencing data reveals hidden subpopulations of cells , 2015, Nature Biotechnology.

[34]  S. Pelet,et al.  Real-time quantification of protein expression at the single-cell level via dynamic protein synthesis translocation reporters , 2016, Nature Communications.

[35]  Luke D. Lavis,et al.  Real-time quantification of single RNA translation dynamics in living cells , 2016, Science.

[36]  N. Neff,et al.  Dissecting direct reprogramming from fibroblast to neuron using single-cell RNA-seq , 2016, Nature.

[37]  Guido Sanguinetti,et al.  Combining tree-based and dynamical systems for the inference of gene regulatory networks , 2015, Bioinform..

[38]  Yu Xue,et al.  AnimalTFDB 2.0: a resource for expression, prediction and functional study of animal transcription factors , 2014, Nucleic Acids Res..

[39]  N. Neff,et al.  Reconstructing lineage hierarchies of the distal lung epithelium using single cell RNA-seq , 2014, Nature.

[40]  Michael C. Kelly,et al.  Single-cell RNA-Seq resolves cellular complexity in sensory organs from the neonatal inner ear , 2015, Nature Communications.

[41]  S. Linnarsson,et al.  Cell types in the mouse cortex and hippocampus revealed by single-cell RNA-seq , 2015, Science.

[42]  Fabian J Theis,et al.  Decoding the Regulatory Network for Blood Development from Single-Cell Gene Expression Measurements , 2015, Nature Biotechnology.

[43]  Wei-Po Lee,et al.  Computational methods for discovering gene networks from expression data , 2009, Briefings Bioinform..

[44]  Mark Gerstein,et al.  DREISS: Using State-Space Models to Infer the Dynamics of Gene Expression Driven by External and Internal Regulatory Networks , 2016, PLoS Comput. Biol..

[45]  Aleksandra A. Kolodziejczyk,et al.  The technology and biology of single-cell RNA sequencing. , 2015, Molecular cell.