The gene regulatory network of mESC differentiation: a benchmark for reverse engineering methods

A large body of data have accumulated that characterize the gene regulatory network of stem cells. Yet, a comprehensive and integrative understanding of this complex network is lacking. Network reverse engineering methods that use transcriptome data to derive these networks may help to uncover the topology in an unbiased way. Many methods exist that use co-expression to reconstruct networks. However, it remains unclear how these methods perform in the context of stem cell differentiation, as most systematic assessments have been made for regulatory networks of unicellular organisms. Here, we report a systematic benchmark of different reverse engineering methods against functional data. We show that network pruning is critical for reconstruction performance. We also find that performance is similar for algorithms that use different co-expression measures, i.e. mutual information or correlation. In addition, different methods yield very different network topologies, highlighting the challenge of interpreting these resulting networks as a whole. This article is part of the theme issue ‘Designer human tissue: coming to a lab near you’.

[1]  Samantha A. Morris,et al.  CellNet: Network Biology Applied to Stem Cell Engineering , 2014, Cell.

[2]  M. Babu,et al.  Dissecting ensemble networks in ES cell populations reveals micro-heterogeneity underlying pluripotency. , 2012, Molecular bioSystems.

[3]  Alexei A. Sharov,et al.  Systematic repression of transcription factors reveals limited patterns of gene expression changes in ES cells , 2013, Scientific Reports.

[4]  S. Orkin,et al.  An Extended Transcriptional Network for Pluripotency of Embryonic Stem Cells , 2008, Cell.

[5]  Georg Fuellen,et al.  The PluriNetWork: An Electronic Representation of the Network Underlying Pluripotency in Mouse, and Its Applications , 2010, PloS one.

[6]  Kevin Kontos,et al.  Information-Theoretic Inference of Large Transcriptional Regulatory Networks , 2007, EURASIP J. Bioinform. Syst. Biol..

[7]  Suzana de Siqueira Santos,et al.  A comparative study of statistical methods used to identify dependencies between gene expression signals , 2014, Briefings Bioinform..

[8]  Mikael Huss,et al.  Resolution of cell fate decisions revealed by single-cell gene expression analysis from zygote to blastocyst. , 2010, Developmental cell.

[9]  Thomas Lengauer,et al.  ROCR: visualizing classifier performance in R , 2005, Bioinform..

[10]  Eric H. Davidson,et al.  Evolution of Gene Regulatory Networks Controlling Body Plan Development , 2011, Cell.

[11]  S. Shen-Orr,et al.  Network motifs: simple building blocks of complex networks. , 2002, Science.

[12]  杉本 敏美,et al.  Esrrb is a pivotal target of the Gsk3/Tcf3 axis regulating embryonic stem cell self-renewal , 2012 .

[13]  Adam A. Margolin,et al.  Reverse engineering of regulatory networks in human B cells , 2005, Nature Genetics.

[14]  Mark Goadrich,et al.  The relationship between Precision-Recall and ROC curves , 2006, ICML.

[15]  Chris Wiggins,et al.  ARACNE: An Algorithm for the Reconstruction of Gene Regulatory Networks in a Mammalian Cellular Context , 2004, BMC Bioinformatics.

[16]  Ravinder Singh,et al.  Fast-Find: A novel computational approach to analyzing combinatorial motifs , 2006, BMC Bioinformatics.

[17]  Geppino Falco,et al.  Uncovering early response of gene regulatory networks in ESCs by systematic induction of transcription factors. , 2009, Cell stem cell.

[18]  Martin Renqiang Min,et al.  An integrated encyclopedia of DNA elements in the human genome , 2012 .

[19]  Matthew A. Hibbs,et al.  Cell-Type-Specific Predictive Network Yields Novel Insights into Mouse Embryonic Stem Cell Self-Renewal and Cell Fate , 2013, PloS one.

[20]  Justin Schwartz Engineering , 1929, Nature.

[21]  S. Emmott,et al.  Defining an essential transcription factor program for naïve pluripotency , 2014, Science.

[22]  Alexei A. Sharov,et al.  Generation of mouse ES cell lines engineered for the forced induction of transcription factors , 2011, Scientific reports.

[23]  Bertram Klinger,et al.  Supplementary Materials for : Network quantification of EGFR signaling unveils potential for targeted combination therapy , 2022 .

[24]  Andrea Califano,et al.  Theory and Limitations of Genetic Network Inference from Microarray Data , 2007, Annals of the New York Academy of Sciences.

[25]  Lin Song,et al.  Comparison of co-expression measures: mutual information, correlation, and model based indices , 2012, BMC Bioinformatics.

[26]  W. Wong,et al.  ChIP-Seq of transcription factors predicts absolute and differential gene expression in embryonic stem cells , 2009, Proceedings of the National Academy of Sciences.

[27]  Achim Tresch,et al.  Modeling the temporal interplay of molecular signaling and gene expression by using dynamic nested effects models , 2009, Proceedings of the National Academy of Sciences.

[28]  Edward T. Bullmore,et al.  On the use of correlation as a measure of network connectivity , 2012, NeuroImage.

[29]  Jonathan K. Pritchard,et al.  The Functional Consequences of Variation in Transcription Factor Binding , 2013, PLoS genetics.

[30]  Feng Q. He,et al.  Reverse engineering and verification of gene networks: principles, assumptions, and limitations of present methods and future perspectives. , 2009, Journal of biotechnology.

[31]  Austin G Smith,et al.  Mapping the route from naive pluripotency to lineage specification , 2014, Philosophical Transactions of the Royal Society B: Biological Sciences.

[32]  A. Califano,et al.  Dialogue on Reverse‐Engineering Assessment and Methods , 2007, Annals of the New York Academy of Sciences.

[33]  Sandy L. Klemm,et al.  Single-Cell Expression Analyses during Cellular Reprogramming Reveal an Early Stochastic and a Late Hierarchic Phase , 2012, Cell.

[34]  D. di Bernardo,et al.  How to infer gene networks from expression profiles , 2007, Molecular systems biology.

[35]  Christof Niehrs,et al.  Emerging links between CDK cell cycle regulators and Wnt signaling. , 2010, Trends in cell biology.

[36]  D. di Bernardo,et al.  Reverse engineering a mouse embryonic stem cell-specific transcriptional network reveals a new modulator of neuronal differentiation , 2012, Nucleic acids research.

[37]  Hidde de Jong,et al.  Modeling and Simulation of Genetic Regulatory Systems: A Literature Review , 2002, J. Comput. Biol..

[38]  J. Collins,et al.  Large-Scale Mapping and Validation of Escherichia coli Transcriptional Regulation from a Compendium of Expression Profiles , 2007, PLoS biology.

[39]  Anne-Laure Boulesteix,et al.  Regularized estimation of large-scale gene association networks using graphical Gaussian models , 2009, BMC Bioinformatics.

[40]  Avi Ma'ayan,et al.  ESCAPE: database for integrating high-content published data collected from human and mouse embryonic stem cells , 2013, Database J. Biol. Databases Curation.

[41]  Min Chen,et al.  Comparing Statistical Methods for Constructing Large Scale Gene Networks , 2012, PloS one.

[42]  Hui Liu,et al.  AnimalTFDB: a comprehensive animal transcription factor database , 2011, Nucleic Acids Res..

[43]  Avi Ma'ayan,et al.  Construction and Validation of a Regulatory Network for Pluripotency and Self-Renewal of Mouse Embryonic Stem Cells , 2014, PLoS Comput. Biol..

[44]  Diogo M. Camacho,et al.  Wisdom of crowds for robust gene network inference , 2012, Nature Methods.