Ensemble Inference and Inferability of Gene Regulatory Networks

The inference of gene regulatory network (GRN) from gene expression data is an unsolved problem of great importance. This inference has been stated, though not proven, to be underdetermined implying that there could be many equivalent (indistinguishable) solutions. Motivated by this fundamental limitation, we have developed new framework and algorithm, called TRaCE, for the ensemble inference of GRNs. The ensemble corresponds to the inherent uncertainty associated with discriminating direct and indirect gene regulations from steady-state data of gene knock-out (KO) experiments. We applied TRaCE to analyze the inferability of random GRNs and the GRNs of E. coli and yeast from single- and double-gene KO experiments. The results showed that, with the exception of networks with very few edges, GRNs are typically not inferable even when the data are ideal (unbiased and noise-free). Finally, we compared the performance of TRaCE with top performing methods of DREAM4 in silico network inference challenge.

[1]  Dario Floreano,et al.  GeneNetWeaver: in silico benchmark generation and performance profiling of network inference methods , 2011, Bioinform..

[2]  Frank Harary,et al.  Graph Theory , 2016 .

[3]  V. Hatzimanikatis,et al.  Modeling of uncertainties in biochemical reactions , 2011, Biotechnology and bioengineering.

[4]  A. G. de la Fuente,et al.  From Knockouts to Networks: Establishing Direct Cause-Effect Relationships through Graph Analysis , 2010, PloS one.

[5]  P. Geurts,et al.  Inferring Regulatory Networks from Expression Data Using Tree-Based Methods , 2010, PloS one.

[6]  J. Stelling,et al.  Ensemble modeling for analysis of cell signaling dynamics , 2007, Nature Biotechnology.

[7]  Andrea Califano,et al.  Theory and Limitations of Genetic Network Inference from Microarray Data , 2007, Annals of the New York Academy of Sciences.

[8]  Rudiyanto Gunawan,et al.  Parameter identifiability of power-law biochemical system models. , 2010, Journal of biotechnology.

[9]  J. Liao,et al.  Metabolic ensemble modeling for strain engineers , 2012, Biotechnology journal.

[10]  Alfred V. Aho,et al.  The Transitive Reduction of a Directed Graph , 1972, SIAM J. Comput..

[11]  R. Russell,et al.  Illuminating drug discovery with biological pathways , 2005, FEBS letters.

[12]  R. Albert Scale-free networks in cell biology , 2005, Journal of Cell Science.

[13]  G. Stephanopoulos,et al.  Engineering for biofuels: exploiting innate microbial capacity or importing biosynthetic potential? , 2009, Nature Reviews Microbiology.

[14]  Xing-Ming Zhao,et al.  Prediction of Drug Combinations by Integrating Molecular and Pharmacological Data , 2011, PLoS Comput. Biol..

[15]  Albert-László Barabási,et al.  Statistical mechanics of complex networks , 2001, ArXiv.

[16]  Riet De Smet,et al.  Advantages and limitations of current network inference methods , 2010, Nature Reviews Microbiology.

[17]  D. Floreano,et al.  Revealing strengths and weaknesses of methods for gene network inference , 2010, Proceedings of the National Academy of Sciences.

[18]  Michael Hecker,et al.  Gene regulatory network inference: Data integration in dynamic models - A review , 2009, Biosyst..

[19]  Diogo M. Camacho,et al.  Wisdom of crowds for robust gene network inference , 2012, Nature Methods.

[20]  Jason A. Papin,et al.  Applications of genome-scale metabolic reconstructions , 2009, Molecular systems biology.

[21]  Rudiyanto Gunawan,et al.  Iterative approach to model identification of biological networks , 2005, BMC Bioinformatics.

[22]  J. Banga,et al.  Structural Identifiability of Systems Biology Models: A Critical Comparison of Methods , 2011, PloS one.

[23]  Sherri L. Jackson Research Methods and Statistics: A Critical Thinking Approach , 2005 .

[24]  Rudiyanto Gunawan,et al.  Assessment of Network Inference Methods: How to Cope with an Underdetermined Problem , 2014, PloS one.

[25]  Andreas Wagner,et al.  How to reconstruct a large genetic network from n gene perturbations in fewer than n2 easy steps , 2001, Bioinform..

[26]  Gustavo Stolovitzky,et al.  Lessons from the DREAM2 Challenges , 2009, Annals of the New York Academy of Sciences.

[27]  Constantin F. Aliferis,et al.  The max-min hill-climbing Bayesian network structure learning algorithm , 2006, Machine Learning.

[28]  Julio R. Banga,et al.  Inference of complex biological networks: distinguishability issues and optimization-based solutions , 2011, BMC Systems Biology.

[29]  Jean-Philippe Vert,et al.  TIGRESS: Trustful Inference of Gene REgulation using Stability Selection , 2012, BMC Systems Biology.

[30]  Rudiyanto Gunawan,et al.  Ensemble Kinetic Modeling of Metabolic Networks from Dynamic Metabolic Profiles , 2012, Metabolites.