Gaining confidence in inferred networks

Network inference is a notoriously challenging problem. Inferred networks are associated with high uncertainty and likely riddled with false positive and false negative interactions. Especially for biological networks we do not have good ways of judging the performance of inference methods against real networks, and instead we often rely solely on the performance against simulated data. Gaining confidence in networks inferred from real data nevertheless thus requires establishing reliable validation methods. Here, we argue that the expectation of mixing patterns in biological networks such as gene regulatory networks offers a reasonable starting point: interactions are more likely to occur between nodes with similar biological functions. We can quantify this behaviour using the assortativity coefficient, and here we show that the resulting heuristic, functional assortativity, offers a reliable and informative route for comparing different inference algorithms.

[1]  R. Tanaka,et al.  Scale-rich metabolic networks. , 2005, Physical review letters.

[2]  Emil Saucan,et al.  Persistent homology of unweighted complex networks via discrete Morse theory , 2019, Scientific Reports.

[3]  Michael P H Stumpf,et al.  Transition state characteristics during cell differentiation , 2018, bioRxiv.

[4]  J. Collins,et al.  Large-Scale Mapping and Validation of Escherichia coli Transcriptional Regulation from a Compendium of Expression Profiles , 2007, PLoS biology.

[5]  Roel Snieder,et al.  The Anatomy of Inverse Problems , 2000 .

[6]  N. D. Clarke,et al.  Towards a Rigorous Assessment of Systems Biology Models: The DREAM3 Challenges , 2010, PloS one.

[7]  M. Stumpf,et al.  Incomplete and noisy network data as a percolation process , 2010, Journal of The Royal Society Interface.

[8]  Patrick S. Stumpf,et al.  Stem Cell Differentiation as a Non-Markov Stochastic Process , 2017, Cell systems.

[9]  Christopher A. Penfold,et al.  How to infer gene networks from expression profiles, revisited , 2011, Interface Focus.

[10]  Gary D Bader,et al.  The Genetic Landscape of a Cell , 2010, Science.

[11]  Carsten Peterson,et al.  Probing the role of stochasticity in a model of the embryonic stem cell – heterogeneous gene expression and reprogramming efficiency , 2012, BMC Systems Biology.

[12]  Michael P. H. Stumpf,et al.  Learning regulatory models for cell development from single cell transcriptomic data , 2017 .

[13]  Sarah Filippi,et al.  Information theory and signal transduction systems: from molecular information processing to network inference. , 2014, Seminars in cell & developmental biology.

[14]  Randall D. Beer,et al.  Nonnegative Decomposition of Multivariate Information , 2010, ArXiv.

[15]  M. Newman,et al.  Mixing patterns in networks. , 2002, Physical review. E, Statistical, nonlinear, and soft matter physics.

[16]  D. di Bernardo,et al.  How to infer gene networks from expression profiles , 2007, Molecular systems biology.

[17]  M. Porter,et al.  Critical Truths About Power Laws , 2012, Science.

[18]  Jessica C. Mar,et al.  Evaluating methods of inferring gene regulatory networks highlights their lack of performance for single cell gene expression data , 2018, BMC Bioinformatics.

[19]  Albert-László Barabási,et al.  Life's Complexity Pyramid , 2002, Science.

[20]  Péter Csermely,et al.  A unified data representation theory for network visualization, ordering and coarse-graining , 2014, Scientific Reports.

[21]  Christophe Dessimoz,et al.  Gene Ontology: Pitfalls, Biases, and Remedies. , 2016, Methods in molecular biology.

[22]  Diogo M. Camacho,et al.  Wisdom of crowds for robust gene network inference , 2012, Nature Methods.

[23]  D G Bates,et al.  Validation and invalidation of systems biology models using robustness analysis. , 2011, IET systems biology.

[24]  T. M. Murali,et al.  Benchmarking algorithms for gene regulatory network inference from single-cell transcriptomic data , 2019, Nature Methods.

[25]  Gary D Bader,et al.  Global Mapping of the Yeast Genetic Interaction Network , 2004, Science.

[26]  M E J Newman Assortative mixing in networks. , 2002, Physical review letters.

[27]  Gianluca Bontempi,et al.  minet: A R/Bioconductor Package for Inferring Large Transcriptional Networks Using Mutual Information , 2008, BMC Bioinformatics.

[28]  M. Stumpf,et al.  The Evolution of the Phage Shock Protein Response System: Interplay between Protein Function, Genomic Organization, and System Function , 2010, Molecular biology and evolution.

[29]  A. Goldbeter,et al.  Gata6, Nanog and Erk signaling control cell fate in the inner cell mass through a tristable regulatory network , 2014, Development.

[30]  J. Kinney,et al.  Equitability, mutual information, and the maximal information coefficient , 2013, Proceedings of the National Academy of Sciences.

[31]  D. Botstein,et al.  Cluster analysis and display of genome-wide expression patterns. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[32]  L. Hood,et al.  Reverse Engineering of Biological Complexity , 2007 .

[33]  Stefan Streif,et al.  Probabilistic and Set-based Model Invalidation and Estimation Using LMIs , 2013, 1311.7099.

[34]  Johannes Jaeger,et al.  Modularity, criticality, and evolvability of a developmental gene regulatory network , 2018, bioRxiv.

[35]  K. Sneppen,et al.  Specificity and Stability in Topology of Protein Networks , 2002, Science.

[36]  Michael P. H. Stumpf,et al.  Generating confidence intervals on biological networks , 2007, BMC Bioinformatics.

[37]  P. Geurts,et al.  Inferring Regulatory Networks from Expression Data Using Tree-Based Methods , 2010, PloS one.

[38]  Michael P. H. Stumpf,et al.  Inference of temporally varying Bayesian Networks , 2012, Bioinform..

[39]  Michael P. H. Stumpf,et al.  Graph spectral analysis of protein interaction network evolution , 2012, Journal of The Royal Society Interface.