Efficient randomization of biological networks while preserving functional characterization of individual nodes

BackgroundNetworks are popular and powerful tools to describe and model biological processes. Many computational methods have been developed to infer biological networks from literature, high-throughput experiments, and combinations of both. Additionally, a wide range of tools has been developed to map experimental data onto reference biological networks, in order to extract meaningful modules. Many of these methods assess results’ significance against null distributions of randomized networks. However, these standard unconstrained randomizations do not preserve the functional characterization of the nodes in the reference networks (i.e. their degrees and connection signs), hence including potential biases in the assessment.ResultsBuilding on our previous work about rewiring bipartite networks, we propose a method for rewiring any type of unweighted networks. In particular we formally demonstrate that the problem of rewiring a signed and directed network preserving its functional connectivity (F-rewiring) reduces to the problem of rewiring two induced bipartite networks. Additionally, we reformulate the lower bound to the iterations’ number of the switching-algorithm to make it suitable for the F-rewiring of networks of any size. Finally, we present BiRewire3, an open-source Bioconductor package enabling the F-rewiring of any type of unweighted network. We illustrate its application to a case study about the identification of modules from gene expression data mapped on protein interaction networks, and a second one focused on building logic models from more complex signed-directed reference signaling networks and phosphoproteomic data.ConclusionsBiRewire3 it is freely available at https://www.bioconductor.org/packages/BiRewire/, and it should have a broad application as it allows an efficient and analytically derived statistical assessment of results from any network biology tool.

[1]  Gábor Csárdi,et al.  The igraph software package for complex network research , 2006 .

[2]  T. Ideker,et al.  Integrative approaches for finding modular structure in biological networks , 2013, Nature Reviews Genetics.

[3]  Ali Pinar,et al.  Are We There Yet? When to Stop a Markov Chain while Generating Random Graphs , 2012, WAW.

[4]  Tobias Müller,et al.  Identifying functional modules in protein–protein interaction networks: an integrated exact approach , 2008, ISMB.

[5]  Nicholas J. Gotelli,et al.  Swap and fill algorithms in null model analysis: rethinking the knight's tour , 2001, Oecologia.

[6]  Evan O. Paull,et al.  Inferring causal molecular networks: empirical assessment through a community-based effort , 2016, Nature Methods.

[7]  P. Jaccard,et al.  Etude comparative de la distribution florale dans une portion des Alpes et des Jura , 1901 .

[8]  Tobias Müller,et al.  Bioinformatics Applications Note Systems Biology Bionet: an R-package for the Functional Analysis of Biological Networks , 2022 .

[9]  Avi Ma’ayan Introduction to Network Analysis in Systems Biology , 2011, Science Signaling.

[10]  Julio Saez-Rodriguez,et al.  Fast randomization of large genomic datasets while preserving alteration counts , 2014, Bioinform..

[11]  Xin Wang,et al.  Bioinformatics Applications Note Systems Biology Htsanalyzer: an R/bioconductor Package for Integrated Network Analysis of High-throughput Screens , 2022 .

[12]  Isabelle Stanton,et al.  Constructing and sampling graphs with a prescribed joint degree distribution , 2011, JEAL.

[13]  David F. Gleich,et al.  Algorithms and Models for the Web Graph , 2014, Lecture Notes in Computer Science.

[14]  Edward A. Bender,et al.  The Asymptotic Number of Labeled Graphs with Given Degree Sequences , 1978, J. Comb. Theory A.

[15]  Alfonso Valencia,et al.  EnrichNet: network-based gene set enrichment analysis , 2012, Bioinform..

[16]  Eli Upfal,et al.  Algorithms for Detecting Significantly Mutated Pathways in Cancer , 2010, RECOMB.

[17]  Joachim Selbig,et al.  Mass-balanced randomization of metabolic networks , 2011, Bioinform..

[18]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[19]  G. Hong,et al.  Nucleic Acids Research , 2015, Nucleic Acids Research.

[20]  R. Plummer,et al.  Progress in the Function and Regulation of ADP-Ribosylation , 2011, Science Signaling.

[21]  Giovanni Strona,et al.  A fast and unbiased procedure to randomize ecological binary matrices with fixed row and column totals , 2014, Nature Communications.

[22]  María Rodríguez Martínez,et al.  Elucidating Compound Mechanism of Action by Network Perturbation Analysis Graphical , 2015 .

[23]  Julio Saez-Rodriguez,et al.  Identification of drug-specific pathways based on gene expression data: application to drug induced lung injury. , 2015, Integrative biology : quantitative biosciences from nano to macro.

[24]  Julio Saez-Rodriguez,et al.  CellNOptR: a flexible toolkit to train protein signaling networks to data using multiple logic formalisms , 2012, BMC Systems Biology.

[25]  F. Iorio,et al.  Integrated transcriptomic and proteomic analysis identifies protein kinase CK2 as a key signaling node in an inflammatory cytokine network in ovarian cancer cells , 2016, Oncotarget.

[26]  Reinhard Schneider,et al.  Using graph theory to analyze biological networks , 2011, BioData Mining.

[27]  E S Roberts,et al.  Unbiased degree-preserving randomization of directed binary networks. , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.

[28]  Julio Saez-Rodriguez,et al.  Modeling Signaling Networks to Advance New Cancer Therapies. , 2015, Annual review of biomedical engineering.

[29]  Paola Lecca,et al.  Biological network inference for drug discovery. , 2013, Drug discovery today.

[30]  Benno Schwikowski,et al.  Discovering regulatory and signalling circuits in molecular interaction networks , 2002, ISMB.

[31]  Julio Saez-Rodriguez,et al.  Network based elucidation of drug response: from modulators to targets , 2013, BMC Systems Biology.

[32]  M. Newman,et al.  On the uniform generation of random graphs with prescribed degree sequences , 2003, cond-mat/0312028.

[33]  W. K. Chen Graph theory and its engineering applications , 1997 .