Loss of Conservation of Graph Centralities in Reverse-engineered Transcriptional Regulatory Networks

Graph centralities are commonly used to identify and prioritize disease genes in transcriptional regulatory networks. Studies on small networks of experimentally validated protein-protein interactions underpin the general validity of this approach and extensions of such findings have recently been proposed for networks inferred from gene expression data. However, it is largely unknown how well gene centralities are preserved between the underlying biological interactions and the networks inferred from gene expression data. Specifically, while previous studies have evaluated the performance of inference methods on synthetic gene expression, it has not been established how the choice of inference method affects individual centralities in the network. Here, we compare two gene centrality measures between reference networks and networks inferred from corresponding simulated gene expression data, using a number of commonly used network inference methods. The results indicate that the centrality of genes is only moderately conserved for all of the inference methods used. In conclusion, caution should be exercised when inspecting centralities in reverse-engineered networks and further work will be required to establish the use of such networks for prioritizing disease genes.

[1]  M. Vingron,et al.  Gene expression profile of mouse bone marrow stromal cells determined by cDNA microarray analysis , 2003, Cell and Tissue Research.

[2]  Paul Erdös,et al.  On random graphs, I , 1959 .

[3]  Adam A. Margolin,et al.  Reverse engineering of regulatory networks in human B cells , 2005, Nature Genetics.

[4]  Xiao Zhang,et al.  Molecular Network Analysis and Applications , 2010 .

[5]  Dario Floreano,et al.  GeneNetWeaver: in silico benchmark generation and performance profiling of network inference methods , 2011, Bioinform..

[6]  Diogo M. Camacho,et al.  Wisdom of crowds for robust gene network inference , 2012, Nature Methods.

[7]  Kevin Y. Yip,et al.  Improved Reconstruction of In Silico Gene Regulatory Networks by Integrating Knockout and Perturbation Data , 2010, PloS one.

[8]  U. Brandes A faster algorithm for betweenness centrality , 2001 .

[9]  Xavier Solé,et al.  Large differences in global transcriptional regulatory programs of normal and tumor colon cells , 2014, BMC Cancer.

[10]  Bhaskara Rao Siddani,et al.  Candidate Gene Identification for Systemic Lupus Erythematosus Using Network Centrality Measures and Gene Ontology , 2013, PloS one.

[11]  Frank Emmert-Streib,et al.  Inferring the conservative causal core of gene regulatory networks , 2010, BMC Systems Biology.

[12]  Paul A. Bates,et al.  Global topological features of cancer proteins in the human interactome , 2006, Bioinform..

[13]  F. Schreiber,et al.  Centrality Analysis Methods for Biological Networks and Their Application to Gene Regulatory Networks , 2008, Gene regulation and systems biology.

[14]  M. Gerstein,et al.  Getting connected: analysis and principles of biological networks. , 2007, Genes & development.

[15]  Marika Vezzoli,et al.  On Two Classes of Weighted Rank Correlation Measures Deriving from the Spearman's ρ , 2013, Statistical Models for Data Analysis.

[16]  Xing Qiu,et al.  Modeling Genome-Wide Dynamic Regulatory Network in Mouse Lungs with Influenza Infection Using High-Dimensional Ordinary Differential Equations , 2014, PloS one.

[17]  Ernesto Estrada Virtual identification of essential proteins within the protein interaction network of yeast , 2005, Proteomics.

[18]  Jian Zhu,et al.  Systematic identification of transcriptional and post-transcriptional regulations in human respiratory epithelial cells during influenza A virus infection , 2014, BMC Bioinformatics.

[19]  Lin Gao,et al.  Biological network analysis: insights into structure and functions. , 2012, Briefings in functional genomics.

[20]  M. DePamphilis,et al.  HUMAN DISEASE , 1957, The Ulster Medical Journal.

[21]  Linda M. Wills,et al.  Reverse Engineering , 1996, Springer US.

[22]  Welch Bl THE GENERALIZATION OF ‘STUDENT'S’ PROBLEM WHEN SEVERAL DIFFERENT POPULATION VARLANCES ARE INVOLVED , 1947 .

[23]  Kevin Kontos,et al.  Information-Theoretic Inference of Large Transcriptional Regulatory Networks , 2007, EURASIP J. Bioinform. Syst. Biol..

[24]  A. Barabasi,et al.  The human disease network , 2007, Proceedings of the National Academy of Sciences.

[25]  B. L. Welch The generalisation of student's problems when several different population variances are involved. , 1947, Biometrika.

[26]  Gianluca Bontempi,et al.  minet: A R/Bioconductor Package for Inferring Large Transcriptional Networks Using Mutual Information , 2008, BMC Bioinformatics.

[27]  Shinichiro Wachi,et al.  Interactome-transcriptome analysis reveals the high centrality of genes differentially expressed in lung cancer tissues , 2005, Bioinform..

[28]  Dragomir R. Radev,et al.  Identifying gene-disease associations using centrality on a literature mined gene-interaction network , 2008, ISMB.

[29]  Matthias Dehmer,et al.  B-cell lymphoma gene regulatory networks: biological consistency among inference methods , 2013, Front. Genet..

[30]  M. Zavelani-Rossi,et al.  Study of higher-energy core states in CdSe/CdS octapod nanocrystals by ultrafast spectroscopy , 2012 .

[31]  Mauno Vihinen,et al.  Identification of candidate disease genes by integrating Gene Ontologies and protein-interaction networks: case study of primary immunodeficiencies , 2008, Nucleic acids research.

[32]  Andrea Califano,et al.  Theory and Limitations of Genetic Network Inference from Microarray Data , 2007, Annals of the New York Academy of Sciences.

[33]  D. Ingber,et al.  High-Betweenness Proteins in the Yeast Protein Interaction Network , 2005, Journal of biomedicine & biotechnology.

[34]  Chris Wiggins,et al.  ARACNE: An Algorithm for the Reconstruction of Gene Regulatory Networks in a Mammalian Cellular Context , 2004, BMC Bioinformatics.

[35]  Yongjin Li,et al.  Discovering disease-genes by topological features in human protein-protein interaction network , 2006, Bioinform..

[36]  Torbjörn E. M. Nordling,et al.  Network modeling of the transcriptional effects of copy number aberrations in glioblastoma , 2011, Molecular systems biology.

[37]  Matthew W. Hahn,et al.  Comparative genomics of centrality and essentiality in three eukaryotic protein-interaction networks. , 2005, Molecular biology and evolution.

[38]  Leonard M. Freeman,et al.  A set of measures of centrality based upon betweenness , 1977 .

[39]  Ernesto Estrada Protein bipartivity and essentiality in the yeast protein-protein interaction network. , 2006, Journal of proteome research.

[40]  L. Aravind,et al.  Interplay between gene expression noise and regulatory network architecture. , 2012, Trends in genetics : TIG.

[41]  Frank Emmert-Streib,et al.  Bagging Statistical Network Inference from Large-Scale Gene Expression Data , 2012, PloS one.

[42]  Frank Emmert-Streib,et al.  Revealing differences in gene network inference algorithms on the network level by ensemble methods , 2010, Bioinform..

[43]  David F. Gleich,et al.  Models and algorithms for pagerank sensitivity , 2009 .

[44]  Xing-Ming Zhao,et al.  Inferring gene regulatory networks from gene expression data by path consistency algorithm based on conditional mutual information , 2012, Bioinform..

[45]  Patrick E. Meyer,et al.  Inferring mutual information networks using the minet package , 2008 .

[46]  Mahesan Niranjan,et al.  Cross-Platform Analysis with Binarized Gene Expression Data , 2009, PRIB.

[47]  Matthias Dehmer,et al.  The gene regulatory network for breast cancer: integrated regulatory landscape of cancer hallmarks , 2014, Front. Genet..

[48]  Zhi-Ping Liu,et al.  Reverse Engineering of Genome-wide Gene Regulatory Networks from Gene Expression Data , 2015, Current genomics.

[49]  R. Albert Scale-free networks in cell biology , 2005, Journal of Cell Science.

[50]  Steve Horvath,et al.  WGCNA: an R package for weighted correlation network analysis , 2008, BMC Bioinformatics.

[51]  A. Barabasi,et al.  Lethality and centrality in protein networks , 2001, Nature.

[52]  Cancer gene identification using graph centrality , .

[53]  Pedro Mendes,et al.  Artificial gene networks for objective comparison of analysis algorithms , 2003, ECCB.

[54]  Benno Schwikowski,et al.  Graph-based methods for analysing networks in cell biology , 2006, Briefings Bioinform..

[55]  J. Collins,et al.  Large-Scale Mapping and Validation of Escherichia coli Transcriptional Regulation from a Compendium of Expression Profiles , 2007, PLoS biology.

[56]  D. Floreano,et al.  Revealing strengths and weaknesses of methods for gene network inference , 2010, Proceedings of the National Academy of Sciences.

[57]  Peter Dittrich,et al.  Artificial Gene Regulation: A Data Source for Validation of Reverse Bioengineering , 2004 .

[58]  Christophe Benoist,et al.  Transcriptomes of the B and T Lineages Compared by Multiplatform Microarray Profiling , 2011, The Journal of Immunology.

[59]  Kathleen Marchal,et al.  SynTReN: a generator of synthetic gene expression data for design and analysis of structure learning algorithms , 2006, BMC Bioinformatics.

[60]  Gary D. Bader,et al.  Pathway Commons, a web resource for biological pathway data , 2010, Nucleic Acids Res..

[61]  Peter Langfelder,et al.  When Is Hub Gene Selection Better than Standard Meta-Analysis? , 2013, PloS one.

[62]  P. Geurts,et al.  Inferring Regulatory Networks from Expression Data Using Tree-Based Methods , 2010, PloS one.

[63]  Francesco Pozzi,et al.  Exponential smoothing weighted correlations , 2012 .