Gene network coherence based on prior knowledge using direct and indirect relationships

Gene networks (GNs) have become one of the most important approaches for modeling biological processes. They are very useful to understand the different complex biological processes that may occur in living organisms. Currently, one of the biggest challenge in any study related with GN is to assure the quality of these GNs. In this sense, recent works use artificial data sets or a direct comparison with prior biological knowledge. However, these approaches are not entirely accurate as they only take into account direct gene-gene interactions for validation, leaving aside the weak (indirect) relationships. We propose a new measure, named gene network coherence (GNC), to rate the coherence of an input network according to different biological databases. In this sense, the measure considers not only the direct gene-gene relationships but also the indirect ones to perform a complete and fairer evaluation of the input network. Hence, our approach is able to use the whole information stored in the networks. A GNC JAVA-based implementation is available at: http://fgomezvela.github.io/GNC/. The results achieved in this work show that GNC outperforms the classical approaches for assessing GNs by means of three different experiments using different biological databases and input networks. According to the results, we can conclude that the proposed measure, which considers the inherent information stored in the direct and indirect gene-gene relationships, offers a new robust solution to the problem of GNs biological validation.

[1]  Francisco S. Roque,et al.  Dissecting spatio-temporal protein networks driving human heart development and related disorders , 2010, Molecular systems biology.

[2]  Stefan Bornholdt,et al.  Boolean network models of cellular regulation: prospects and limitations , 2008, Journal of The Royal Society Interface.

[3]  Michael Ruogu Zhang,et al.  Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization. , 1998, Molecular biology of the cell.

[4]  Hongzhe Li,et al.  A Markov random field model for network-based analysis of genomic data , 2007, Bioinform..

[5]  Susmita Datta,et al.  A statistical framework for differential network analysis from microarray data , 2010, BMC Bioinformatics.

[6]  Ralf Herwig,et al.  Cluster-based assessment of protein-protein interaction confidence , 2012, BMC Bioinformatics.

[7]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[8]  Gabriele Sales,et al.  graphite - a Bioconductor package to convert pathway topology to gene network , 2012, BMC Bioinformatics.

[9]  Gary D. Bader,et al.  The GeneMANIA prediction server: biological network integration for gene prioritization and predicting gene function , 2010, Nucleic Acids Res..

[10]  Charles Elkan,et al.  Learning gene regulatory networks from only positive and unlabeled data , 2010, BMC Bioinformatics.

[11]  Juan F. Poyatos,et al.  The Balance of Weak and Strong Interactions in Genetic Networks , 2011, PloS one.

[12]  I S Kohane,et al.  Mutual information relevance networks: functional genomic clustering using pairwise entropy measurements. , 1999, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[13]  Joachim Selbig,et al.  Validation and functional annotation of expression-based clusters based on gene ontology , 2005, BMC Bioinformatics.

[14]  Edith D. Wong,et al.  Saccharomyces Genome Database: the genomics resource of budding yeast , 2011, Nucleic Acids Res..

[15]  H.S. Muddana,et al.  Elimination of indirect regulatory interactions in gene network inference , 2006, 2006 IEEE International Workshop on Genomic Signal Processing and Statistics.

[16]  Z. Szallasi,et al.  Reliability and reproducibility issues in DNA microarray measurements. , 2006, Trends in genetics : TIG.

[17]  Frederick P. Roth,et al.  Next generation software for functional trend analysis , 2009, Bioinform..

[18]  Jesús S. Aguilar-Ruiz,et al.  Inferring gene regression networks with model trees , 2010, BMC Bioinformatics.

[19]  Yutaka Saito,et al.  An efficient algorithm for de novo predictions of biochemical pathways between chemical compounds , 2012, BMC Bioinformatics.

[20]  Diego di Bernardo,et al.  Inference of gene regulatory networks and compound mode of action from time course gene expression profiles , 2006, Bioinform..

[21]  Jugal K. Kalita,et al.  An effective method for network module extraction from microarray data , 2012, BMC Bioinformatics.

[22]  Edward R. Dougherty,et al.  Validation of gene regulatory networks: scientific and inferential , 2011, Briefings Bioinform..

[23]  Susumu Goto,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 2000, Nucleic Acids Res..

[24]  Andrew J. Bulpitt,et al.  A Primer on Learning in Bayesian Networks for Computational Biology , 2007, PLoS Comput. Biol..

[25]  Reinhard Schneider,et al.  Using graph theory to analyze biological networks , 2011, BioData Mining.

[26]  Jessica Andrea Carballido,et al.  Discovering time-lagged rules from microarray data using gene profile classifiers , 2011, BMC Bioinformatics.

[27]  Kara Dolinski,et al.  The BioGRID Interaction Database: 2008 update , 2008, Nucleic Acids Res..

[28]  Yan Zhou,et al.  Evaluation of GO-based functional similarity measures using S. cerevisiae protein interaction and expression profile data , 2008, BMC Bioinformatics.

[29]  Jesús S. Aguilar-Ruiz,et al.  CarGene: Characterisation of sets of genes based on metabolic pathways analysis , 2011, Int. J. Data Min. Bioinform..

[30]  Andrey Alexeyenko,et al.  Network enrichment analysis: extension of gene-set enrichment analysis to gene networks , 2012, BMC Bioinformatics.

[31]  Christopher A. Penfold,et al.  How to infer gene networks from expression profiles, revisited , 2011, Interface Focus.

[32]  Michael Hecker,et al.  Gene regulatory network inference: Data integration in dynamic models - A review , 2009, Biosyst..

[33]  Dario Floreano,et al.  GeneNetWeaver: in silico benchmark generation and performance profiling of network inference methods , 2011, Bioinform..

[34]  Jugal K. Kalita,et al.  Reconstruction of gene co-expression network from microarray data using local expression patterns , 2014, BMC Bioinformatics.

[35]  James Bailey,et al.  Discovery and analysis of consistent active sub-networks in cancers , 2013, BMC Bioinformatics.

[36]  Francis J. Doyle,et al.  Core module biomarker identification with network exploration for breast cancer metastasis , 2012, BMC Bioinformatics.

[37]  E. Marcotte,et al.  An Improved, Bias-Reduced Probabilistic Functional Gene Network of Baker's Yeast, Saccharomyces cerevisiae , 2007, PloS one.

[38]  Eytan Domany,et al.  Intensity dependent estimation of noise in microarrays improves detection of differentially expressed genes , 2010, BMC Bioinformatics.

[39]  Amir Salehipour,et al.  Speeding up the Floyd-Warshall algorithm for the cycled shortest path problem , 2012, Appl. Math. Lett..

[40]  Edward R Dougherty,et al.  Validation of Inference Procedures for Gene Regulatory Networks , 2007, Current genomics.

[41]  E. Dougherty,et al.  Inferring Connectivity of Genetic Regulatory Networks Using Information-Theoretic Criteria , 2008, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[42]  A. Brazma,et al.  Towards reconstruction of gene networks from expression data by supervised learning , 2003, Genome Biology.

[43]  David Warde-Farley,et al.  GeneMANIA: a real-time multiple association network integration algorithm for predicting gene function , 2008, Genome Biology.

[44]  Hongzhe Li,et al.  In Response to Comment on "Network-constrained regularization and variable selection for analysis of genomic data" , 2008, Bioinform..

[45]  Gary D Bader,et al.  The Genetic Landscape of a Cell , 2010, Science.

[46]  D.H. Glass,et al.  Inferring Adaptive Regulation Thresholds and Association Rules from Gene Expression Data through Combinatorial Optimization Learning , 2007, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[47]  Deendayal Dinakarpandian,et al.  Tandem machine learning for the identification of genes regulated by transcription factors , 2005, BMC Bioinformatics.

[48]  Adriana Climescu-Haulica,et al.  A stochastic differential equation model for transcriptional regulatory networks , 2007, BMC Bioinformatics.

[49]  Roland Eils,et al.  Inferring genetic regulatory logic from expression data , 2005, Bioinform..

[50]  L. Aravind,et al.  Comprehensive analysis of combinatorial regulation using the transcriptional regulatory network of yeast. , 2006, Journal of molecular biology.

[51]  Christie S. Chang,et al.  The BioGRID interaction database: 2013 update , 2012, Nucleic Acids Res..

[52]  Kathleen Marchal,et al.  SynTReN: a generator of synthetic gene expression data for design and analysis of structure learning algorithms , 2006, BMC Bioinformatics.

[53]  Yasser M Kadah,et al.  Construction of gene regulatory networks using biclustering and bayesian networks , 2011, Theoretical Biology and Medical Modelling.