ProPhyC: A Probabilistic Phylogenetic Model for Refining Regulatory Networks

The experimental determination of transcriptional regulatory networks in the laboratory remains difficult and time-consuming, while computational methods to infer these networks provide only modest accuracy. The latter can be attributed in part to the limitations of a single-organism approach. Computational biology has long used comparative and, more generally, evolutionary approaches to extend the reach and accuracy of its analyses. We therefore use an evolutionary approach to the inference of regulatory networks, which enables us to study evolutionary models for these networks as well as to improve the accuracy of inferred networks. We describe ProPhyC, a probabilistic phylogenetic model and associated inference algorithms, designed to improve the inference of regulatory networks for a family of organisms by using known evolutionary relationships among these organisms. ProPhyC can be used with various network evolutionary models and any existing inference method. We demonstrate its applicability with two different network evolutionary models: one that considers only the gains and losses of regulatory connections during evolution, and one that also takes into account the duplications and losses of genes. Extensive experimental results on both biological and synthetic data confirm that our model (through its associated refinement algorithms) yields substantial improvement in the quality of inferred networks over all current methods.

[1]  D. Hillis Approaches for Assessing Phylogenetic Accuracy , 1995 .

[2]  Xiuwei Zhang,et al.  Refining transcriptional regulatory networks using network evolutionary models and gene histories , 2010, Algorithms for Molecular Biology.

[3]  Satoru Miyano,et al.  Identification of Genetic Networks from a Small Number of Gene Expression Patterns Under the Boolean Network Model , 1998, Pacific Symposium on Biocomputing.

[4]  Michal Linial,et al.  Using Bayesian Networks to Analyze Expression Data , 2000, J. Comput. Biol..

[5]  Bengt Sennblad,et al.  Gene tree reconstruction and orthology analysis based on an integrated model for duplications and sequence evolution , 2004, RECOMB.

[6]  Michael I. Jordan Learning in Graphical Models , 1999, NATO ASI Series.

[7]  A. Wagner,et al.  Structure and evolution of protein interaction networks: a statistical model for link dynamics and gene duplications , 2002, BMC Evolutionary Biology.

[8]  Satoru Miyano,et al.  Inferring gene networks from time series microarray data using dynamic Bayesian networks , 2003, Briefings Bioinform..

[9]  Lars Arvestad,et al.  Evolution after gene duplication: models, mechanisms, sequences, systems, and organisms. , 2007, Journal of experimental zoology. Part B, Molecular and developmental evolution.

[10]  Jotun Hein,et al.  A Bayesian Approach to the Evolution of Metabolic Networks on a Phylogeny , 2010, PLoS Comput. Biol..

[11]  R. Page,et al.  From gene to organismal phylogeny: reconciled trees and the gene tree/species tree problem. , 1997, Molecular phylogenetics and evolution.

[12]  Tandy J. Warnow,et al.  Reconstructing Optimal Phylogenetic Trees: A Challenge in Experimental Algorithmics , 2000, Experimental Algorithmics.

[13]  Dannie Durand,et al.  A hybrid micro-macroevolutionary approach to gene tree reconstruction. , 2006 .

[14]  A. Regev,et al.  Conservation and evolvability in regulatory networks: the evolution of ribosomal regulation in yeast. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[15]  Manolis Kellis,et al.  Reliable prediction of regulator targets using 12 Drosophila genomes. , 2007, Genome research.

[16]  Xiuwei Zhang,et al.  Boosting the Performance of Inference Algorithms for Transcriptional Regulatory Networks Using a Phylogenetic Approach , 2008, WABI.

[17]  David Heckerman,et al.  A Tutorial on Learning with Bayesian Networks , 1999, Innovations in Bayesian Networks.

[18]  Ting Chen,et al.  Modeling Gene Expression with Differential Equations , 1998, Pacific Symposium on Biocomputing.

[19]  Xiuwei Zhang,et al.  Improving Inference of Transcriptional Regulatory Networks Based on Network Evolutionary Models , 2009, WABI.

[20]  David Osumi-Sutherland,et al.  FlyBase: enhancing Drosophila Gene Ontology annotations , 2008, Nucleic Acids Res..

[21]  S. Teichmann,et al.  Evolutionary dynamics of prokaryotic transcriptional regulatory networks. , 2006, Journal of molecular biology.

[22]  Nicola J. Rinaldi,et al.  Transcriptional regulatory code of a eukaryotic genome , 2004, Nature.

[23]  Catherine C. McGeoch Experimental algorithmics , 2007, CACM.

[24]  S. Teichmann,et al.  Gene regulatory network growth by duplication , 2004, Nature Genetics.

[25]  S Fuhrman,et al.  Reveal, a general reverse engineering algorithm for inference of genetic network architectures. , 1998, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[26]  Manolis Kellis,et al.  Systematic discovery and characterization of fly microRNAs using 12 Drosophila genomes. , 2007, Genome research.

[27]  Rita Casadio,et al.  Algorithms in Bioinformatics, 5th International Workshop, WABI 2005, Mallorca, Spain, October 3-6, 2005, Proceedings , 2005, WABI.

[28]  Martin Wainwright,et al.  Learning in graphical models: Missing data and rigorous guarantees with non-convexity , 2011 .

[29]  Kiyoko F. Aoki-Kinoshita,et al.  From genomics to chemical genomics: new developments in KEGG , 2005, Nucleic Acids Res..

[30]  David Sankoff,et al.  Improving Gene Network Inference by Comparing Expression Time-series across Species, Developmental Stages or Tissues , 2004, J. Bioinform. Comput. Biol..

[31]  Anton Crombach,et al.  Evolution of Evolvability in Gene Regulatory Networks , 2008, PLoS Comput. Biol..

[32]  Saurabh Sinha,et al.  Evolution of Regulatory Sequences in 12 Drosophila Species , 2009, PLoS genetics.