Phylogenetic Transfer of Knowledge for Biological Networks

Advances in biotechnology have enabled researchers to study molecular bi- ology from the point of view of systems, from focused efforts at functional annotation to the study of pathways, regulatory networks, protein-protein interaction networks, etc. However, direct observation of these systems has proved difficult, time-consuming, and often unreliable. Thus computational methods have been developed to infer such sys- tems from high-throughput data, such as sequences, gene expression levels, ChIP-Seq signals, etc. For the most part, such methods have not yet proved accurate and reli- able enough to be used in automated analysis pipelines. Most methods used to infer biological networks rely on data for a single organism; a few attempt to leverage ex- isting knowledge about some related organisms. Today, however, we have data about a large variety of organisms as well as good consensus about the evolutionary relation- ships among these organisms, so that the latter can be used to integrate the former in a well founded manner, thereby gaining significant power in the analysis. We have coined the term phylogenetic transfer of knowledge (PTK) for this approach to inference and analysis. A PTK analysis considers a family of organisms with known evolutionary re- lationships and "transfers" biological knowledge among the organisms in accordance with these relationships. The output of a PTK analysis thus includes both predicted (or refined) target data for the extant organisms and inferred details about their evolution- ary history. While a few ad hoc inference methods used a PTK approach almost a dozen years ago, we first provided a global perspective on such methods just 6 years ago. The last few years have seen a significant increase in research in this area, as well as new ap- plications. The time is thus right for a review of recent work that falls under this heading, a characterization of the solutions proposed, and a description of remaining challenges.

[1]  Robert Patro,et al.  Predicting protein interactions via parsimonious network history inference , 2013, Bioinform..

[2]  Alexander V. Favorov,et al.  Hidden Markov Models for Evolution and Comparative Genomics Analysis , 2013, PloS one.

[3]  Manolis Kellis,et al.  Arboretum: Reconstruction and analysis of the evolutionary history of condition-specific transcriptional modules , 2013, Genome research.

[4]  Xiuwei Zhang,et al.  Refining Regulatory Networks through Phylogenetic Transfer of Information , 2012, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[5]  Robert Patro,et al.  Parsimonious reconstruction of network evolution , 2011, Algorithms for Molecular Biology.

[6]  Suzanna Lewis,et al.  Phylogenetic-based propagation of functional annotations within the Gene Ontology consortium , 2011, Briefings Bioinform..

[7]  Michael I. Jordan,et al.  Genome-scale phylogenetic function annotation of large and diverse protein families. , 2011, Genome research.

[8]  Xiuwei Zhang,et al.  ProPhyC: A Probabilistic Phylogenetic Model for Refining Regulatory Networks , 2011, ISBRA.

[9]  Qiang Yang,et al.  A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[10]  Carl Kingsford,et al.  Network Archaeology: Uncovering Ancient Networks from Present-Day Interactions , 2010, PLoS Comput. Biol..

[11]  Jotun Hein,et al.  A Bayesian Approach to the Evolution of Metabolic Networks on a Phylogeny , 2010, PLoS Comput. Biol..

[12]  Thomas Rattei,et al.  The Evolutionary Dynamics of Protein-Protein Interaction Networks Inferred from the Reconstruction of Ancient Networks , 2010, PloS one.

[13]  Jerzy Tiuryn,et al.  Phylogeny-guided interaction mapping in seven eukaryotes , 2009, BMC Bioinformatics.

[14]  Damian Szklarczyk,et al.  eggNOG v2.0: extending the evolutionary genealogy of genes with enhanced non-supervised orthologous groups, species and functional annotations , 2009, Nucleic Acids Res..

[15]  Xiuwei Zhang,et al.  Improving Inference of Transcriptional Regulatory Networks Based on Network Evolutionary Models , 2009, WABI.

[16]  Xiuwei Zhang,et al.  Boosting the Performance of Inference Algorithms for Transcriptional Regulatory Networks Using a Phylogenetic Approach , 2008, WABI.

[17]  Magnus Rattray,et al.  Reconstruction of ancestral protein interaction networks for the bZIP transcription factors , 2007, Proceedings of the National Academy of Sciences.

[18]  N. Friedman,et al.  Natural history and evolutionary principles of gene duplication in fungi , 2007, Nature.

[19]  Janusz Dutkowski,et al.  Protein-protein Interactions , 2022 .

[20]  Michael I. Jordan,et al.  Protein Molecular Function Prediction by Bayesian Phylogenomics , 2005, PLoS Comput. Biol..

[21]  Dannie Durand,et al.  A Hybrid Micro-Macroevolutionary Approach to Gene Tree Reconstruction , 2005, RECOMB.

[22]  David Sankoff,et al.  Improving Gene Network Inference by Comparing Expression Time-series across Species, Developmental Stages or Tissues , 2004, J. Bioinform. Comput. Biol..

[23]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems - networks of plausible inference , 1991, Morgan Kaufmann series in representation and reasoning.