Trees on networks: resolving statistical patterns of phylogenetic similarities among interacting proteins

Background: Phylogenies capture the evolutionary ancestry linking extant species. Correlations and similarities among a set of species are mediated by the phylogenetic tree and need to be understood in terms of it. Similarly, it has been argued that biological networks induce correlations among sets of interacting genes or their protein products.

Results: We develop statistical resampling schemes that incorporate these two potential sources of correlation into a single inferential framework. To illustrate our approach, we apply it to protein interaction data in yeast and investigate whether the phylogenetic trees of interacting proteins in a panel of yeast species are more similar than would be expected by chance.

Conclusions: While we find only negligible evidence for such increased levels of similarity, our statistical approach allows us to resolve the previously reported contradictory results on the levels of co-evolution induced by protein-protein interactions. We conclude with a discussion of how the statistical framework developed here may be employed in further functional and evolutionary analyses of biological networks and systems.
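The abstract does not spell out the resampling schemes themselves. As a rough illustration of the general idea only, the sketch below tests whether interacting pairs score higher on some tree-similarity measure than pairs in randomized networks that keep every protein's number of interactions fixed. It is not the authors' procedure: the function names, the edge-swap null model, and the `score` dictionary (a precomputed similarity for every protein pair) are all assumptions made for this example, and the sketch accounts only for the network structure, not for the shared species phylogeny.

```python
import random


def degree_preserving_rewire(edges, n_swaps, rng):
    """Rewire an undirected simple graph by double edge swaps.

    Each accepted swap turns edges (a, b), (c, d) into (a, d), (c, b),
    so every node keeps its degree. Swaps that would create a self-loop
    or a duplicate edge are rejected. Hypothetical helper for illustration.
    """
    edges = [tuple(e) for e in edges]
    present = {frozenset(e) for e in edges}
    for _ in range(n_swaps):
        i, j = rng.sample(range(len(edges)), 2)
        (a, b), (c, d) = edges[i], edges[j]
        if rng.random() < 0.5:  # pick one of the two possible pairings
            c, d = d, c
        if a == d or c == b:  # would create a self-loop
            continue
        new1, new2 = frozenset((a, d)), frozenset((c, b))
        if new1 in present or new2 in present or new1 == new2:
            continue  # would create a duplicate edge
        present.discard(frozenset(edges[i]))
        present.discard(frozenset(edges[j]))
        present.update((new1, new2))
        edges[i], edges[j] = (a, d), (c, b)
    return edges


def mean_edge_score(edges, score):
    """Average similarity score over all interacting pairs."""
    return sum(score[frozenset(e)] for e in edges) / len(edges)


def network_permutation_pvalue(edges, score, n_null=200, swaps_per_edge=10, seed=0):
    """One-sided Monte Carlo p-value for 'interacting pairs are more similar'.

    `score` maps frozenset({u, v}) to a similarity value and must cover
    every pair that rewiring can produce (assumed precomputed elsewhere).
    """
    rng = random.Random(seed)
    observed = mean_edge_score(edges, score)
    exceed = 0
    for _ in range(n_null):
        null_edges = degree_preserving_rewire(edges, swaps_per_edge * len(edges), rng)
        if mean_edge_score(null_edges, score) >= observed:
            exceed += 1
    # Add-one correction keeps the estimated p-value strictly positive.
    return observed, (exceed + 1) / (n_null + 1)
```

Calling `network_permutation_pvalue(edges, score)` with a list of interacting protein pairs and a dictionary of pairwise tree similarities returns the observed mean similarity and an empirical p-value. Fixing the degree sequence is one common choice of null model; a scheme that also respects the species phylogeny, as the abstract describes, would need an additional resampling step that this sketch leaves out.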
