Prediction of protein interaction based on similarity of phylogenetic trees.

Computational methods for predicting protein interaction partners are becoming increasingly popular. Many of them are mature enough to be widely used by molecular biologists who can look for proteins related to the protein of interest in order to infer information about its context in the cell. In this chapter we describe the use of the mirrortree set of programs and related software for predicting protein interactions. They are all based on the idea that interacting or functionally related proteins tend to show similar phylogenetic trees due to coevolution. The basic mirrortree program can be used to calculate the similarity between the phylogenetic trees implicit in the multiple sequence alignments of two protein families. The ECID database contains protein interactions and relationships from different computational and experimental sources for the model organism Escherichia coli, including the ones generated with mirrortree. Finally, the TSEMA server uses the concept of tree similarity between interacting families to look for the best mapping between two families of interacting proteins: which member in one family interacts with which member in the other.

[1]  D. Eisenberg,et al.  Detecting protein function and protein-protein interactions from genome sequences. , 1999, Science.

[2]  Bernard Labedan,et al.  Using quaternary structures to assess the evolutionary history of proteins: the case of the aspartate carbamoyltransferase. , 2003, Molecular biology and evolution.

[3]  Yoshihiro Yamanishi,et al.  The inference of protein-protein interactions by co-evolutionary analysis is improved by excluding the information about the phylogenetic relationships , 2005, Bioinform..

[4]  Rodrigo Lopez,et al.  Multiple sequence alignment with the Clustal series of programs , 2003, Nucleic Acids Res..

[5]  M. Sternberg,et al.  Assessing protein co-evolution in the context of the tree of life assists in the prediction of the interactome. , 2005, Journal of molecular biology.

[6]  M. Gouy,et al.  WWW-query: an on-line retrieval system for biological sequence banks. , 1996, Biochimie.

[7]  A. Valencia,et al.  Computational methods for the prediction of protein interactions. , 2002, Current opinion in structural biology.

[8]  A. Valencia,et al.  A gene network for navigating the literature , 2004, Nature Genetics.

[9]  A. Valencia,et al.  Similarity of phylogenetic trees as indicator of protein-protein interaction. , 2001, Protein engineering.

[10]  D. Eisenberg,et al.  Computational methods of analysis of protein-protein interactions. , 2003, Current opinion in structural biology.

[11]  A. Valencia,et al.  In silico two‐hybrid system for the selection of physically interacting protein pairs , 2002, Proteins.

[12]  B. Snel,et al.  Conservation of gene order: a fingerprint of proteins that physically interact. , 1998, Trends in biochemical sciences.

[13]  Warren C. Lathe,et al.  Predicting protein function by genomic context: quantitative evaluation and qualitative inferences. , 2000, Genome research.

[14]  Christian von Mering,et al.  STRING: a database of predicted functional associations between proteins , 2003, Nucleic Acids Res..

[15]  B. Snel,et al.  Comparative assessment of large-scale data sets of protein–protein interactions , 2002, Nature.

[16]  Arun K. Ramani,et al.  Exploiting the co-evolution of interacting proteins to discover interaction specificity. , 2003, Journal of molecular biology.

[17]  Alfonso Valencia,et al.  TSEMA: interactive prediction of protein pairings between interacting families , 2006, Nucleic Acids Res..

[18]  Susumu Goto,et al.  The KEGG resource for deciphering the genome , 2004, Nucleic Acids Res..

[19]  F. Cohen,et al.  Co-evolution of proteins with their interaction partners. , 2000, Journal of molecular biology.

[20]  D. Lipman,et al.  A genomic perspective on protein families. , 1997, Science.

[21]  K. J. Fryxell,et al.  The coevolution of gene family trees. , 1996, Trends in genetics : TIG.