Several computational methods have recently been developed for the prediction of protein interactions. The first class of methods uses sequence information exclusively. It includes three methods rooted in comparative genomics (phylogenetic profiles, conserved gene neighborhood, and gene fusion) and two methods that take multiple sequence alignments as input (the in silico two-hybrid and MirrorTree methods). Interestingly, the latter two methods can be extended beyond the prediction of interaction partners to the detection of interaction regions and key functional residues, as demonstrated in various experimental systems. The second class of methods exploits protein structure information, either by analyzing the observed combinations of protein domains or by extrapolating the structural information of protein complexes to compatible sequences.
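As a rough illustration of the phylogenetic profile idea, the sketch below pairs proteins whose presence/absence patterns across a set of genomes are identical or nearly so. The protein names, the five-genome profiles, and the `max_distance` threshold are hypothetical values chosen for the example, and Hamming distance is only one of several possible ways to compare profiles; this is a minimal sketch, not the procedure of any published method.

```python
from itertools import combinations

# Hypothetical presence/absence profiles across five genomes (1 = gene present).
profiles = {
    "protA": (1, 0, 1, 1, 0),
    "protB": (1, 0, 1, 1, 0),
    "protC": (0, 1, 0, 1, 1),
}


def hamming(p, q):
    """Count the genomes in which the two profiles disagree."""
    return sum(a != b for a, b in zip(p, q))


def candidate_pairs(profiles, max_distance=0):
    """Return protein pairs whose profiles differ in at most max_distance genomes."""
    pairs = []
    for (name1, prof1), (name2, prof2) in combinations(sorted(profiles.items()), 2):
        if hamming(prof1, prof2) <= max_distance:
            pairs.append((name1, name2))
    return pairs


print(candidate_pairs(profiles))  # [('protA', 'protB')]
```

Proteins with matching profiles are only candidates for a functional link; the profile comparison by itself does not distinguish physical interaction from other kinds of functional association.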
Several studies have addressed the possibility of combining predicted and experimentally derived interactions. The current view is that the overlap between the interactions predicted by different methods is relatively small, and that each experimental or computational method appears to detect a specific type of protein interaction.
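One simple way to quantify the overlap between two sets of predicted interactions is the Jaccard index over unordered protein pairs. The sketch below assumes hypothetical protein identifiers and prediction lists; the choice of Jaccard index is an illustration, not the measure used in the studies referred to above.

```python
def normalize(pairs):
    """Treat each interaction as an unordered pair so (A, B) equals (B, A)."""
    return {frozenset(pair) for pair in pairs}


def jaccard(pairs_a, pairs_b):
    """Fraction of distinct interactions that both prediction sets share."""
    a, b = normalize(pairs_a), normalize(pairs_b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


# Hypothetical predictions from two different methods.
method_one = [("P1", "P2"), ("P3", "P4"), ("P5", "P6")]
method_two = [("P2", "P1"), ("P7", "P8")]

print(jaccard(method_one, method_two))  # 0.25 (1 shared pair out of 4 distinct pairs)
```

A low value on real data would be consistent with the view that different methods capture different subsets of interactions, although it could also reflect differing error rates between the methods.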
A number of computational methods for predicting functional and/or physical relations between proteins have been developed in the last 4 years. These methods are strongly rooted in the traces left by evolution in the organization and composition of bacterial genomes (see Microbial genomes). They can be divided into three categories: methods that use only genome and sequence information to predict interaction partners; methods that use the information of protein complexes of known structure (see Fundamentals of protein structure and function, Large complexes by X-ray methods, and Large complexes and molecular machines by electron microscopy); and methods that use the modular composition of proteins from domains (see Classification of proteins into families, Pfam: the protein families database, and COGs) to predict the probability of interaction between the corresponding proteins.
Keywords: protein interaction; in silico predictions; methods comparison; bioinformatics; computational methods