Prediction of protein‐protein interactions based on surface patch comparison

A method to predict if two proteins interact, based on their three‐dimensional structures, is presented. It consists of five steps: (i) the surface of each protein, represented by the solvent accessible atoms, is divided into small patches; (ii) the shape of each patch is described by the atom distributions along its principal axes; (iii) the shape complementarity between two patches is estimated by comparing, through contingency table analysis, their atom distributions along their principal axes; (iv) given protein A, with nA surface patches, and protein B, with nB surface patches, nA×nB shape complementarity values are obtained; and (v) the distribution of the latter allows one to discriminate pairs of interacting and of noninteracting proteins. Only a few seconds are necessary to predict if two proteins interact, with accuracy close to 80%, sensitivity over 70% and specificity close to 50%.

[1]  Gary D Bader,et al.  Systematic identification of protein complexes in Saccharomyces cerevisiae by mass spectrometry , 2002, Nature.

[2]  David A. Gough,et al.  Predicting protein-protein interactions from primary structure , 2001, Bioinform..

[3]  Mark D'Souza,et al.  Use of contiguity on the chromosome to predict functional coupling , 1998, Silico Biol..

[4]  B. Schwikowski,et al.  A network of protein–protein interactions in yeast , 2000, Nature Biotechnology.

[5]  B. Snel,et al.  Comparative assessment of large-scale data sets of protein–protein interactions , 2002, Nature.

[6]  Paul Young,et al.  Molecular Basis for Cross-Linking of Actin Filaments: Structure of the α-Actinin Rod , 1999, Cell.

[7]  Arun K. Ramani,et al.  Exploiting the co-evolution of interacting proteins to discover interaction specificity. , 2003, Journal of molecular biology.

[8]  R. Overbeek,et al.  The use of gene clusters to infer functional coupling. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[9]  T. Gaasterland,et al.  Microbial genescapes: phyletic and functional patterns of ORF distribution among prokaryotes. , 1998, Microbial & comparative genomics.

[10]  Yoshihide Hayashizaki,et al.  Interaction generality, a measurement to assess the reliability of a protein-protein interaction. , 2002, Nucleic acids research.

[11]  C. Deane,et al.  Protein Interactions , 2002, Molecular & Cellular Proteomics.

[12]  B. Snel,et al.  Conservation of gene order: a fingerprint of proteins that physically interact. , 1998, Trends in biochemical sciences.

[13]  James R. Knight,et al.  A comprehensive analysis of protein–protein interactions in Saccharomyces cerevisiae , 2000, Nature.

[14]  M. L. Connolly Solvent-accessible surfaces of proteins and nucleic acids. , 1983, Science.

[15]  James E. Bray,et al.  The CATH database: an extended protein family resource for structural and functional genomics , 2003, Nucleic Acids Res..

[16]  Paul Nurse,et al.  Systems biology: Understanding cells , 2003, Nature.

[17]  G J Williams,et al.  The Protein Data Bank: a computer-based archival file for macromolecular structures. , 1977, Journal of molecular biology.

[18]  A Tramontano,et al.  PUZZLE: a new method for automated protein docking based on surface shape complementarity. , 1994, Journal of molecular biology.

[19]  Patrick Aloy,et al.  Interrogating protein interaction networks through structural biology , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[20]  Zhiping Weng,et al.  A protein–protein docking benchmark , 2003, Proteins.

[21]  J. G. Skellam,et al.  A New Method for determining the Type of Distribution of Plant Individuals , 1954 .

[22]  P. Legrain,et al.  Genome‐wide protein interaction maps using two‐hybrid systems , 2000, FEBS letters.

[23]  P E Bourne,et al.  The Protein Data Bank. , 2002, Nucleic acids research.

[24]  H. Herzel,et al.  Is there a bias in proteome research? , 2001, Genome research.

[25]  Anton J. Enright,et al.  Protein interaction maps for complete genomes based on gene fusion events , 1999, Nature.

[26]  Sophia Tsoka,et al.  Prediction of protein interactions: metabolic enzymes are frequently involved in gene fusion , 2000, Nature Genetics.

[27]  D. Eisenberg,et al.  Detecting protein function and protein-protein interactions from genome sequences. , 1999, Science.

[28]  M. Sternberg,et al.  An analysis of conformational changes on protein-protein association: implications for predictive docking. , 1999, Protein engineering.

[29]  R. Ozawa,et al.  A comprehensive two-hybrid analysis to explore the yeast protein interactome , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[30]  Oliviero Carugo,et al.  CX, an algorithm that identifies protruding atoms in proteins , 2002, Bioinform..

[31]  A. Valencia,et al.  Similarity of phylogenetic trees as indicator of protein-protein interaction. , 2001, Protein engineering.

[32]  P. Bork,et al.  Functional organization of the yeast proteome by systematic analysis of protein complexes , 2002, Nature.

[33]  A. Valencia,et al.  In silico two‐hybrid system for the selection of physically interacting protein pairs , 2002, Proteins.

[34]  Patrick Aloy,et al.  The third dimension for protein interactions and complexes. , 2002, Trends in biochemical sciences.

[35]  T. Ito,et al.  Toward a protein-protein interaction map of the budding yeast: A comprehensive system to examine two-hybrid interactions in all possible combinations between the yeast proteins. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[36]  S. Fields,et al.  Networking proteins in yeast , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[37]  Robert B. Russell,et al.  InterPreTS: protein Interaction Prediction through Tertiary Structure , 2003, Bioinform..

[38]  Pierre Legrain,et al.  Prediction, assessment and validation of protein interaction maps in bacteria. , 2002, Journal of molecular biology.

[39]  R. Russell,et al.  Potential artefacts in protein‐interaction networks , 2002, FEBS letters.

[40]  Florian Iragne,et al.  IPPRED: Server for Proteins Interactions Inference , 2003, Bioinform..

[41]  Dong Xu,et al.  Computational analyses of high-throughput protein-protein interaction data. , 2003, Current protein & peptide science.

[42]  F. Cohen,et al.  Co-evolution of proteins with their interaction partners. , 2000, Journal of molecular biology.

[43]  J. Wojcik,et al.  The protein–protein interaction map of Helicobacter pylori , 2001, Nature.

[44]  M. Vidal,et al.  Structural genomics: A pipeline for providing structures for the biologist , 2002, Protein science : a publication of the Protein Society.

[45]  T. Cox,et al.  A conditioned distance ratio method for analyzing spatial patterns , 1976 .