Homology Inference of Protein-Protein Interactions via Conserved Binding Sites

The coverage and reliability of protein-protein interactions determined by high-throughput experiments still needs to be improved, especially for higher organisms, therefore the question persists, how interactions can be verified and predicted by computational approaches using available data on protein structural complexes. Recently we developed an approach called IBIS (Inferred Biomolecular Interaction Server) to predict and annotate protein-protein binding sites and interaction partners, which is based on the assumption that the structural location and sequence patterns of protein-protein binding sites are conserved between close homologs. In this study first we confirmed high accuracy of our method and found that its accuracy depends critically on the usage of all available data on structures of homologous complexes, compared to the approaches where only a non-redundant set of complexes is employed. Second we showed that there exists a trade-off between specificity and sensitivity if we employ in the prediction only evolutionarily conserved binding site clusters or clusters supported by only one observation (singletons). Finally we addressed the question of identifying the biologically relevant interactions using the homology inference approach and demonstrated that a large majority of crystal packing interactions can be correctly identified and filtered by our algorithm. At the same time, about half of biological interfaces that are not present in the protein crystallographic asymmetric unit can be reconstructed by IBIS from homologous complexes without the prior knowledge of crystal parameters of the query protein.

[1]  K. Henrick,et al.  Inference of macromolecular assemblies from crystalline state. , 2007, Journal of molecular biology.

[2]  S. Ciurli,et al.  Subsite-specific structures and reactions in native and synthetic [4Fe-4S) cubane-type clusters , 2007 .

[3]  M. Vidal,et al.  Protein interaction mapping in C. elegans using proteins involved in vulval development. , 2000, Science.

[4]  Alexander Goncearenco,et al.  Thermophilic adaptation of protein complexes inferred from proteomic homology modeling. , 2010, Structure.

[5]  Huan-Xiang Zhou,et al.  Prediction of interface residues in protein–protein complexes by a consensus neural network method: Test against NMR data , 2005, Proteins.

[6]  John B. Anderson,et al.  MMDB: Entrez's 3D-structure database , 2002, Nucleic Acids Res..

[7]  M. Gerstein,et al.  Annotation transfer between genomes: protein-protein interologs and protein-DNA regulogs. , 2004, Genome research.

[8]  G. Roberts,et al.  Biological nitrogen fixation. , 1993, Annual review of nutrition.

[9]  M. Vidal,et al.  Identification of potential interaction networks using sequence-based searches for conserved protein-protein interactions or "interologs". , 2001, Genome research.

[10]  Raquel Norel,et al.  Protein interface conservation across structure space , 2010, Proceedings of the National Academy of Sciences.

[11]  Narmada Thanki,et al.  CDD: a Conserved Domain Database for the functional annotation of proteins , 2010, Nucleic Acids Res..

[12]  Benjamin A. Shoemaker,et al.  Evolution of protein binding modes in homooligomers. , 2010, Journal of molecular biology.

[13]  D. Rees,et al.  X-ray crystal structure of the nitrogenase molybdenum-iron protein from Clostridium pasteurianum at 3.0-A resolution. , 1993, Biochemistry.

[14]  S. Fairhurst,et al.  Exploring the reactivity of the isolated iron-molybdenum cofactor of nitrogenase , 1999 .

[15]  B. Burgess The Mechanism of Molybdenum Nitrogenase: An Overview , 2000 .

[16]  W. Bialek,et al.  Information-based clustering. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[17]  Vasant Honavar,et al.  HomPPI: a class of sequence homology based protein-protein interface prediction methods , 2011, BMC Bioinformatics.

[18]  Hongbo Zhu,et al.  NOXclass: prediction of protein-protein interaction types , 2006, BMC Bioinformatics.

[19]  A. Valencia,et al.  Prediction of protein--protein interaction sites in heterocomplexes with neural networks. , 2002, European journal of biochemistry.

[20]  Ruth Nussinov,et al.  Generation and analysis of a protein–protein interface data set with similar chemical and spatial patterns of interactions , 2005, Proteins.

[21]  Benjamin A. Shoemaker,et al.  Knowledge-based annotation of small molecule binding sites in proteins , 2010, BMC Bioinformatics.

[22]  Emily R Jefferson,et al.  Biological units and their effect upon the properties and prediction of protein-protein interactions. , 2006, Journal of molecular biology.

[23]  F. Pedrosa,et al.  Nitrogen Fixation: From Molecules to Crop Productivity , 2000, Current Plant Science and Biotechnology in Agriculture.

[24]  Song Liu,et al.  Protein binding site prediction using an empirical scoring function , 2006, Nucleic acids research.

[25]  J. Thornton,et al.  PQS: a protein quaternary structure file server. , 1998, Trends in biochemical sciences.

[26]  Ruth Nussinov,et al.  Protein-Protein Interfaces: Recognition of Similar Spatial and Chemical Organizations , 2004, WABI.

[27]  Stephen H. Bryant,et al.  CD-Search: protein domain annotations on the fly , 2004, Nucleic Acids Res..

[28]  S. Altschul,et al.  The estimation of statistical parameters for local alignment score distributions. , 2001, Nucleic acids research.

[29]  John B. Anderson,et al.  CDD: a Conserved Domain Database for protein classification , 2004, Nucleic Acids Res..

[30]  Benoit H. Dessailly,et al.  Detailed analysis of function divergence in a large and diverse domain superfamily: toward a refined protocol of function classification. , 2010, Structure.

[31]  Emmanuel D Levy,et al.  PiQSi: protein quaternary structure investigation. , 2007, Structure.

[32]  Yanli Wang,et al.  MMDB: Entrez's 3D-structure database , 2003, Nucleic Acids Res..

[33]  R. Russell,et al.  The relationship between sequence and interaction divergence in proteins. , 2003, Journal of molecular biology.

[34]  Benjamin A. Shoemaker,et al.  Inferred Biomolecular Interaction Server—a web server to analyze and predict protein interacting partners and binding sites , 2009, Nucleic Acids Res..

[35]  Robert Hoffmann,et al.  HomoMINT: an inferred human network based on orthology mapping of protein interactions discovered in model organisms , 2005, BMC Bioinformatics.

[36]  S. Henikoff,et al.  Amino acid substitution matrices from protein blocks. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[37]  R. Raz,et al.  ProMate: a structure based prediction program to identify the location of protein-protein binding sites. , 2004, Journal of molecular biology.

[38]  Benjamin A. Shoemaker,et al.  Deciphering Protein–Protein Interactions. Part II. Computational Methods to Predict Protein and Domain Interaction Partners , 2007, PLoS Comput. Biol..

[39]  D. C. Rees,et al.  Crystallographic structure and functional implications of the nitrogenase molybdenum–iron protein from Azotobacter vinelandii , 1992, Nature.

[40]  A. Panchenko,et al.  Mechanisms of protein oligomerization, the critical role of insertions and deletions in maintaining different oligomeric states , 2010, Proceedings of the National Academy of Sciences.

[41]  J F Gibrat,et al.  Surprising similarities in structure comparison. , 1996, Current opinion in structural biology.

[42]  Benjamin A. Shoemaker,et al.  Finding biologically relevant protein domain interactions: Conserved binding mode analysis , 2006, Protein science : a publication of the Protein Society.

[43]  Burkhard Rost,et al.  Protein–Protein Interactions More Conserved within Species than across Species , 2006, PLoS Comput. Biol..

[44]  Jonathan Lim,et al.  Ulysses - an application for the projection of molecular interactions across species , 2005, Genome Biology.

[45]  Hyeong Jun An,et al.  Estimating the size of the human interactome , 2008, Proceedings of the National Academy of Sciences.

[46]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[47]  Zoran Obradovic,et al.  Statistical analysis of interface similarity in crystals of homologous proteins. , 2008, Journal of molecular biology.

[48]  T. Ideker,et al.  Comprehensive curation and analysis of global interaction networks in Saccharomyces cerevisiae , 2006, Journal of biology.

[49]  A. D. McLachlan,et al.  Profile analysis: detection of distantly related proteins. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[50]  I. Ispolatov,et al.  Binding properties and evolution of homodimers in protein–protein interaction networks , 2005, Nucleic acids research.