Co-Occurring Atomic Contacts for the Characterization of Protein Binding Hot Spots

A binding hot spot is a small area at a protein-protein interface that can make significant contribution to binding free energy. This work investigates the substantial contribution made by some special co-occurring atomic contacts at a binding hot spot. A co-occurring atomic contact is a pair of atomic contacts that are close to each other with no more than three covalent-bond steps. We found that two kinds of co-occurring atomic contacts can play an important part in the accurate prediction of binding hot spot residues. One is the co-occurrence of two nearby hydrogen bonds. For example, mutations of any residue in a hydrogen bond network consisting of multiple co-occurring hydrogen bonds could disrupt the interaction considerably. The other kind of co-occurring atomic contact is the co-occurrence of a hydrophobic carbon contact and a contact between a hydrophobic carbon atom and a π ring. In fact, this co-occurrence signifies the collective effect of hydrophobic contacts. We also found that the B-factor measurements of several specific groups of amino acids are useful for the prediction of hot spots. Taking the B-factor, individual atomic contacts and the co-occurring contacts as features, we developed a new prediction method and thoroughly assessed its performance via cross-validation and independent dataset test. The results show that our method achieves higher prediction performance than well-known methods such as Robetta, FoldX and Hotpoint. We conclude that these contact descriptors, in particular the novel co-occurring atomic contacts, can be used to facilitate accurate and interpretable characterization of protein binding hot spots.

[1]  Burkhard Rost,et al.  Protein–Protein Interaction Hotspots Carved into Sequences , 2007, PLoS Comput. Biol..

[2]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[3]  Jinyan Li,et al.  Binding Affinity Prediction for Protein-Ligand Complexes Based on β Contacts and B Factor , 2013, J. Chem. Inf. Model..

[4]  D. Baker,et al.  A simple physical model for binding energy hot spots in protein–protein complexes , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[5]  Marianne Rooman,et al.  BeAtMuSiC: prediction of changes in protein–protein binding affinity on mutations , 2013, Nucleic Acids Res..

[6]  Jinyan Li,et al.  Accurate prediction of hot spot residues through physicochemical characteristics of amino acid sequences , 2013, Proteins.

[7]  Kurt S. Thorn,et al.  ASEdb: a database of alanine mutations and their effects on the free energy of binding in protein interactions , 2001, Bioinform..

[8]  Julie C. Mitchell,et al.  KFC Server: interactive forecasting of protein interaction hot spots , 2008, Nucleic Acids Res..

[9]  Peter A. Kollman,et al.  Computational alanine scanning of the 1:1 human growth hormone–receptor complex , 2002, J. Comput. Chem..

[10]  B. L. de Groot,et al.  Predicting free energy changes using structural ensembles. , 2009, Nature methods.

[11]  Zheng Yuan,et al.  Prediction of protein B‐factor profiles , 2005, Proteins.

[12]  R. Nussinov,et al.  Hot regions in protein--protein interactions: the organization and contribution of structurally conserved hot spot residues. , 2005, Journal of molecular biology.

[13]  Doheon Lee,et al.  A feature-based approach to modeling protein–protein interaction hot spots , 2009, Nucleic acids research.

[14]  Walter Birchmeier,et al.  Hot spots in β-catenin for interactions with LEF-1, conductin and APC , 2000, Nature Structural Biology.

[15]  S. Parthasarathy,et al.  Analysis of temperature factor distribution in high‐resolution protein structures , 1997, Protein science : a publication of the Protein Society.

[16]  Salam A. Assi,et al.  PCRPi: Presaging Critical Residues in Protein interfaces, a new computational tool to chart hot spots in protein interfaces , 2009, Nucleic acids research.

[17]  J. Wells,et al.  Systematic mutational analyses of protein-protein interfaces. , 1991, Methods in enzymology.

[18]  Xiang-Sun Zhang,et al.  Prediction of hot spots in protein interfaces using a random forest model with hybrid features. , 2012, Protein engineering, design & selection : PEDS.

[19]  Ozlem Keskin,et al.  HotSprint: database of computational hot spots in protein interfaces , 2007, Nucleic Acids Res..

[20]  Xing-Ming Zhao,et al.  APIS: accurate prediction of hot spots in protein interfaces by combining protrusion index with solvent accessibility , 2010, BMC Bioinformatics.

[21]  Solène Grosdidier,et al.  Expanding the frontiers of protein–protein modeling: From docking and scoring to binding affinity predictions and other challenges , 2013, Proteins.

[22]  Jinyan Li,et al.  Integrating water exclusion theory into βcontacts to predict binding free energy changes and binding hot spots , 2014, BMC Bioinformatics.

[23]  Ozlem Keskin,et al.  Identification of computational hot spots in protein interfaces: combining solvent accessibility and inter-residue potentials improves the accuracy , 2009, Bioinform..

[24]  Julie C. Mitchell,et al.  Community‐wide evaluation of methods for predicting the effect of mutations on protein–protein interactions , 2013, Proteins.

[25]  G. Folkers,et al.  Proteins feel more than they see: fine-tuning of binding affinity by properties of the non-interacting surface. , 2014, Journal of molecular biology.

[26]  H. Wolfson,et al.  Protein-Protein Interactions: Coupling of Structurally Conserved Residues and of Hot Spots across Interfaces. Implications for Docking , 2004 .

[27]  Juan Fernández-Recio,et al.  SKEMPI: a Structural Kinetic and Energetic database of Mutant Protein Interactions and its use in empirical models , 2012, Bioinform..

[28]  A. Bogan,et al.  Anatomy of hot spots in protein interfaces. , 1998, Journal of molecular biology.

[29]  Julie C. Mitchell,et al.  Data‐driven models for protein interaction and design , 2013, Proteins.

[30]  L. Serrano,et al.  Predicting changes in the stability of proteins and protein complexes: a study of more than 1000 mutations. , 2002, Journal of molecular biology.

[31]  Steven C. H. Hoi,et al.  Beta Atomic Contacts: Identifying Critical Specific Contacts in Protein Binding Interfaces , 2013, PloS one.

[32]  D. Bailey,et al.  The Binding Interface Database (BID): A Compilation of Amino Acid Hot Spots in Protein Interfaces , 2003, Bioinform..

[33]  T. Clackson,et al.  A hot spot of binding energy in a hormone-receptor interface , 1995, Science.

[34]  Zheng Yuan,et al.  Flexibility analysis of enzyme active sites by crystallographic temperature factors. , 2003, Protein engineering.

[35]  Shuigeng Zhou,et al.  Boosting Prediction Performance of Protein-Protein Interaction Hot Spots by Using Structural Neighborhood Properties - (Extended Abstract) , 2013, RECOMB.

[36]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[37]  Ramanathan Sowdhamini,et al.  ECMIS: computational approach for the identification of hotspots at protein-protein interfaces , 2014, BMC Bioinformatics.

[38]  David E. Kim,et al.  Computational Alanine Scanning of Protein-Protein Interfaces , 2004, Science's STKE.

[39]  François Stricher,et al.  The FoldX web server: an online force field , 2005, Nucleic Acids Res..

[40]  Ozlem Keskin,et al.  HotPoint: hot spot prediction server for protein interfaces , 2010, Nucleic Acids Res..