A new scoring function for protein-protein docking that identifies native structures with unprecedented accuracy.

Protein-protein (P-P) 3D structures are fundamental to structural biology and drug discovery. However, most of them have never been determined. Many docking algorithms were developed for that purpose, but they have a very limited accuracy in generating native-like structures and identifying the most correct one, in particular when a single answer is asked for. With such a low success rate it is difficult to point out one docked structure as being native-like. Here we present a new, high accuracy, scoring method to identify the 3D structure of P-P complexes among a set of trial poses. It incorporates alanine scanning mutagenesis experimental data that need to be obtained a priori. The scoring scheme works by matching the computational and the experimental alanine scanning mutagenesis results. The size of the trial P-P interface area is also taken into account. We show that the method ranks the trial structures and identifies the native-like structures with unprecedented accuracy (∼94%), providing the correct P-P 3D structures that biochemists and molecular biologists need to pursue their studies. With such a success rate, the bottleneck of protein-protein docking moves from the scoring to searching algorithms.

[1]  Peter A. Kollman,et al.  Computational alanine scanning of the 1:1 human growth hormone–receptor complex , 2002, J. Comput. Chem..

[2]  K. Sharp,et al.  Accurate Calculation of Hydration Free Energies Using Macroscopic Solvent Models , 1994 .

[3]  G. Folkers,et al.  Proteins feel more than they see: fine-tuning of binding affinity by properties of the non-interacting surface. , 2014, Journal of molecular biology.

[4]  W. Marsden I and J , 2012 .

[5]  R. Poljak,et al.  Crystal structure of an Fv-Fv idiotope-anti-idiotope complex at 1.9 A resolution. , 1996, Journal of molecular biology.

[6]  Meena Kishore Sakharkar,et al.  Identification of hot spot residues at protein-protein interface , 2006, Bioinformation.

[7]  M. L. Connolly Analytical molecular surface calculation , 1983 .

[8]  B. Keyt,et al.  Identification of Vascular Endothelial Growth Factor Determinants for Binding KDR and FLT-1 Receptors , 1996, The Journal of Biological Chemistry.

[9]  Irina S. Moreira,et al.  CompASM: an Amber-VMD alanine scanning mutagenesis plug-in , 2012, Theoretical Chemistry Accounts.

[10]  Alfonso Hernández-Laguna,et al.  A comparative DFT study of the Schiff base formation from acetaldehyde and butylamine, glycine and phosphatidylethanolamine , 2012, Theoretical Chemistry Accounts.

[11]  Chi-Ren Shyu,et al.  Feature‐based classification of native and non‐native protein–protein interactions: Comparing supervised and semi‐supervised learning approaches , 2011, Proteomics.

[12]  Irina S. Moreira,et al.  Unravelling Hot Spots: a comprehensive computational mutagenesis study , 2006 .

[13]  Zhiping Weng,et al.  Accelerating Protein Docking in ZDOCK Using an Advanced 3D Convolution Library , 2011, PloS one.

[14]  David W. Ritchie,et al.  Ultra-fast FFT protein docking on graphics processors , 2010, Bioinform..

[15]  Sandor Vajda,et al.  ClusPro: an automated docking and discrimination method for the prediction of protein complexes , 2004, Bioinform..

[16]  Y. Wang,et al.  A mixed-charge pair in human interleukin 4 dominates high-affinity interaction with the receptor α chain , 1997 .

[17]  A. Bonvin,et al.  The HADDOCK web server for data-driven biomolecular docking , 2010, Nature Protocols.

[18]  Jean-Christophe Nebel,et al.  Scoring docking conformations using predicted protein interfaces , 2014, BMC Bioinformatics.

[19]  Andrey Tovchigrechko,et al.  GRAMM-X public web server for protein–protein docking , 2006, Nucleic Acids Res..

[20]  R Shapiro,et al.  Site-specific mutagenesis reveals differences in the structural bases for tight binding of RNase inhibitor to angiogenin and RNase A. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[21]  Pedro A Fernandes,et al.  Backbone Importance for Protein-Protein Binding. , 2007, Journal of chemical theory and computation.

[22]  Tammy M. K. Cheng,et al.  pyDock: Electrostatics and desolvation for effective scoring of rigid‐body protein–protein docking , 2007, Proteins.

[23]  Pedro Alexandrino Fernandes,et al.  Protein–protein docking dealing with the unknown , 2009, J. Comput. Chem..

[24]  G J Kleywegt,et al.  Crystal structure of the C2 fragment of streptococcal protein G in complex with the Fc domain of human IgG. , 1995, Structure.

[25]  Pedro A Fernandes,et al.  Hot spots—A review of the protein–protein interface determinant amino‐acid residues , 2007, Proteins.

[26]  J Deisenhofer,et al.  Mechanism of ribonuclease inhibition by ribonuclease inhibitor protein based on the crystal structure of its complex with ribonuclease A. , 1996, Journal of molecular biology.

[27]  Doheon Lee,et al.  A feature-based approach to modeling protein–protein interaction hot spots , 2009, Nucleic acids research.

[28]  Marc F Lensink,et al.  Docking, scoring, and affinity prediction in CAPRI , 2013, Proteins.

[29]  David Ryan Koes,et al.  Small-molecule inhibitor starting points learned from protein–protein interaction inhibitor structure , 2011, Bioinform..

[30]  Kevin Barraclough,et al.  I and i , 2001, BMJ : British Medical Journal.

[31]  Alexandre M J J Bonvin,et al.  Are scoring functions in protein-protein docking ready to predict interactomes? Clues from a novel binding affinity benchmark. , 2010, Journal of proteome research.

[32]  J. Deisenhofer,et al.  A structural basis of the interactions between leucine-rich repeats and protein ligands , 1995, Nature.

[33]  E. Goldman,et al.  Analysis of binding interactions in an idiotope-antiidiotope protein-protein complex by double mutant cycles. , 1997, Biochemistry.

[34]  C. Kleanthous,et al.  Specificity in protein-protein recognition: conserved Im9 residues are the major determinants of stability in the colicin E9 DNase-Im9 complex. , 1998, Biochemistry.

[35]  M. Champe,et al.  Solution structure of the VEGF-binding domain of Flt-1: comparison of its free and bound states. , 1999, Journal of molecular biology.

[36]  Ruth Nussinov,et al.  FireDock: Fast interaction refinement in molecular docking , 2007, Proteins.

[37]  E. Goldman,et al.  A mutational analysis of the binding of two different proteins to the same antibody. , 1996, Biochemistry.

[38]  T. Clackson,et al.  A hot spot of binding energy in a hormone-receptor interface , 1995, Science.

[39]  Michael W Parker,et al.  Model for growth hormone receptor activation based on subunit rotation within a receptor dimer , 2005, Nature Structural &Molecular Biology.

[40]  Irina S. Moreira,et al.  Protein–protein recognition: a computational mutagenesis study of the MDM2–P53 complex , 2008 .

[41]  Irina S. Moreira,et al.  Accuracy of the numerical solution of the Poisson–Boltzmann equation , 2005 .

[42]  Solène Grosdidier,et al.  Identification of hot-spot residues in protein-protein interactions by computational docking , 2008, BMC Bioinformatics.

[43]  Huan-Xiang Zhou,et al.  Prediction of interface residues in protein–protein complexes by a consensus neural network method: Test against NMR data , 2005, Proteins.

[44]  Marta A. S. Perez,et al.  Computational Alanine Scanning Mutagenesis: MM-PBSA vs TI. , 2013, Journal of chemical theory and computation.

[45]  A G Murzin,et al.  SCOP: a structural classification of proteins database for the investigation of sequences and structures. , 1995, Journal of molecular biology.

[46]  P A Fernandes,et al.  Detailed microscopic study of the full zipA:FtsZ interface , 2006, Proteins.

[47]  G Schreiber,et al.  Interaction of barnase with its polypeptide inhibitor barstar studied by protein engineering. , 1993, Biochemistry.

[48]  R. Hartley,et al.  Refinement and structural analysis of barnase at 1.5 A resolution. , 1999, Acta crystallographica. Section D, Biological crystallography.

[49]  S. Wodak,et al.  Docking and scoring protein complexes: CAPRI 3rd Edition , 2007, Proteins.

[50]  David Ritchie,et al.  High-order analytic translation matrix elements for real-space six-dimensional polar Fourier correlations , 2005 .

[51]  R Shapiro,et al.  Analysis of the interactions of human ribonuclease inhibitor with angiogenin and ribonuclease A by mutagenesis: importance of inhibitor residues inside versus outside the C-terminal "hot spot". , 2000, Journal of molecular biology.

[52]  Mieczyslaw Torchala,et al.  SwarmDock: a server for flexible protein-protein docking , 2013, Bioinform..

[53]  Emil Alexov,et al.  Rapid grid‐based construction of the molecular surface and the use of induced surface charge to calculate reaction field energies: Applications to the molecular systems and geometric objects , 2002, J. Comput. Chem..

[54]  Yan Zhang,et al.  The bacterial cell‐division protein ZipA and its interaction with an FtsZ fragment revealed by X‐ray crystallography , 2001, The EMBO journal.

[55]  G. Winter,et al.  The contribution of contact and non-contact residues of antibody in the affinity of binding to antigen. The interaction of mutant D1.3 antibodies with lysozyme. , 1993, Journal of molecular biology.

[56]  Jinyan Li,et al.  Accurate prediction of hot spot residues through physicochemical characteristics of amino acid sequences , 2013, Proteins.

[57]  Ozlem Keskin,et al.  HotSprint: database of computational hot spots in protein interfaces , 2007, Nucleic Acids Res..

[58]  Arvind Rajpal,et al.  Quantitative evaluation of the chicken lysozyme epitope in the HyHEL‐10 fab complex: Free energies and kinetics , 1998, Protein science : a publication of the Protein Society.

[59]  Seren Soner,et al.  Hot Spots in a Network of Functional Sites , 2013, PloS one.

[60]  D. Ritchie,et al.  Evaluation of Protein Docking Predictions Using Hex 3.1 in CAPRI Rounds 1{2 , 2002 .

[61]  A. Gustchina,et al.  Crystal structure of human recombinant interleukin‐4 at 2.25 Å resolution , 1992, FEBS letters.

[62]  C. Dominguez,et al.  HADDOCK: a protein-protein docking approach based on biochemical or biophysical information. , 2003, Journal of the American Chemical Society.

[63]  Bin Xu,et al.  A semi-supervised boosting SVM for predicting hot spots at protein-protein Interfaces , 2012, BMC Systems Biology.

[64]  J. Wells,et al.  A systematic mutational analysis of hormone-binding determinants in the human growth hormone receptor. , 1991, Proceedings of the National Academy of Sciences of the United States of America.

[65]  K. Ramnarayan,et al.  Glu-96 of Basic Fibroblast Growth Factor Is Essential for High Affinity Receptor Binding , 1995, The Journal of Biological Chemistry.

[66]  T. Bhat,et al.  Bound water molecules and conformational stabilization help mediate an antigen-antibody association. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[67]  L Wang,et al.  Molecular dynamics and free-energy calculations applied to affinity maturation in antibody 48G7. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[68]  Xiang-Sun Zhang,et al.  Prediction of hot spots in protein interfaces using a random forest model with hybrid features. , 2012, Protein engineering, design & selection : PEDS.

[69]  Z. Weng,et al.  ZDOCK: An initial‐stage protein‐docking algorithm , 2003, Proteins.

[70]  Charles Eigenbrot,et al.  Crystal Structure at 1.7 Å Resolution of VEGF in Complex with Domain 2 of the Flt-1 Receptor , 1997, Cell.

[71]  Arno G. Stefani,et al.  Application of information theory to feature selection in protein docking , 2012, Journal of Molecular Modeling.

[72]  J. Martins,et al.  Solvent‐accessible surface area: How well can be applied to hot‐spot detection? , 2014, Proteins.

[73]  Lazaros Mavridis,et al.  HexServer: an FFT-based protein docking server powered by graphics processors , 2010, Nucleic Acids Res..

[74]  Salam A. Assi,et al.  Presaging Critical Residues in Protein interfaces-Web Server (PCRPi-W): A Web Server to Chart Hot Spots in Protein Interfaces , 2010, PloS one.

[75]  G. Schneider,et al.  Context-based identification of protein-protein interfaces and "hot-spot" residues. , 2011, Chemistry & biology.

[76]  Pedro Alexandrino Fernandes,et al.  Are hot-spots occluded from water? , 2014, Journal of biomolecular structure & dynamics.

[77]  Carla Mattos,et al.  Multiple solvent crystal structures of ribonuclease A: An assessment of the method , 2009, Proteins.

[78]  Ozlem Keskin,et al.  HotPoint: hot spot prediction server for protein interfaces , 2010, Nucleic Acids Res..

[79]  R. Huber,et al.  Structural analysis of human IgG-Fc glycoforms reveals a correlation between glycosylation and structural integrity. , 2003, Journal of molecular biology.

[80]  J. Janin Assessing predictions of protein–protein interaction: The CAPRI experiment , 2005, Protein science : a publication of the Protein Society.

[81]  Z. Weng,et al.  Integrating statistical pair potentials into protein complex prediction , 2007, Proteins.

[82]  Alexandre M J J Bonvin,et al.  Data‐driven docking for the study of biomolecular complexes , 2005, The FEBS journal.

[83]  David W Ritchie,et al.  Recent progress and future directions in protein-protein docking. , 2008, Current protein & peptide science.

[84]  Richard T. Bradshaw,et al.  Comparing experimental and computational alanine scanning techniques for probing a prototypical protein-protein interaction. , 2011, Protein engineering, design & selection : PEDS.

[85]  B. Keyt,et al.  The crystal structure of vascular endothelial growth factor (VEGF) refined to 1.93 A resolution: multiple copy flexibility and receptor binding. , 1997, Structure.

[86]  Marc F Lensink,et al.  Docking and scoring protein interactions: CAPRI 2009 , 2010, Proteins.

[87]  Julie C. Mitchell,et al.  KFC2: A knowledge‐based hot spot prediction method based on interface solvation, atomic density, and plasticity features , 2011, Proteins.

[88]  Stephen R. Comeau,et al.  PIPER: An FFT‐based protein docking program with pairwise potentials , 2006, Proteins.

[89]  G Schreiber,et al.  Energetics of protein-protein interactions: analysis of the barnase-barstar interface by single mutations and double mutant cycles. , 1995, Journal of molecular biology.

[90]  Brian W. Matthews,et al.  Refinement of the structure of human basic fibroblast growth factor at 1.6 Å resolution and analysis of presumed heparin binding sites by selenate substitution , 1993, Protein science : a publication of the Protein Society.

[91]  P. Reinemer,et al.  Crystal Structure of the Interleukin-4/Receptor α Chain Complex Reveals a Mosaic Binding Interface , 1999, Cell.

[92]  P. Kollman,et al.  Calculating structures and free energies of complex molecules: combining molecular mechanics and continuum models. , 2000, Accounts of chemical research.

[93]  A. Pommer,et al.  Specificity in protein-protein interactions: the structural basis for dual recognition in endonuclease colicin-immunity protein complexes. , 2000, Journal of molecular biology.

[94]  Xing-Ming Zhao,et al.  APIS: accurate prediction of hot spots in protein interfaces by combining protrusion index with solvent accessibility , 2010, BMC Bioinformatics.

[95]  David W Ritchie,et al.  Docking essential dynamics eigenstructures , 2005, Proteins.

[96]  Pedro A Fernandes,et al.  Hot spot occlusion from bulk water: a comprehensive study of the complex between the lysozyme HEL and the antibody FVD1.3. , 2007, The journal of physical chemistry. B.

[97]  Graham J. L. Kemp,et al.  Fast computation, rotation, and comparison of low resolution spherical harmonic molecular surfaces , 1999, J. Comput. Chem..

[98]  Pedro Alexandrino Fernandes,et al.  Computational alanine scanning mutagenesis—An improved methodological approach , 2007, J. Comput. Chem..

[99]  J. Schlessinger,et al.  Crystal structure of a ternary FGF-FGFR-heparin complex reveals a dual role for heparin in FGFR binding and dimerization. , 2000, Molecular cell.

[100]  K Watanabe,et al.  Role of Tyr Residues in the Contact Region of Anti-lysozyme Monoclonal Antibody HyHEL10 for Antigen Binding (*) , 1995, The Journal of Biological Chemistry.

[101]  P. Bates,et al.  SwarmDock and the Use of Normal Modes in Protein-Protein Docking , 2010, International journal of molecular sciences.

[102]  Shuigeng Zhou,et al.  Boosting Prediction Performance of Protein-Protein Interaction Hot Spots by Using Structural Neighborhood Properties - (Extended Abstract) , 2013, RECOMB.

[103]  A. Fersht,et al.  Protein-protein recognition: crystal structural analysis of a barnase-barstar complex at 2.0-A resolution. , 1994, Biochemistry.

[104]  Stephen R Comeau,et al.  Achieving reliability and high accuracy in automated protein docking: Cluspro, PIPER, SDU, and stability analysis in CAPRI rounds 13–19 , 2010, Proteins.

[105]  Jinyan Li,et al.  Integrating water exclusion theory into βcontacts to predict binding free energy changes and binding hot spots , 2013, BMC Bioinformatics.

[106]  Ruth Nussinov,et al.  FireDock: a web server for fast interaction refinement in molecular docking† , 2008, Nucleic Acids Res..

[107]  Kouhei Tsumoto,et al.  Critical contribution of VH–VL interaction to reshaping of an antibody: The case of humanization of anti‐lysozyme antibody, HyHEL‐10 , 2008, Protein science : a publication of the Protein Society.

[108]  John B. Shoven,et al.  I , Edinburgh Medical and Surgical Journal.

[109]  T. Clackson,et al.  Structural and functional analysis of the 1:1 growth hormone:receptor complex reveals the molecular basis for receptor affinity. , 1998, Journal of molecular biology.

[110]  R. Stephenson A and V , 1962, The British journal of ophthalmology.

[111]  Ilya A Vakser,et al.  Development and testing of an automated approach to protein docking , 2005, Proteins.

[112]  Pinak Chakrabarti,et al.  PRICE (PRotein Interface Conservation and Energetics): a server for the analysis of protein–protein interfaces , 2011, Journal of Structural and Functional Genomics.

[113]  I. Moreira,et al.  Unraveling the importance of protein-protein interaction: application of a computational alanine-scanning mutagenesis to the study of the IgG1 streptococcal protein G (C2 fragment) complex. , 2006, The journal of physical chemistry. B.

[114]  J. Wells,et al.  Comparison of a structural and a functional epitope. , 1993, Journal of molecular biology.

[115]  David W. Ritchie,et al.  Accelerating and focusing protein-protein docking correlations using multi-dimensional rotational FFT generating functions , 2008, Bioinform..

[116]  H W Hellinga,et al.  Dissection of the protein G B1 domain binding site for human IgG Fc fragment , 1999, Protein science : a publication of the Protein Society.

[117]  R. Varadarajan,et al.  Discrepancies between the NMR and X-ray structures of uncomplexed barstar: analysis suggests that packing densities of protein structures determined by NMR are unreliable. , 1998, Biochemistry.

[118]  Ruth Nussinov,et al.  Taking geometry to its edge: Fast unbound rigid (and hinge‐bent) docking , 2003, Proteins.

[119]  Xiaofan Li,et al.  Detection and refinement of encounter complexes for protein–protein docking: Taking account of macromolecular crowding , 2010, Proteins.

[120]  P A Fernandes,et al.  Understanding the importance of the aromatic amino-acid residues as hot-spots. , 2013, Biochimica et biophysica acta.

[121]  M. Pantoliano,et al.  Identification and concerted function of two receptor binding surfaces on basic fibroblast growth factor required for mitogenesis. , 1994, The Journal of biological chemistry.

[122]  Zbigniew Dauter,et al.  Triclinic lysozyme at 0.65 A resolution. , 2007, Acta crystallographica. Section D, Biological crystallography.

[123]  Alexandre M J J Bonvin,et al.  HADDOCK versus HADDOCK: New features and performance of HADDOCK2.0 on the CAPRI targets , 2007, Proteins.

[124]  Burkhard Rost,et al.  Protein–Protein Interaction Hotspots Carved into Sequences , 2007, PLoS Comput. Biol..

[125]  Irina S. Moreira,et al.  Hot spot computational identification: Application to the complex formed between the hen egg white lysozyme (HEL) and the antibody HyHEL-10† , 2007 .

[126]  K Schulten,et al.  VMD: visual molecular dynamics. , 1996, Journal of molecular graphics.

[127]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[128]  Barry Honig,et al.  Extending the Applicability of the Nonlinear Poisson−Boltzmann Equation: Multiple Dielectric Constants and Multivalent Ions† , 2001 .