Physicochemical and residue conservation calculations to improve the ranking of protein–protein docking solutions

Many protein–protein docking algorithms generate numerous possible complex structures with only a few of them resembling the native structure. The major challenge is choosing the near‐native structures from the generated set. Recently it has been observed that the density of conserved residue positions is higher at the interface regions of interacting protein surfaces, except for antibody–antigen complexes, where a very low number of conserved positions is observed at the interface regions. In the present study we have used this observation to identify putative interacting regions on the surface of interacting partners. We studied 59 protein complexes, used previously as a benchmark data set for docking investigations. We computed conservation indices of residue positions on the surfaces of interacting proteins using available homologous sequences and used this information to filter out from 56% to 86% of generated docked models, retaining near‐native structures for further evaluation. We used a reverse filter of conservation score to filter out the majority of nonnative antigen–antibody complex structures. For each docked model in the filtered subsets, we relaxed the conformation of the side chains by minimizing the energy with CHARMM, and then calculated the binding free energy using a generalized Born method and solvent‐accessible surface area calculations. Using the free energy along with conservation information and other descriptors used in the literature for ranking docking solutions, such as shape complementarity and pair potentials, we developed a global ranking procedure that significantly improves the docking results by giving top ranks to near‐native complex structures.

[1]  O. Lichtarge,et al.  Evolutionary predictions of binding surfaces and interactions. , 2002, Current opinion in structural biology.

[2]  S Vajda,et al.  Kinetics of desolvation-mediated protein-protein binding. , 2000, Biophysical journal.

[3]  Tal Pupko,et al.  Structural Genomics , 2005 .

[4]  Gerhard Wagner,et al.  TreeDock: a tool for protein docking based on minimizing van der Waals energies. , 2002, Journal of the American Chemical Society.

[5]  M J Sternberg,et al.  Predictive docking of protein-protein and protein-DNA complexes. , 1998, Current opinion in structural biology.

[6]  Gregory D. Hawkins,et al.  Parametrized Models of Aqueous Free Energies of Solvation Based on Pairwise Descreening of Solute Atomic Charges from a Dielectric Medium , 1996 .

[7]  Zhiping Weng,et al.  A protein–protein docking benchmark , 2003, Proteins.

[8]  M J Sternberg,et al.  Use of pair potentials across protein interfaces in screening predicted docked complexes , 1999, Proteins.

[9]  Z. Weng,et al.  ZDOCK: An initial‐stage protein‐docking algorithm , 2003, Proteins.

[10]  Daniel R. Caffrey,et al.  Are protein–protein interfaces more conserved in sequence than the rest of the protein surface? , 2004, Protein science : a publication of the Protein Society.

[11]  D. Ritchie,et al.  Protein docking using spherical polar Fourier correlations , 2000, Proteins.

[12]  Sandor Vajda,et al.  Computational mapping identifies the binding sites of organic solvents on proteins , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[13]  E. Katchalski‐Katzir,et al.  Molecular surface recognition: determination of geometric fit between proteins and their ligands by correlation techniques. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[14]  L. Krippahl,et al.  BiGGER: A new (soft) docking algorithm for predicting protein interactions , 2000, Proteins.

[15]  J. Janin,et al.  Protein-protein recognition. , 1995, Progress in biophysics and molecular biology.

[16]  Surjit B. Dixit,et al.  Free Energy Analysis of ProteinDNA Binding: The EcoRI EndonucleaseDNA Complex , 1999 .

[17]  Vasant Honavar,et al.  A two-stage classifier for identification of protein-protein interface residues , 2004, ISMB/ECCB.

[18]  A Tramontano,et al.  PUZZLE: a new method for automated protein docking based on surface shape complementarity. , 1994, Journal of molecular biology.

[19]  Jeffrey J. Gray,et al.  Protein-protein docking with simultaneous optimization of rigid-body displacement and side-chain conformations. , 2003, Journal of molecular biology.

[20]  Gideon Schreiber,et al.  A novel method for scoring of docked protein complexes using predicted protein-protein binding sites. , 2004, Protein engineering, design & selection : PEDS.

[21]  J. Thornton,et al.  Protein–protein interfaces: Analysis of amino acid conservation in homodimers , 2001, Proteins.

[22]  Hui Zhang,et al.  A new computational model for protein folding based on atomic solvation , 1995, Protein science : a publication of the Protein Society.

[23]  O. Lichtarge,et al.  A family of evolution-entropy hybrid methods for ranking protein residues by importance. , 2004, Journal of molecular biology.

[24]  Olivier Lichtarge,et al.  Accurate and scalable identification of functional sites by evolutionary tracing , 2004, Journal of Structural and Functional Genomics.

[25]  Todd J. A. Ewing,et al.  DOCK 4.0: Search strategies for automated molecular docking of flexible molecule databases , 2001, J. Comput. Aided Mol. Des..

[26]  Bhyravabhotla Jayaram,et al.  Solvation Free Energy of Biomacromolecules: Parameters for a Modified Generalized Born Model Consistent with the AMBER Force Field , 1998 .

[27]  Peter Willett,et al.  GAPDOCK: A genetic algorithm approach to protein docking in CAPRI round 1 , 2003, Proteins.

[28]  Jinn-Moon Yang,et al.  GEMDOCK: A generic evolutionary method for molecular docking , 2004, Proteins.

[29]  Paul D. Minton,et al.  Statistics: The Exploration and Analysis of Data , 2002, Technometrics.

[30]  S. Jones,et al.  Prediction of protein-protein interaction sites using patch analysis. , 1997, Journal of molecular biology.

[31]  A J Olson,et al.  Morphology of protein-protein interfaces. , 1998, Structure.

[32]  M. Karplus,et al.  CHARMM: A program for macromolecular energy, minimization, and dynamics calculations , 1983 .

[33]  G. Gonnet,et al.  Exhaustive matching of the entire protein sequence database. , 1992, Science.

[34]  Sandor Vajda,et al.  Protein-protein association kinetics and protein docking. , 2002, Current Opinion in Structural Biology.

[35]  M. Sternberg,et al.  Modelling protein docking using shape complementarity, electrostatics and biochemical information. , 1997, Journal of molecular biology.

[36]  C. Dominguez,et al.  HADDOCK: a protein-protein docking approach based on biochemical or biophysical information. , 2003, Journal of the American Chemical Society.

[37]  Frederic M. Richards,et al.  Packing of α-helices: Geometrical constraints and contact areas☆ , 1978 .

[38]  Didier Rognan,et al.  ConsDock: A new program for the consensus analysis of protein–ligand interactions , 2002, Proteins.

[39]  I D Kuntz,et al.  Predicting the structure of protein complexes: a step in the right direction. , 1996, Chemistry & biology.

[40]  C. Chothia,et al.  The atomic structure of protein-protein recognition sites. , 1999, Journal of molecular biology.

[41]  M. Sternberg,et al.  Prediction of protein-protein interactions by docking methods. , 2002, Current opinion in structural biology.

[42]  S. Vajda,et al.  Scoring docked conformations generated by rigid‐body protein‐protein docking , 2000, Proteins.

[43]  Ruth Nussinov,et al.  Principles of docking: An overview of search algorithms and a guide to scoring functions , 2002, Proteins.

[44]  Sarah A. Teichmann,et al.  Principles of protein-protein interactions , 2002, ECCB.

[45]  Li Li,et al.  RDOCK: Refinement of rigid‐body protein docking predictions , 2003, Proteins.

[46]  L. T. Ten Eyck,et al.  Protein docking using continuum electrostatics and geometric fit. , 2001, Protein engineering.

[47]  David S. Goodsell,et al.  Automated docking using a Lamarckian genetic algorithm and an empirical binding free energy function , 1998, J. Comput. Chem..

[48]  Miriam Eisenstein,et al.  Weighted geometric docking: Incorporating external information in the rotation‐translation scan , 2003, Proteins.

[49]  R. M. Burnett,et al.  DARWIN: A program for docking flexible molecules , 2000, Proteins.

[50]  Bhyravabhotla Jayaram,et al.  Free Energy Analysis of Protein-DNA Binding , 1999 .

[51]  T. Blundell,et al.  Definition of general topological equivalence in protein structures. A procedure involving comparison of properties and relationships through simulated annealing and dynamic programming. , 1990, Journal of molecular biology.

[52]  J. Janin,et al.  Protein docking algorithms: simulating molecular recognition , 1993 .