Computational sidechain placement and protein mutagenesis with implicit solvent models

Structure prediction and computational protein design should benefit from accurate solvent models. We have applied implicit solvent models to two problems that are central to this area. First, we performed sidechain placement for 29 proteins, using a solvent model that combines a screened Coulomb term with an Accessible Surface Area term (CASA model). With optimized parameters, the prediction quality is comparable with earlier work that omitted electrostatics and solvation altogether. Second, we computed the stability changes associated with point mutations involving ionized sidechains. For over 1000 mutations, including many fully or partly buried positions, we compared CASA and two generalized Born models (GB) with a more accurate model, which solves the Poisson equation of continuum electrostatics numerically. CASA predicts the correct sign and order of magnitude of the stability change for 81% of the mutations, compared to 97% with the best GB. We also considered 140 mutations for which experimental data are available. Comparing to experiment requires additional assumptions about the unfolded protein structure, protein relaxation in response to the mutations, and contributions from the hydrophobic effect. With a simple, commonly‐used unfolded state model, the mean unsigned error is 2.1 kcal/mol with both CASA and the best GB. Overall, the electrostatic model is not important for sidechain placement; CASA and GB are equivalent for surface mutations, while GB is far superior for fully or partly buried positions. Thus, for problems like protein design that involve all these aspects, the most recent GB models represent an important step forward. Along with the recent discovery of efficient, pairwise implementations of GB, this will open new possibilities for the computational engineering of proteins. Proteins 2007. © 2007 Wiley‐Liss, Inc.

[1]  Homme W Hellinga,et al.  An empirical model for electrostatic interactions in proteins incorporating multiple geometry‐dependent dielectric constants , 2003, Proteins.

[2]  R. Goldstein Efficient rotamer elimination applied to protein side-chains and related spin glasses. , 1994, Biophysical journal.

[3]  H. Scheraga,et al.  Accessible surface areas as a measure of the thermodynamic parameters of hydration of peptides. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[4]  S. Subbiah,et al.  Prediction of protein side-chain conformation by packing optimization. , 1991, Journal of molecular biology.

[5]  W. C. Still,et al.  Semianalytical treatment of solvation for molecular mechanics and dynamics , 1990 .

[6]  L. Serrano,et al.  Predicting changes in the stability of proteins and protein complexes: a study of more than 1000 mutations. , 2002, Journal of molecular biology.

[7]  L. R. Scott,et al.  Electrostatics and diffusion of molecules in solution: simulations with the University of Houston Brownian dynamics program , 1995 .

[8]  B. Lee,et al.  The interpretation of protein structures: estimation of static accessibility. , 1971, Journal of molecular biology.

[9]  W. V. van Gunsteren,et al.  An efficient mean solvation force model for use in molecular dynamics simulations of proteins in aqueous solution. , 1996, Journal of molecular biology.

[10]  Thomas Simonson,et al.  Free energy simulations come of age: protein-ligand recognition. , 2002, Accounts of chemical research.

[11]  D. Eisenberg,et al.  Atomic solvation parameters applied to molecular dynamics of proteins in solution , 1992, Protein science : a publication of the Protein Society.

[12]  P. Kollman,et al.  A Second Generation Force Field for the Simulation of Proteins, Nucleic Acids, and Organic Molecules , 1995 .

[13]  Thomas Simonson,et al.  A residue-pairwise generalized born scheme suitable for protein design calculations. , 2005, The journal of physical chemistry. B.

[14]  T. Simonson,et al.  Proton binding to proteins: a free-energy component analysis using a dielectric continuum model. , 2005, Biophysical journal.

[15]  M. Karplus,et al.  pKa's of ionizable groups in proteins: atomic detail from a continuum electrostatic model. , 1990, Biochemistry.

[16]  M. Karplus,et al.  A Comprehensive Analytical Treatment of Continuum Electrostatics , 1996 .

[17]  P. Koehl,et al.  Application of a self-consistent mean field theory to predict protein side-chains conformation and estimate their conformational entropy. , 1994, Journal of molecular biology.

[18]  A. D. McLachlan,et al.  Solvation energy in protein folding and binding , 1986, Nature.

[19]  R. Levy,et al.  Enthalpy−Entropy and Cavity Decomposition of Alkane Hydration Free Energies: Numerical Results and Implications for Theories of Hydrophobic Solvation , 2000 .

[20]  J. Ponder,et al.  Tertiary templates for proteins. Use of packing criteria in the enumeration of allowed sequences for different structural classes. , 1987, Journal of molecular biology.

[21]  M Karplus,et al.  Binding free energies and free energy components from molecular dynamics and Poisson-Boltzmann calculations. Application to amino acid recognition by aspartyl-tRNA synthetase. , 2001, Journal of molecular biology.

[22]  J Andrew McCammon,et al.  Optimized Radii for Poisson-Boltzmann Calculations with the AMBER Force Field. , 2005, Journal of chemical theory and computation.

[23]  Cheng-Yan Kao,et al.  GEM: A Gaussian evolutionary method for predicting protein side‐chain conformations , 2002, Protein science : a publication of the Protein Society.

[24]  D. Case,et al.  Generalized born models of macromolecular solvation effects. , 2000, Annual review of physical chemistry.

[25]  Adrian A Canutescu,et al.  Access the most recent version at doi: 10.1110/ps.03154503 References , 2003 .

[26]  D. Case,et al.  Exploring protein native states and large‐scale conformational changes with a modified generalized born model , 2004, Proteins.

[27]  Gregory D. Hawkins,et al.  Parametrized Models of Aqueous Free Energies of Solvation Based on Pairwise Descreening of Solute Atomic Charges from a Dielectric Medium , 1996 .

[28]  C. Brooks,et al.  Comparative Study of the Folding Free Energy Landscape of a Three-Stranded β-Sheet Protein with Explicit and Implicit Solvent Models , 2000 .

[29]  Thomas Simonson,et al.  Solvation Free Energies Estimated from Macroscopic Continuum Theory: An Accuracy Assessment , 1994 .

[30]  B. Honig,et al.  Classical electrostatics in biology and chemistry. , 1995, Science.

[31]  António M. Baptista,et al.  Implicit solvation in the self-consistent mean field theory method: sidechain modelling and prediction of folding free energies of protein mutants , 2001, J. Comput. Aided Mol. Des..

[32]  C. Brooks,et al.  Novel generalized Born methods , 2002 .

[33]  R. Friesner,et al.  Generalized Born Model Based on a Surface Integral Formulation , 1998 .

[34]  Fabrice Leclerc,et al.  Effective atom volumes for implicit solvent models: comparison between Voronoi volumes and minimum fluctuation volumes , 2001, J. Comput. Chem..

[35]  Thomas Simonson,et al.  Electrostatics and dynamics of proteins , 2003 .

[36]  Patrice Koehl,et al.  Protein topology and stability define the space of allowed sequences , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[37]  S. L. Mayo,et al.  Enzyme-like proteins by computational design , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[38]  T. Simonson,et al.  Protein molecular dynamics with the generalized born/ACE solvent model , 2001, Proteins.

[39]  Thomas Simonson,et al.  Implicit solvent models: Combining an analytical formulation of continuum electrostatics with simple models of the hydrophobic effect , 1999, J. Comput. Chem..

[40]  N. Guex,et al.  SWISS‐MODEL and the Swiss‐Pdb Viewer: An environment for comparative protein modeling , 1997, Electrophoresis.

[41]  D. Case,et al.  Continuum solvent molecular dynamics study of flexibility in interleukin-8. , 2001, Journal of molecular graphics & modelling.

[42]  A. Sali,et al.  Comparative protein structure modeling of genes and genomes. , 2000, Annual review of biophysics and biomolecular structure.

[43]  S. L. Mayo,et al.  Protein design automation , 1996, Protein science : a publication of the Protein Society.

[44]  Roland L. Dunbrack,et al.  Backbone-dependent rotamer library for proteins. Application to side-chain prediction. , 1993, Journal of molecular biology.

[45]  Anthony K. Felts,et al.  Distinguishing native conformations of proteins from decoys with an effective free energy estimator based on the OPLS all‐atom force field and the surface generalized born solvent model , 2002, Proteins.

[46]  C. Brooks,et al.  Constant‐pH molecular dynamics using continuous titration coordinates , 2004, Proteins.

[47]  Nick V Grishin,et al.  Effective scoring function for protein sequence design , 2003, Proteins.

[48]  E. Lattman,et al.  High apparent dielectric constants in the interior of a protein reflect water penetration. , 2000, Biophysical journal.

[49]  W. C. Still,et al.  The GB/SA Continuum Model for Solvation. A Fast Analytical Method for the Calculation of Approximate Born Radii , 1997 .

[50]  Johan Desmet,et al.  The dead-end elimination theorem and its use in protein side-chain positioning , 1992, Nature.

[51]  Akinori Sarai,et al.  ProTherm and ProNIT: thermodynamic databases for proteins and protein–nucleic acid interactions , 2005, Nucleic Acids Res..

[52]  L L Looger,et al.  Generalized dead-end elimination algorithms make large-scale protein side-chain structure prediction tractable: implications for protein design and structural genomics. , 2001, Journal of molecular biology.

[53]  Navin Pokala,et al.  Energy functions for protein design I: Efficient and accurate continuum electrostatics and solvation , 2004, Protein science : a publication of the Protein Society.

[54]  B. Roux,et al.  Implicit solvent models. , 1999, Biophysical chemistry.

[55]  M. Karplus,et al.  Electrostatic contributions to molecular free energies in solution. , 1998, Advances in protein chemistry.

[56]  Nathan A. Baker,et al.  Poisson-Boltzmann Methods for Biomolecular Electrostatics , 2004, Numerical Computer Methods, Part D.

[57]  R. Lavery,et al.  A new approach to the rapid determination of protein side chain conformations. , 1991, Journal of biomolecular structure & dynamics.

[58]  Thomas Simonson,et al.  Reintroducing electrostatics into protein X-ray structure refinement: bulk solvent treated as a dielectric continuum. , 2003, Acta crystallographica. Section D, Biological crystallography.

[59]  N. Grishin,et al.  Side‐chain modeling with an optimized scoring function , 2002, Protein science : a publication of the Protein Society.

[60]  B. Dominy,et al.  Development of a generalized Born model parameterization for proteins and nucleic acids , 1999 .

[61]  Gregory D. Hawkins,et al.  Pairwise solute descreening of solute charges from a dielectric medium , 1995 .

[62]  Alfonso Jaramillo,et al.  Automatic Sequence Design of Major Histocompatibility Complex Class I Binding Peptides Impairing CD8+ T Cell Recognition* 210 , 2003, The Journal of Biological Chemistry.

[63]  R A Friesner,et al.  Prediction of loop geometries using a generalized born model of solvation effects , 1999, Proteins.

[64]  A. Roitberg,et al.  All-atom structure prediction and folding simulations of a stable protein. , 2002, Journal of the American Chemical Society.

[65]  D. Case,et al.  Modification of the Generalized Born Model Suitable for Macromolecules , 2000 .

[66]  M. Karplus,et al.  CHARMM: A program for macromolecular energy, minimization, and dynamics calculations , 1983 .

[67]  S L Mayo,et al.  Pairwise calculation of protein solvent-accessible surface areas. , 1998, Folding & design.

[68]  Kenichiro Koga,et al.  The hydrophobic effect , 2003 .

[69]  D. Case,et al.  Proton binding to proteins: pK(a) calculations with explicit and implicit solvent models. , 2004, Journal of the American Chemical Society.

[70]  J. Warwicker,et al.  Calculation of the electric potential in the active site cleft due to alpha-helix dipoles. , 1982, Journal of molecular biology.

[71]  T. Simonson Free Energy Calculations: Approximate Methods for Biological Macromolecules , 2007 .

[72]  Charles L. Brooks,et al.  Performance comparison of generalized born and Poisson methods in the calculation of electrostatic solvation energies for protein structures , 2004, J. Comput. Chem..

[73]  Bruce Tidor,et al.  Electrostatic interactions in the GCN4 leucine zipper: Substantial contributions arise from intramolecular interactions enhanced on binding , 1999, Protein science : a publication of the Protein Society.

[74]  K. Sharp,et al.  Accurate Calculation of Hydration Free Energies Using Macroscopic Solvent Models , 1994 .

[75]  Luhua Lai,et al.  Estimating protein–ligand binding free energy: Atomic solvation parameters for partition coefficient and solvation free energy calculation , 2004, Proteins.

[76]  Peter Scheffel,et al.  Incorporating variable dielectric environments into the generalized Born model. , 2005, The Journal of chemical physics.

[77]  T. Simonson,et al.  Macromolecular electrostatics: continuum models and their growing pains. , 2001, Current opinion in structural biology.

[78]  D. Case,et al.  Insights into protein-protein binding by binding free energy calculation and free energy decomposition for the Ras-Raf and Ras-RalGDS complexes. , 2003, Journal of molecular biology.

[79]  C. Brooks,et al.  Recent advances in the development and application of implicit solvent models in biomolecule simulations. , 2004, Current opinion in structural biology.

[80]  R. Schulz,et al.  Protein Structure Prediction , 2020, Methods in Molecular Biology.

[81]  J. Apostolakis,et al.  Evaluation of a fast implicit solvent model for molecular dynamics simulations , 2002, Proteins.

[82]  C. Pace,et al.  A helix propensity scale based on experimental studies of peptides and proteins. , 1998, Biophysical journal.

[83]  C. Sander,et al.  Database algorithm for generating protein backbone and side-chain co-ordinates from a C alpha trace application to model building and detection of co-ordinate errors. , 1991, Journal of molecular biology.

[84]  J. Apostolakis,et al.  Exhaustive docking of molecular fragments with electrostatic solvation , 1999, Proteins.

[85]  Axel T. Brunger,et al.  X-PLOR Version 3.1: A System for X-ray Crystallography and NMR , 1992 .

[86]  Alfonso Jaramillo,et al.  Computational protein design is a challenge for implicit solvation models. , 2005, Biophysical journal.

[87]  Roland L. Dunbrack,et al.  Bayesian statistical analysis of protein side‐chain rotamer preferences , 1997, Protein science : a publication of the Protein Society.