Comparing pairwise‐additive and many‐body generalized Born models for acid/base calculations and protein design

Generalized Born (GB) solvent models are common in acid/base calculations and protein design. With GB, the interaction between a pair of solute atoms depends on the shape of the protein/solvent boundary and, therefore, the positions of all solute atoms, so that GB is a many‐body potential. For compute‐intensive applications, the model is often simplified further, by introducing a mean, native‐like protein/solvent boundary, which removes the many‐body property. We investigate a method for both acid/base calculations and protein design that uses Monte Carlo simulations in which side chains can explore rotamers, bind/release protons, or mutate. The fluctuating protein/solvent dielectric boundary is treated in a way that is numerically exact (within the GB framework), in contrast to a mean boundary. Its originality is that it captures the many‐body character while retaining the residue‐pairwise complexity given by a fixed boundary. The method is implemented in the Proteus protein design software. It yields a slight but systematic improvement for acid/base constants in nine proteins and a significant improvement for the computational design of three PDZ domains. It eliminates a source of model uncertainty, which will facilitate the analysis of other model limitations. © 2017 Wiley Periodicals, Inc.

[1]  Savvas Polydorides,et al.  Monte carlo simulations of proteins at constant pH with generalized born solvent, flexible sidechains, and an effective dielectric boundary , 2013, J. Comput. Chem..

[2]  E. J. Arthur,et al.  Predicting extreme pKa shifts in staphylococcal nuclease mutants with constant pH molecular dynamics , 2011, Proteins.

[3]  Brian Kuhlman,et al.  Computer-based design of novel protein structures. , 2006, Annual review of biophysics and biomolecular structure.

[4]  Thomas Simonson,et al.  Reintroducing electrostatics into protein X-ray structure refinement: bulk solvent treated as a dielectric continuum. , 2003, Acta crystallographica. Section D, Biological crystallography.

[5]  Charles L. Brooks,et al.  CHARGE SCREENING AND THE DIELECTRIC CONSTANT OF PROTEINS : INSIGHTS FROM MOLECULAR DYNAMICS , 1996 .

[6]  Gregory D. Hawkins,et al.  Pairwise solute descreening of solute charges from a dielectric medium , 1995 .

[7]  Tim J. P. Hubbard,et al.  SCOP database in 2004: refinements integrate structure and sequence family data , 2004, Nucleic Acids Res..

[8]  Nathan A. Baker Biomolecular Applications of Poisson-Boltzmann Methods , 2005 .

[9]  D. Case,et al.  Constant pH molecular dynamics in generalized Born implicit solvent , 2004, J. Comput. Chem..

[10]  Shaw Pb Theory of the Poisson Green's function for discontinuous dielectric media with an application to protein biophysics. , 1985 .

[11]  J. Andrew McCammon,et al.  Computation of electrostatic forces on solvated molecules using the Poisson-Boltzmann equation , 1993 .

[12]  Thomas Simonson,et al.  A POISSON-BOLTZMANN STUDY OF CHARGE INSERTION IN AN ENZYME ACTIVE SITE : THE EFFECT OF DIELECTRIC RELAXATION , 1999 .

[13]  T. Simonson,et al.  Proton binding to proteins: a free-energy component analysis using a dielectric continuum model. , 2005, Biophysical journal.

[14]  E. Alexov,et al.  Combining conformational flexibility and continuum electrostatics for calculating pK(a)s in proteins. , 2002, Biophysical journal.

[15]  T. Simonson,et al.  Probing the stereospecificity of tyrosyl- and glutaminyl-tRNA synthetase with molecular dynamics. , 2017, Journal of molecular graphics & modelling.

[16]  A. Caflisch,et al.  Efficient electrostatic solvation model for protein‐fragment docking , 2001, Proteins.

[17]  C. Pace,et al.  Protein Ionizable Groups: pK Values and Their Contribution to Protein Stability and Solubility* , 2009, Journal of Biological Chemistry.

[18]  W. C. Still,et al.  Semianalytical treatment of solvation for molecular mechanics and dynamics , 1990 .

[19]  C. Brooks,et al.  Novel generalized Born methods , 2002 .

[20]  Thomas Simonson,et al.  Electrostatics and dynamics of proteins , 2003 .

[21]  D. Case,et al.  A novel view of pH titration in biomolecules. , 2001, Biochemistry.

[22]  Chris Sander,et al.  A Specificity Map for the PDZ Domain Family , 2008, PLoS biology.

[23]  Miguel Machuqueiro,et al.  Acidic range titration of HEWL using a constant‐pH molecular dynamics method , 2008, Proteins.

[24]  Cyrus Chothia,et al.  The SUPERFAMILY database in 2004: additions and improvements , 2004, Nucleic Acids Res..

[25]  Sarah L. Williams,et al.  Progress in the prediction of pKa values in proteins , 2011, Proteins.

[26]  Lawrence S. Kroll Mathematica--A System for Doing Mathematics by Computer. , 1989 .

[27]  W. Lim,et al.  Mechanism and role of PDZ domains in signaling complex assembly. , 2001, Journal of cell science.

[28]  Thomas Simonson,et al.  Computational sidechain placement and protein mutagenesis with implicit solvent models , 2007, Proteins.

[29]  P. Beroza,et al.  Application of a pairwise generalized Born model to proteins and nucleic acids: inclusion of salt effects , 1999 .

[30]  C. Chothia,et al.  Assignment of homology to genome sequences using a library of hidden Markov models that represent all proteins of known structure. , 2001, Journal of molecular biology.

[31]  Tony J. You,et al.  Conformation and hydrogen ion titration of proteins: a continuum electrostatic model with conformational flexibility. , 1995, Biophysical journal.

[32]  Costas D Maranas,et al.  Recent advances in computational protein design. , 2011, Current opinion in structural biology.

[33]  Berend Smit,et al.  Understanding Molecular Simulation , 2001 .

[34]  Thomas Simonson,et al.  Pairwise decomposition of an MMGBSA energy function for computational protein design , 2014, J. Comput. Chem..

[35]  M. Gunner,et al.  MCCE analysis of the pKas of introduced buried acids and bases in staphylococcal nuclease , 2011, Proteins.

[36]  Tjelvar S. G. Olsson,et al.  Extent of enthalpy–entropy compensation in protein–ligand interactions , 2011, Protein science : a publication of the Protein Society.

[37]  P. Kollman,et al.  Calculating structures and free energies of complex molecules: combining molecular mechanics and continuum models. , 2000, Accounts of chemical research.

[38]  Thomas Simonson Protein: ligand recognition: simple models for electrostatic effects. , 2013, Current pharmaceutical design.

[39]  R. Friesner,et al.  Generalized Born Model Based on a Surface Integral Formulation , 1998 .

[40]  Emil Alexov,et al.  The role of protonation states in ligand-receptor recognition and binding. , 2013, Current pharmaceutical design.

[41]  D. Baker,et al.  A large scale test of computational protein design: folding and stability of nine completely redesigned globular proteins. , 2003, Journal of molecular biology.

[42]  T. Simonson What Is the Dielectric Constant of a Protein When Its Backbone Is Fixed? , 2013, Journal of chemical theory and computation.

[43]  F M Richards,et al.  Areas, volumes, packing and protein structure. , 1977, Annual review of biophysics and bioengineering.

[44]  David A. Case,et al.  Including Side Chain Flexibility in Continuum Electrostatic Calculations of Protein Titration , 1996 .

[45]  Thomas Simonson,et al.  A residue-pairwise generalized born scheme suitable for protein design calculations. , 2005, The journal of physical chemistry. B.

[46]  A. Warshel,et al.  Free energy of charges in solvated proteins: microscopic calculations using a reversible charging process. , 1986, Biochemistry.

[47]  M. Karplus,et al.  A Comprehensive Analytical Treatment of Continuum Electrostatics , 1996 .

[48]  M. Karplus,et al.  Electrostatic contributions to molecular free energies in solution. , 1998, Advances in protein chemistry.

[49]  D. Baker,et al.  Native protein sequences are close to optimal for their structures. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[50]  D. Case,et al.  Generalized born models of macromolecular solvation effects. , 2000, Annual review of physical chemistry.

[51]  Jinrang Kim,et al.  Are acidic and basic groups in buried proteins predicted to be ionized? , 2005, Journal of molecular biology.

[52]  D. Case,et al.  Proton binding to proteins: pK(a) calculations with explicit and implicit solvent models. , 2004, Journal of the American Chemical Society.

[53]  Thomas Simonson,et al.  Testing the Coulomb/Accessible Surface Area solvent model for protein stability, ligand binding, and protein design , 2008, BMC Bioinformatics.

[54]  Junjun Mao,et al.  MCCE2: Improving protein pKa calculations with extensive side chain rotamer sampling , 2009, J. Comput. Chem..

[55]  J. Warwicker,et al.  Simplified methods for pKa and acid pH‐dependent stability estimation in proteins: Removing dielectric and counterion boundaries , 2008, Protein science : a publication of the Protein Society.

[56]  R. Zauhar,et al.  A new method for computing the macromolecular electric potential. , 1985, Journal of molecular biology.

[57]  Charles L Brooks,et al.  Toward the accurate first-principles prediction of ionization equilibria in proteins. , 2006, Biochemistry.

[58]  Yoshitsugu Shiro,et al.  1.25 A resolution crystal structures of human haemoglobin in the oxy, deoxy and carbonmonoxy forms. , 2006, Journal of molecular biology.

[59]  Savvas Polydorides,et al.  Computational protein design: The proteus software and selected applications , 2013, J. Comput. Chem..

[60]  A. Warshel,et al.  The effect of protein relaxation on charge-charge interactions and dielectric constants of proteins. , 1998, Biophysical journal.

[61]  Max F. Perutz,et al.  Mechanisms of Cooperativity and Allosteric Regulation in Proteins , 1990 .

[62]  S. Petersen,et al.  Simulation of protein conformational freedom as a function of pH: constant‐pH molecular dynamics using implicit titration , 1997, Proteins.

[63]  Jan H. Jensen,et al.  Very fast prediction and rationalization of pKa values for protein–ligand complexes , 2008, Proteins.

[64]  Valentin Köhler Protein design : methods and applications , 2014 .

[65]  ANATOLY M. RUVINSKY Role of binding entropy in the refinement of protein–ligand docking predictions: Analysis based on the use of 11 scoring functions , 2007, J. Comput. Chem..

[66]  Jeffrey J. Gray,et al.  Rapid calculation of protein pKa values using Rosetta. , 2012, Biophysical journal.

[67]  P E Wright,et al.  Electrostatic calculations of side-chain pK(a) values in myoglobin and comparison with NMR data for histidines. , 1993, Biochemistry.

[68]  M. Karplus,et al.  Hemoglobin Bohr effects: atomic origin of the histidine residue contributions. , 2013, Biochemistry.

[69]  Jan H. Jensen,et al.  PROPKA3: Consistent Treatment of Internal and Surface Residues in Empirical pKa Predictions. , 2011, Journal of chemical theory and computation.

[70]  C. Castañeda,et al.  Molecular determinants of the pKa values of Asp and Glu residues in staphylococcal nuclease , 2009, Proteins.

[71]  Jeffery G Saven,et al.  Computational protein design: engineering molecular diversity, nonnatural enzymes, nonbiological cofactor complexes, and membrane proteins. , 2011, Current opinion in chemical biology.

[72]  M. Sheng,et al.  PDZ Domains: Structural Modules for Protein Complex Assembly* , 2002, The Journal of Biological Chemistry.

[73]  Thomas Simonson,et al.  Computational protein design: Software implementation, parameter optimization, and performance of a simple model , 2008, J. Comput. Chem..

[74]  P. Harbury,et al.  Accurate, conformation-dependent predictions of solvent effects on protein ionization constants , 2007, Proceedings of the National Academy of Sciences.

[75]  M. Gilson,et al.  Prediction of pH-dependent properties of proteins. , 1994, Journal of molecular biology.

[76]  M. Schaefer,et al.  A precise analytical method for calculating the electrostatic energy of macromolecules in aqueous solution. , 1990, Journal of molecular biology.

[77]  M. Karplus,et al.  pKa's of ionizable groups in proteins: atomic detail from a continuum electrostatic model. , 1990, Biochemistry.

[78]  E. J. Fuentes,et al.  Computational Design of the Tiam1 PDZ Domain and Its Ligand Binding. , 2017, Journal of chemical theory and computation.

[79]  Christopher M. MacDermaid,et al.  Theoretical and computational protein design. , 2011, Annual review of physical chemistry.

[80]  Savvas Polydorides,et al.  Predicting the acid/base behavior of proteins: a constant-pH Monte Carlo approach with generalized born solvent. , 2010, The journal of physical chemistry. B.

[81]  Axel T. Brunger,et al.  X-PLOR Version 3.1: A System for X-ray Crystallography and NMR , 1992 .

[82]  Jana K. Shen,et al.  Predicting pKa values with continuous constant pH molecular dynamics. , 2009, Methods in enzymology.

[83]  J. Ponder,et al.  Force fields for protein simulations. , 2003, Advances in protein chemistry.

[84]  W. C. Still,et al.  The GB/SA Continuum Model for Solvation. A Fast Analytical Method for the Calculation of Approximate Born Radii , 1997 .

[85]  V. Simplaceanu,et al.  Assessment of roles of surface histidyl residues in the molecular basis of the Bohr effect and of beta 143 histidine in the binding of 2,3-bisphosphoglycerate in human normal adult hemoglobin. , 1999, Biochemistry.

[86]  D. Baker,et al.  Prediction and design of macromolecular structures and interactions , 2006, Philosophical Transactions of the Royal Society B: Biological Sciences.

[87]  Miranda Thomas,et al.  PDZ domains: the building blocks regulating tumorigenesis. , 2011, The Biochemical journal.

[88]  Emil Alexov,et al.  Protonation and pK changes in protein–ligand binding , 2013, Quarterly Reviews of Biophysics.

[89]  W. Im,et al.  Continuum solvation model: Computation of electrostatic forces from numerical solutions to the Poisson-Boltzmann equation , 1998 .

[90]  T. Simonson,et al.  Computational protein design with a generalized born solvent model: Application to asparaginyl‐tRNA synthetase , 2011, Proteins.

[91]  Lin Li,et al.  pKa predictions for proteins, RNAs, and DNAs with the Gaussian dielectric function using DelPhi pKa , 2015, Proteins.

[92]  A. R. Fresht Structure and Mechanism in Protein Science: A Guide to Enzyme Catalysis and Protein Folding , 1999 .

[93]  Charles L. Brooks,et al.  Performance comparison of generalized born and Poisson methods in the calculation of electrostatic solvation energies for protein structures , 2004, J. Comput. Chem..

[94]  T. Shepherd,et al.  Structural and thermodynamic analysis of PDZ-ligand interactions. , 2011, Methods in enzymology.

[95]  C. Brooks,et al.  Constant‐pH molecular dynamics using continuous titration coordinates , 2004, Proteins.

[96]  Gary D Bader,et al.  The multiple-specificity landscape of modular peptide recognition domains. , 2011 .

[97]  Cyrus Chothia,et al.  The SUPERFAMILY database in 2007: families and functions , 2006, Nucleic Acids Res..

[98]  J. Pleiss Protein design in metabolic engineering and synthetic biology. , 2011, Current opinion in biotechnology.

[99]  Bruce Tidor,et al.  Progress in computational protein design. , 2007, Current opinion in biotechnology.

[100]  Thomas Simonson,et al.  Comparing three stochastic search algorithms for computational protein design: Monte Carlo, replica exchange Monte Carlo, and a multistart, steepest‐descent heuristic , 2016, J. Comput. Chem..

[101]  Thomas Simonson,et al.  Protein:Ligand binding free energies: A stringent test for computational protein design , 2016, J. Comput. Chem..

[102]  David A. Case,et al.  Effective Born radii in the generalized Born approximation: The importance of being perfect , 2002, J. Comput. Chem..

[103]  P. Kollman,et al.  A Second Generation Force Field for the Simulation of Proteins, Nucleic Acids, and Organic Molecules , 1995 .