Monte carlo simulations of proteins at constant pH with generalized born solvent, flexible sidechains, and an effective dielectric boundary

Titratable residues determine the acid/base behavior of proteins, strongly influencing their function; in addition, proton binding is a valuable reporter on electrostatic interactions. We describe a method for pKa calculations, using constant‐pH Monte Carlo (MC) simulations to explore the space of sidechain conformations and protonation states, with an efficient and accurate generalized Born model (GB) for the solvent effects. To overcome the many‐body dependency of the GB model, we use a “Native Environment” approximation, whose accuracy is shown to be good. It allows the precalculation and storage of interactions between all sidechain pairs, a strategy borrowed from computational protein design, which makes the MC simulations themselves very fast. The method is tested for 12 proteins and 167 titratable sidechains. It gives an rms error of 1.1 pH units, similar to the trivial “Null” model. The only adjustable parameter is the protein dielectric constant. The best accuracy is achieved for values between 4 and 8, a range that is physically plausible for a protein interior. For sidechains with large pKa shifts, ≥2, the rms error is 1.6, compared to 2.5 with the Null model and 1.5 with the empirical PROPKA method. © 2013 Wiley Periodicals, Inc.

[1]  Thomas Simonson,et al.  Molecular Dynamics Simulations Show That Conformational Selection Governs the Binding Preferences of Imatinib for Several Tyrosine Kinases* , 2010, The Journal of Biological Chemistry.

[2]  Piotr Cieplak,et al.  R.E.D. Server: a web service for deriving RESP and ESP charges and building force field libraries for new molecules and molecular fragments , 2011, Nucleic Acids Res..

[3]  B. García-Moreno E.,et al.  Salt effects on ionization equilibria of histidines in myoglobin. , 2000, Biophysical journal.

[4]  M. Karplus,et al.  pKa's of ionizable groups in proteins: atomic detail from a continuum electrostatic model. , 1990, Biochemistry.

[5]  Carlos Simmerling,et al.  Improved Generalized Born Solvent Model Parameters for Protein Simulations. , 2013, Journal of chemical theory and computation.

[6]  G Matthias Ullmann,et al.  pH-dependent pKa values in proteins--a theoretical analysis of protonation energies with practical consequences for enzymatic reactions. , 2010, The journal of physical chemistry. B.

[7]  Savvas Polydorides,et al.  Predicting the acid/base behavior of proteins: a constant-pH Monte Carlo approach with generalized born solvent. , 2010, The journal of physical chemistry. B.

[8]  Axel T. Brunger,et al.  X-PLOR Version 3.1: A System for X-ray Crystallography and NMR , 1992 .

[9]  Jana K. Shen,et al.  Predicting pKa values with continuous constant pH molecular dynamics. , 2009, Methods in enzymology.

[10]  Robert E. Blankenship,et al.  Protein electron transfer , 1996, FEBS Letters.

[11]  Boris Aguilar,et al.  Efficient Computation of the Total Solvation Energy of Small Molecules via the R6 Generalized Born Model. , 2012, Journal of chemical theory and computation.

[12]  W R Baker,et al.  Characterization of the pH titration shifts of ribonuclease A by one- and two-dimensional nuclear magnetic resonance spectroscopy. , 1996, Archives of biochemistry and biophysics.

[13]  V L Arcus,et al.  pKA values of carboxyl groups in the native and denatured states of barnase: the pKA values of the denatured state are on average 0.4 units lower than those of model compounds. , 1995, Biochemistry.

[14]  Lawrence S. Kroll Mathematica--A System for Doing Mathematics by Computer. , 1989 .

[15]  P. Beroza,et al.  Application of a pairwise generalized Born model to proteins and nucleic acids: inclusion of salt effects , 1999 .

[16]  Tony J. You,et al.  Conformation and hydrogen ion titration of proteins: a continuum electrostatic model with conformational flexibility. , 1995, Biophysical journal.

[17]  Berend Smit,et al.  Understanding Molecular Simulation , 2001 .

[18]  Kelly K. Lee,et al.  Electrostatic effects in highly charged proteins: salt sensitivity of pKa values of histidines in staphylococcal nuclease. , 2002, Biochemistry.

[19]  Thomas Simonson,et al.  DIELECTRIC CONSTANT OF CYTOCHROME C FROM SIMULATIONS IN A WATER DROPLET INCLUDING ALL ELECTROSTATIC INTERACTIONS , 1998 .

[20]  Thomas Simonson,et al.  Computational protein design: Software implementation, parameter optimization, and performance of a simple model , 2008, J. Comput. Chem..

[21]  P. Harbury,et al.  Accurate, conformation-dependent predictions of solvent effects on protein ionization constants , 2007, Proceedings of the National Academy of Sciences.

[22]  A. Fersht,et al.  Kinetic characterization of the recombinant ribonuclease from Bacillus amyloliquefaciens (barnase) and investigation of key residues in catalysis by site-directed mutagenesis. , 1989, Biochemistry.

[23]  P. Kollman,et al.  A Second Generation Force Field for the Simulation of Proteins, Nucleic Acids, and Organic Molecules , 1995 .

[24]  Jan H. Jensen,et al.  Very fast prediction and rationalization of pKa values for protein–ligand complexes , 2008, Proteins.

[25]  Thomas Simonson,et al.  A residue-pairwise generalized born scheme suitable for protein design calculations. , 2005, The journal of physical chemistry. B.

[26]  W. C. Still,et al.  Semianalytical treatment of solvation for molecular mechanics and dynamics , 1990 .

[27]  C. Brooks,et al.  Novel generalized Born methods , 2002 .

[28]  Thomas Simonson,et al.  Testing the Coulomb/Accessible Surface Area solvent model for protein stability, ligand binding, and protein design , 2008, BMC Bioinformatics.

[29]  Thomas Simonson,et al.  Electrostatics and dynamics of proteins , 2003 .

[30]  M. Gilson,et al.  Prediction of pH-dependent properties of proteins. , 1994, Journal of molecular biology.

[31]  S. Linse,et al.  Measurement and modelling of sequence-specific pKa values of lysine residues in calbindin D9k. , 1996, Journal of molecular biology.

[32]  A. Warshel,et al.  Free energy of charges in solvated proteins: microscopic calculations using a reversible charging process. , 1986, Biochemistry.

[33]  M. Karplus,et al.  A Comprehensive Analytical Treatment of Continuum Electrostatics , 1996 .

[34]  M. Karplus,et al.  Electrostatic contributions to molecular free energies in solution. , 1998, Advances in protein chemistry.

[35]  C. Tanford,et al.  Theory of Protein Titration Curves. I. General Equations for Impenetrable Spheres , 1957 .

[36]  C Redfield,et al.  Measurement of the individual pKa values of acidic residues of hen and turkey lysozymes by two-dimensional 1H NMR. , 1994, Biophysical journal.

[37]  C. Brooks,et al.  Constant‐pH molecular dynamics using continuous titration coordinates , 2004, Proteins.

[38]  P E Wright,et al.  Electrostatic calculations of side-chain pK(a) values in myoglobin and comparison with NMR data for histidines. , 1993, Biochemistry.

[39]  Boris Aguilar,et al.  Reducing the Secondary Structure Bias in the Generalized Born Model via R6 Effective Radii , 2010 .

[40]  C. Pace,et al.  Protein Ionizable Groups: pK Values and Their Contribution to Protein Stability and Solubility* , 2009, Journal of Biological Chemistry.

[41]  W. Im,et al.  Continuum solvation model: Computation of electrostatic forces from numerical solutions to the Poisson-Boltzmann equation , 1998 .

[42]  S L Mayo,et al.  Pairwise calculation of protein solvent-accessible surface areas. , 1998, Folding & design.

[43]  Thomas Simonson,et al.  Computational sidechain placement and protein mutagenesis with implicit solvent models , 2007, Proteins.

[44]  M. Gilson,et al.  pKa measurements from nuclear magnetic resonance for the B1 and B2 immunoglobulin G-binding domains of protein G: comparison with calculated values for nuclear magnetic resonance and X-ray structures. , 1997, Biochemistry.

[45]  A. R. Fresht Structure and Mechanism in Protein Science: A Guide to Enzyme Catalysis and Protein Folding , 1999 .

[46]  R. Lavery,et al.  A new approach to the rapid determination of protein side chain conformations. , 1991, Journal of biomolecular structure & dynamics.

[47]  David A. Case,et al.  Including Side Chain Flexibility in Continuum Electrostatic Calculations of Protein Titration , 1996 .

[48]  Jan H. Jensen,et al.  PROPKA3: Consistent Treatment of Internal and Surface Residues in Empirical pKa Predictions. , 2011, Journal of chemical theory and computation.

[49]  J. Pitera,et al.  Dielectric properties of proteins from simulation: the effects of solvent, ligands, pH, and temperature. , 2001, Biophysical journal.

[50]  C. Castañeda,et al.  Molecular determinants of the pKa values of Asp and Glu residues in staphylococcal nuclease , 2009, Proteins.

[51]  Jeffery G Saven,et al.  Computational protein design: engineering molecular diversity, nonnatural enzymes, nonbiological cofactor complexes, and membrane proteins. , 2011, Current opinion in chemical biology.

[52]  Emilio Gallicchio,et al.  On the nonpolar hydration free energy of proteins: surface area and continuum solvent models for the solute-solvent interaction energy. , 2003, Journal of the American Chemical Society.

[53]  J Andrew McCammon,et al.  Optimized Radii for Poisson-Boltzmann Calculations with the AMBER Force Field. , 2005, Journal of chemical theory and computation.

[54]  D. Case,et al.  A novel view of pH titration in biomolecules. , 2001, Biochemistry.

[55]  Charles L. Brooks,et al.  CHARGE SCREENING AND THE DIELECTRIC CONSTANT OF PROTEINS : INSIGHTS FROM MOLECULAR DYNAMICS , 1996 .

[56]  H. Nakamura,et al.  Individual ionization constants of all the carboxyl groups in ribonuclease HI from Escherichia coli determined by NMR. , 1994, Biochemistry.

[57]  J. Ponder,et al.  Force fields for protein simulations. , 2003, Advances in protein chemistry.

[58]  S. Linse,et al.  Ionization Behavior of Acidic Residues in Calbindin D9k , 1999, Proteins.

[59]  D. Case,et al.  Modification of the Generalized Born Model Suitable for Macromolecules , 2000 .

[60]  Charles L. Brooks,et al.  Surveying implicit solvent models for estimating small molecule absolute hydration free energies , 2011, J. Comput. Chem..

[61]  S. L. Mayo,et al.  De novo protein design: fully automated sequence selection. , 1997, Science.

[62]  E. J. Arthur,et al.  Predicting extreme pKa shifts in staphylococcal nuclease mutants with constant pH molecular dynamics , 2011, Proteins.

[63]  Thomas Simonson,et al.  Reintroducing electrostatics into protein X-ray structure refinement: bulk solvent treated as a dielectric continuum. , 2003, Acta crystallographica. Section D, Biological crystallography.

[64]  Emil Alexov,et al.  Using DelPhi capabilities to mimic protein's conformational reorganization with amino acid specific dielectric constants. , 2013, Communications in computational physics.

[65]  Gregory D. Hawkins,et al.  Pairwise solute descreening of solute charges from a dielectric medium , 1995 .

[66]  S. Friend,et al.  Analysis of electrostatic interactions and their relationship to conformation and stability of bovine pancreatic trypsin inhibitor. , 1982, Biochemistry.

[67]  Homme W Hellinga,et al.  An empirical model for electrostatic interactions in proteins incorporating multiple geometry‐dependent dielectric constants , 2003, Proteins.

[68]  Gernot Kieseritzky,et al.  Optimizing pKA computation in proteins with pH adapted conformations , 2008, Proteins.

[69]  M. Karplus,et al.  CHARMM: A program for macromolecular energy, minimization, and dynamics calculations , 1983 .

[70]  L. Serrano,et al.  Predicting changes in the stability of proteins and protein complexes: a study of more than 1000 mutations. , 2002, Journal of molecular biology.

[71]  M K Gilson,et al.  Theoretical and experimental analysis of ionization equilibria in ovomucoid third domain. , 1998, Biochemistry.

[72]  David A. Case,et al.  Effective Born radii in the generalized Born approximation: The importance of being perfect , 2002, J. Comput. Chem..

[73]  Jinrang Kim,et al.  Are acidic and basic groups in buried proteins predicted to be ionized? , 2005, Journal of molecular biology.

[74]  D. Case,et al.  Proton binding to proteins: pK(a) calculations with explicit and implicit solvent models. , 2004, Journal of the American Chemical Society.

[75]  D. Baker,et al.  Prediction and design of macromolecular structures and interactions , 2006, Philosophical Transactions of the Royal Society B: Biological Sciences.

[76]  K Hamaguchi,et al.  Analysis of the acid-base titration curve of hen lysozyme. , 1980, Journal of biochemistry.

[77]  F M Richards,et al.  Areas, volumes, packing and protein structure. , 1977, Annual review of biophysics and bioengineering.

[78]  S. Withers,et al.  Dissection of nucleophilic and acid-base catalysis in glycosidases. , 2001, Current opinion in chemical biology.

[79]  C. Brooks,et al.  Recent advances in the development and application of implicit solvent models in biomolecule simulations. , 2004, Current opinion in structural biology.

[80]  A M Gronenborn,et al.  Ionization equilibria for side-chain carboxyl groups in oxidized and reduced human thioredoxin and in the complex with its target peptide from the transcription factor NF kappa B. , 1996, Biochemistry.

[81]  D. Pérahia,et al.  Internal and interfacial dielectric properties of cytochrome c from molecular dynamics in aqueous solution. , 1995, Proceedings of the National Academy of Sciences of the United States of America.

[82]  Jeffrey J. Gray,et al.  Rapid calculation of protein pKa values using Rosetta. , 2012, Biophysical journal.

[83]  S. Boxer,et al.  Effects of buried ionizable amino acids on the reduction potential of recombinant myoglobin. , 1989, Science.

[84]  Brian Kuhlman,et al.  Computer-based design of novel protein structures. , 2006, Annual review of biophysics and biomolecular structure.

[85]  E. Alexov,et al.  Combining conformational flexibility and continuum electrostatics for calculating pK(a)s in proteins. , 2002, Biophysical journal.

[86]  Junjun Mao,et al.  MCCE2: Improving protein pKa calculations with extensive side chain rotamer sampling , 2009, J. Comput. Chem..

[87]  Alan R. Fersht,et al.  Stabilization of protein structure by interaction of α-helix dipole with a charged side chain , 1988, Nature.

[88]  S. Kanaya,et al.  Role of histidine 124 in the catalytic function of ribonuclease HI from Escherichia coli. , 1993, The Journal of biological chemistry.

[89]  L R Brown,et al.  Determination of the dissociation constants of the lysine residues of lysozyme by proton-magnetic-resonance spectroscopy. , 1973, European journal of biochemistry.

[90]  D. Case,et al.  Constant pH molecular dynamics in generalized Born implicit solvent , 2004, J. Comput. Chem..

[91]  Charles L Brooks,et al.  Recent advances in implicit solvent-based methods for biomolecular simulations. , 2008, Current opinion in structural biology.

[92]  K Hamaguchi,et al.  Ionization of the catalytic groups and tyrosyl residues in human lysozyme. , 1980, Journal of biochemistry.

[93]  Sarah L. Williams,et al.  Progress in the prediction of pKa values in proteins , 2011, Proteins.

[94]  Miguel Machuqueiro,et al.  Acidic range titration of HEWL using a constant‐pH molecular dynamics method , 2008, Proteins.

[95]  Jianpeng Ma,et al.  CHARMM: The biomolecular simulation program , 2009, J. Comput. Chem..

[96]  Savvas Polydorides,et al.  Computational protein design: The proteus software and selected applications , 2013, J. Comput. Chem..

[97]  Stephen L Mayo,et al.  Simple electrostatic model improves designed protein sequences , 2006, Protein science : a publication of the Protein Society.

[98]  Charles L Brooks,et al.  Toward the accurate first-principles prediction of ionization equilibria in proteins. , 2006, Biochemistry.

[99]  A. Warshel,et al.  The effect of protein relaxation on charge-charge interactions and dielectric constants of proteins. , 1998, Biophysical journal.

[100]  S. Petersen,et al.  Simulation of protein conformational freedom as a function of pH: constant‐pH molecular dynamics using implicit titration , 1997, Proteins.