Multiscale Monte Carlo Sampling of Protein Sidechains: Application to Binding Pocket Flexibility.

We present a Monte Carlo sidechain sampling procedure and apply it to assessing the flexibility of protein binding pockets. We implemented a multiple "time step" Monte Carlo algorithm to optimize sidechain sampling with a surface generalized Born implicit solvent model. In this approach, certain forces (those due to long-range electrostatics and the implicit solvent model) are updated infrequently, in "outer steps", while short-range forces (covalent, local nonbonded interactions) are updated at every "inner step". Two multistep protocols were studied. The first protocol rigorously obeys detailed balance, and the second protocol introduces an approximation to the solvation term that increases the acceptance ratio. The first protocol gives a 10-fold improvement over a protocol that does not use multiple time steps, while the second protocol generates comparable ensembles and gives a 15-fold improvement. A range of 50-200 inner steps per outer step was found to give optimal performance for both protocols. The resultant method is a practical means to assess sidechain flexibility in ligand binding pockets, as we illustrate with proof-of-principle calculations on six proteins: DB3 antibody, thermolysin, estrogen receptor, PPAR-γ, PI3 kinase, and CDK2. The resulting sidechain ensembles of the apo binding sites correlate well with known induced fit conformational changes and provide insights into binding pocket flexibility.

[1]  R. Friesner,et al.  Evaluation and Reparametrization of the OPLS-AA Force Field for Proteins via Comparison with Accurate Quantum Chemical Calculations on Peptides† , 2001 .

[2]  Ronald M. Levy,et al.  The SGB/NP hydration free energy model based on the surface generalized born solvent reaction field and novel nonpolar hydration free energy estimators , 2002, J. Comput. Chem..

[3]  J. Ilja Siepmann,et al.  Monte Carlo algorithms for simulating systems with adiabatic separation of electronic and nuclear degrees of freedom , 1999 .

[4]  B. Roux,et al.  Implicit solvent models. , 1999, Biophysical chemistry.

[5]  S. Shapiro,et al.  An Analysis of Variance Test for Normality (Complete Samples) , 1965 .

[6]  L. Verlet Computer "Experiments" on Classical Fluids. II. Equilibrium Correlation Functions , 1968 .

[7]  W. L. Jorgensen,et al.  Polypeptide folding using Monte Carlo sampling, concerted rotation, and continuum solvation. , 2004, Journal of the American Chemical Society.

[8]  R. Friesner,et al.  Novel procedure for modeling ligand/receptor induced fit effects. , 2006, Journal of medicinal chemistry.

[9]  Daniel Herschlag,et al.  RNA simulations: probing hairpin unfolding and the dynamics of a GNRA tetraloop. , 2002, Journal of molecular biology.

[10]  Z. Xiang,et al.  On the role of the crystal environment in determining protein side-chain conformations. , 2002, Journal of molecular biology.

[11]  J Andrew McCammon,et al.  Configurational‐bias sampling technique for predicting side‐chain conformations in proteins , 2006, Protein science : a publication of the Protein Society.

[12]  I. Wilson,et al.  Structural analysis of antibody specificity. Detailed comparison of five Fab'-steroid complexes. , 1994, Journal of molecular biology.

[13]  Matthew P. Jacobson,et al.  Conformational Changes in Protein Loops and Helices Induced by Post-Translational Phosphorylation , 2006, PLoS Comput. Biol..

[14]  I. Wilson,et al.  Three-dimensional structure of an anti-steroid Fab' and progesterone-Fab' complex. , 1993, Journal of molecular biology.

[15]  R. Friesner,et al.  Generalized Born Model Based on a Surface Integral Formulation , 1998 .

[16]  Robbie Loewith,et al.  A Pharmacological Map of the PI3-K Family Defines a Role for p110α in Insulin Signaling , 2006, Cell.

[17]  N. Metropolis,et al.  Equation of State Calculations by Fast Computing Machines , 1953, Resonance.

[18]  Jens Meiler,et al.  ROSETTALIGAND: Protein–small molecule docking with full side‐chain flexibility , 2006, Proteins.

[19]  David L Mobley,et al.  The Confine-and-Release Method: Obtaining Correct Binding Free Energies in the Presence of Protein Conformational Change. , 2007, Journal of chemical theory and computation.

[20]  Aaron R. Dinner,et al.  Local deformations of polymers with nonplanar rigid main‐chain internal coordinates , 2000 .

[21]  Mark E. Tuckerman,et al.  Molecular dynamics algorithm for multiple time scales: Systems with disparate masses , 1991 .

[22]  A. Sali,et al.  Modeling of loops in protein structures , 2000, Protein science : a publication of the Protein Society.

[23]  D. Baker,et al.  Native protein sequences are close to optimal for their structures. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[24]  N. Go,et al.  Ring Closure and Local Conformational Deformations of Chain Molecules , 1970 .

[25]  Lev D. Gelb,et al.  Monte Carlo simulations using sampling from an approximate potential , 2003 .

[26]  Mark E. Tuckerman,et al.  Reversible multiple time scale molecular dynamics , 1992 .

[27]  A. Sali,et al.  Modeller: generation and refinement of homology-based protein structure models. , 2003, Methods in enzymology.

[28]  Lauren Wickstrom,et al.  Secondary structure bias in generalized Born solvent models: comparison of conformational ensembles and free energy of solvent polarization from explicit and implicit solvation. , 2007, The journal of physical chemistry. B.

[29]  Michael W. Deem,et al.  Efficient Monte Carlo methods for cyclic peptides , 1999 .

[30]  David A. Agard,et al.  The Structural Basis of Estrogen Receptor/Coactivator Recognition and the Antagonism of This Interaction by Tamoxifen , 1998, Cell.

[31]  Roland L. Dunbrack,et al.  Bayesian statistical analysis of protein side‐chain rotamer preferences , 1997, Protein science : a publication of the Protein Society.

[32]  David Baker,et al.  PROTEINS: Structure, Function, and Bioinformatics 58:893–904 (2005) A “Solvated Rotamer ” Approach to Modeling Water- Mediated Hydrogen Bonds at Protein–Protein Interfaces , 2022 .

[33]  L Meijer,et al.  Inhibition of cyclin-dependent kinases, GSK-3beta and CK1 by hymenialdisine, a marine sponge constituent. , 2000, Chemistry & biology.

[34]  A. W. Rosenbluth,et al.  MONTE CARLO CALCULATION OF THE AVERAGE EXTENSION OF MOLECULAR CHAINS , 1955 .

[35]  T. Willson,et al.  Ligand binding and co-activator assembly of the peroxisome proliferator-activated receptor-γ , 1998, Nature.

[36]  H. Scheraga,et al.  Monte Carlo-minimization approach to the multiple-minima problem in protein folding. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[37]  M Carlquist,et al.  Structural aspects of agonism and antagonism in the oestrogen receptor. , 2000, Biochemical Society transactions.

[38]  T. Schlick Molecular modeling and simulation , 2002 .

[39]  J M Blaney,et al.  A geometric approach to macromolecule-ligand interactions. , 1982, Journal of molecular biology.

[40]  Anthony K. Felts,et al.  Distinguishing native conformations of proteins from decoys with an effective free energy estimator based on the OPLS all‐atom force field and the surface generalized born solvent model , 2002, Proteins.

[41]  B. Honig,et al.  A hierarchical approach to all‐atom protein loop prediction , 2004, Proteins.

[42]  Richard A. Friesner,et al.  First-Shell Solvation of Ion Pairs: Correction of Systematic Errors in Implicit Solvent Models† , 2004 .

[43]  T. Senda,et al.  Crystal structure analyses of thermolysin in complex with its inhibitors , 2002 .

[44]  B. Berne,et al.  Multiple "time step" Monte Carlo , 2002 .

[45]  Julien Michel,et al.  Efficient Generalized Born Models for Monte Carlo Simulations. , 2006, Journal of chemical theory and computation.

[46]  John A Tainer,et al.  Crystal Structure and Mutational Analysis of the Human CDK2 Kinase Complex with Cell Cycle–Regulatory Protein CksHs1 , 1996, Cell.

[47]  Conrad C. Huang,et al.  UCSF Chimera—A visualization system for exploratory research and analysis , 2004, J. Comput. Chem..

[48]  W. C. Still,et al.  Semianalytical treatment of solvation for molecular mechanics and dynamics , 1990 .

[49]  Roland L. Dunbrack,et al.  Conformational analysis of the backbone-dependent rotamer preferences of protein sidechains , 1994, Nature Structural Biology.

[50]  B J Berne,et al.  Multiple "time step" Monte Carlo simulations: application to charged systems with Ewald summation. , 2004, The Journal of chemical physics.

[51]  Millard H. Lambert,et al.  Asymmetry in the PPARγ/RXRα Crystal Structure Reveals the Molecular Basis of Heterodimerization among Nuclear Receptors , 2000 .

[52]  Richard A. Friesner,et al.  What role do surfaces play in GB models? A new‐generation of surface‐generalized born model based on a novel gaussian surface for biomolecules , 2006, J. Comput. Chem..

[53]  M. Deem,et al.  Analytical rebridging Monte Carlo: Application to cis/trans isomerization in proline-containing, cyclic peptides , 1999, physics/9904057.

[54]  B. Shoichet,et al.  Soft docking and multiple receptor conformations in virtual screening. , 2004, Journal of medicinal chemistry.

[55]  David Chandler,et al.  Perturbation Theory of the Thermodynamic Properties of Simple Liquids , 1971 .

[56]  Chaok Seok,et al.  A kinematic view of loop closure , 2004, J. Comput. Chem..

[57]  D. Theodorou,et al.  A concerted rotation algorithm for atomistic Monte Carlo simulation of polymer melts and glasses , 1993 .

[58]  Christopher A. Voigt,et al.  Trading accuracy for speed: A quantitative comparison of search algorithms in protein sequence design. , 2000, Journal of molecular biology.

[59]  K. Dill,et al.  Predicting absolute ligand binding free energies to a simple model site. , 2007, Journal of molecular biology.

[60]  Berend Smit,et al.  Understanding molecular simulation: from algorithms to applications , 1996 .

[61]  Michael R. Shirts,et al.  Atomistic protein folding simulations on the submillisecond time scale using worldwide distributed computing. , 2003, Biopolymers.

[62]  Matthew P Jacobson,et al.  Conformational selection in silico: Loop latching motions and ligand binding in enzymes , 2008, Proteins.

[63]  Barry Honig,et al.  Extending the accuracy limits of prediction for side-chain conformations. , 2001 .