Sampling of Organic Solutes in Aqueous and Heterogeneous Environments Using Oscillating Excess Chemical Potentials in Grand Canonical-like Monte Carlo-Molecular Dynamics Simulations

Solute sampling of explicit bulk-phase aqueous environments in grand canonical (GC) ensemble simulations suffer from poor convergence due to low insertion probabilities of the solutes. To address this, we developed an iterative procedure involving Grand Canonical-like Monte Carlo (GCMC) and molecular dynamics (MD) simulations. Each iteration involves GCMC of both the solutes and water followed by MD, with the excess chemical potential (μex) of both the solute and the water oscillated to attain their target concentrations in the simulation system. By periodically varying the μex of the water and solutes over the GCMC-MD iterations, solute exchange probabilities and the spatial distributions of the solutes improved. The utility of the oscillating-μex GCMC-MD method is indicated by its ability to approximate the hydration free energy (HFE) of the individual solutes in aqueous solution as well as in dilute aqueous mixtures of multiple solutes. For seven organic solutes: benzene, propane, acetaldehyde, methanol, formamide, acetate, and methylammonium, the average μex of the solutes and the water converged close to their respective HFEs in both 1 M standard state and dilute aqueous mixture systems. The oscillating-μex GCMC methodology is also able to drive solute sampling in proteins in aqueous environments as shown using the occluded binding pocket of the T4 lysozyme L99A mutant as a model system. The approach was shown to satisfactorily reproduce the free energy of binding of benzene as well as sample the functional group requirements of the occluded pocket consistent with the crystal structures of known ligands bound to the L99A mutant as well as their relative binding affinities.

[1]  Carl Caleman,et al.  Atomistic simulation of ion solvation in water explains surface preference of halides , 2011, Proceedings of the National Academy of Sciences.

[2]  Alexander D. MacKerell,et al.  Reproducing Crystal Binding Modes of Ligand Functional Groups Using Site-Identification by Ligand Competitive Saturation (SILCS) Simulations , 2011, J. Chem. Inf. Model..

[3]  A. Pohorille,et al.  Excess chemical potential of small solutes across water--membrane and water--hexane interfaces. , 1996, The Journal of chemical physics.

[4]  Matthew Clark,et al.  Grand Canonical Free-Energy Calculations of Protein-Ligand Binding , 2009, J. Chem. Inf. Model..

[5]  L. Fagni,et al.  Mont Sainte‐Odile: a sanctuary for GPCRs , 2003 .

[6]  Berk Hess,et al.  LINCS: A linear constraint solver for molecular simulations , 1997, J. Comput. Chem..

[7]  Hoover,et al.  Canonical dynamics: Equilibrium phase-space distributions. , 1985, Physical review. A, General physics.

[8]  B. Matthews,et al.  Energetic origins of specificity of ligand binding in an interior nonpolar cavity of T4 lysozyme. , 1995, Biochemistry.

[9]  Edward D Harder,et al.  On the origin of the electrostatic potential difference at a liquid-vacuum interface. , 2008, The Journal of chemical physics.

[10]  Mihaly Mezei,et al.  Grand-canonical ensemble Monte Carlo study of dense liquid Lennard-Jones, soft spheres and water , 1987 .

[11]  David L Mobley,et al.  Small molecule hydration free energies in explicit solvent: An extensive test of fixed-charge atomistic simulations. , 2009, Journal of chemical theory and computation.

[12]  Mihaly Mezei,et al.  Grand Canonical Monte Carlo Simulation of Water Positions in Crystal Hydrates , 1994 .

[13]  A. Palmer,et al.  Probing molecular motion by NMR. , 1997, Current opinion in structural biology.

[14]  Benoît Roux,et al.  Hydration of Amino Acid Side Chains: Nonpolar and Electrostatic Contributions Calculated from Staged Molecular Dynamics Free Energy Simulations with Explicit Water Molecules , 2004 .

[15]  K. Umesono,et al.  The nuclear receptor superfamily: The second decade , 1995, Cell.

[16]  Alexander D. MacKerell,et al.  Automation of the CHARMM General Force Field (CGenFF) II: Assignment of Bonded Parameters and Partial Atomic Charges , 2012, J. Chem. Inf. Model..

[17]  Alexander D. MacKerell,et al.  Inclusion of Multiple Fragment Types in the Site Identification by Ligand Competitive Saturation (SILCS) Approach , 2013, J. Chem. Inf. Model..

[18]  Alexander D. MacKerell,et al.  Automation of the CHARMM General Force Field (CGenFF) I: Bond Perception and Atom Typing , 2012, J. Chem. Inf. Model..

[19]  M. Eisenstein,et al.  Computational mapping of anchoring spots on protein surfaces. , 2010, Journal of molecular biology.

[20]  D. Beveridge,et al.  Grand canonical Monte Carlo simulations on aqueous solutions of sodium chloride and sodium DNA: excess chemical potentials and sources of nonideality in electrolyte and polyelectrolyte solutions , 1991 .

[21]  S. Nosé A molecular dynamics method for simulations in the canonical ensemble , 1984 .

[22]  J. P. Valleau,et al.  Electrical double layers. I. Monte Carlo study of a uniformly charged surface , 1980 .

[23]  T. Darden,et al.  Particle mesh Ewald: An N⋅log(N) method for Ewald sums in large systems , 1993 .

[24]  Katrina W Lexa,et al.  Full protein flexibility is essential for proper hot-spot mapping. , 2011, Journal of the American Chemical Society.

[25]  Matthew P. Repasky,et al.  Glide: a new approach for rapid, accurate docking and scoring. 1. Method and assessment of docking accuracy. , 2004, Journal of medicinal chemistry.

[26]  Bernard R. Brooks,et al.  New spherical‐cutoff methods for long‐range forces in macromolecular simulation , 1994, J. Comput. Chem..

[27]  D. Tronrud Conjugate-direction minimization: an improved method for the refinement of macromolecules. , 1992, Acta crystallographica. Section A, Foundations of crystallography.

[28]  K. Sharp Statistical Thermodynamics of Binding and Molecular Recognition Models , 2012 .

[29]  Mihaly Mezei,et al.  Automatic Control of Solvent Density in Grand Canonical Ensemble Monte Carlo Simulations. , 2006, Journal of chemical theory and computation.

[30]  Alexander D. MacKerell,et al.  Accurate Calculation of Hydration Free Energies using Pair-Specific Lennard-Jones Parameters in the CHARMM Drude Polarizable Force Field. , 2010, Journal of chemical theory and computation.

[31]  Alexander D. MacKerell,et al.  CHARMM general force field: A force field for drug‐like molecules compatible with the CHARMM all‐atom additive biological force fields , 2009, J. Comput. Chem..

[32]  M. Karplus,et al.  Functionality maps of binding sites: A multiple copy simultaneous search method , 1991, Proteins.

[33]  David L Mobley,et al.  Predicting ligand binding affinity with alchemical free energy methods in a polar model binding site. , 2009, Journal of molecular biology.

[34]  Mihaly Mezei,et al.  A cavity-biased (T, V, μ) Monte Carlo method for the computer simulation of fluids , 1980 .

[35]  M. Gilson,et al.  The statistical-thermodynamic basis for computation of binding affinities: a critical review. , 1997, Biophysical journal.

[36]  Jianpeng Ma,et al.  CHARMM: The biomolecular simulation program , 2009, J. Comput. Chem..

[37]  Alexander D. MacKerell,et al.  Impact of Ribosomal Modification on the Binding of the Antibiotic Telithromycin Using a Combined Grand Canonical Monte Carlo/Molecular Dynamics Simulation Approach , 2013, PLoS Comput. Biol..

[38]  B. Matthews,et al.  Response of a protein structure to cavity-creating mutations and its relation to the hydrophobic effect. , 1992, Science.

[39]  H. C. Andersen Molecular dynamics simulations at constant pressure and/or temperature , 1980 .

[40]  P. Charifson,et al.  Are free energy calculations useful in practice? A comparison with rapid scoring functions for the p38 MAP kinase protein system. , 2001, Journal of medicinal chemistry.

[41]  Alexander D. MacKerell,et al.  Computational Fragment-Based Binding Site Identification by Ligand Competitive Saturation , 2009, PLoS Comput. Biol..

[42]  K. Dill,et al.  Molecular driving forces : statistical thermodynamics in chemistry and biology , 2002 .

[43]  C. Brooks Computer simulation of liquids , 1989 .

[44]  B. Brooks,et al.  Constant pressure molecular dynamics simulation: The Langevin piston method , 1995 .

[45]  Matthew Clark,et al.  Grand Canonical Monte Carlo Simulation of Ligand-Protein Binding , 2006, J. Chem. Inf. Model..

[46]  Jaeeon Chang,et al.  The calculation of chemical potential of organic solutes in dense liquid phases by using expanded ensemble Monte Carlo simulations. , 2009, The Journal of chemical physics.

[47]  B. Matthews,et al.  Specificity of ligand binding in a buried nonpolar cavity of T4 lysozyme: linkage of dynamics and structural plasticity. , 1995, Biochemistry.

[48]  Alexander D. MacKerell,et al.  A polarizable model of water for molecular dynamics simulations of biomolecules , 2006 .

[49]  S. Nosé A molecular dynamics method for simulations in the canonical ensemble , 2002 .

[50]  G. Ciccotti,et al.  Numerical Integration of the Cartesian Equations of Motion of a System with Constraints: Molecular Dynamics of n-Alkanes , 1977 .

[51]  Alexander D. MacKerell,et al.  Rapid estimation of hydration thermodynamics of macromolecular regions. , 2013, The Journal of chemical physics.

[52]  V. Helms,et al.  Transient pockets on protein surfaces involved in protein-protein interaction. , 2007, Journal of medicinal chemistry.

[53]  Benoît Roux,et al.  Grand canonical Monte Carlo simulations of water in protein environments. , 2004, The Journal of chemical physics.

[54]  Peter A. Kollman,et al.  FREE ENERGY CALCULATIONS : APPLICATIONS TO CHEMICAL AND BIOCHEMICAL PHENOMENA , 1993 .

[55]  Alexander D. MacKerell,et al.  Balancing target flexibility and target denaturation in computational fragment‐based inhibitor discovery , 2012, J. Comput. Chem..

[56]  M. Levitt,et al.  Refinement of protein conformations using a macromolecular energy minimization procedure. , 1969, Journal of molecular biology.

[57]  Carsten Kutzner,et al.  GROMACS 4:  Algorithms for Highly Efficient, Load-Balanced, and Scalable Molecular Simulation. , 2008, Journal of chemical theory and computation.

[58]  I. Bahar,et al.  Druggability Assessment of Allosteric Proteins by Dynamics Simulations in the Presence of Probe Molecules. , 2013, Journal of chemical theory and computation.

[59]  Benoît Roux,et al.  Calculation of Standard Binding Free Energies:  Aromatic Molecules in the T4 Lysozyme L99A Mutant. , 2006, Journal of chemical theory and computation.

[60]  W. L. Jorgensen,et al.  Comparison of simple potential functions for simulating liquid water , 1983 .

[61]  M. Ruiz‐López,et al.  Molecular dynamics simulation of formamide in water using density functional theory and classical potentials , 1999 .

[62]  Mihaly Mezei,et al.  Simulated Annealing of Chemical Potential: A General Procedure for Locating Bound Waters. Application to the Study of the Differential Hydration Propensities of the Major and Minor Grooves of DNA , 1996 .

[63]  L. Kay,et al.  Slow internal dynamics in proteins: application of NMR relaxation dispersion spectroscopy to methyl groups in a cavity mutant of T4 lysozyme. , 2002, Journal of the American Chemical Society.

[64]  F. Guarnieri,et al.  Diverse fragment clustering and water exclusion identify protein hot spots. , 2011, Journal of the American Chemical Society.

[65]  W. L. Jorgensen,et al.  Monte Carlo simulation of differences in free energies of hydration , 1985 .