Identifying low variance pathways for free energy calculations of molecular transformations in solution phase.

Improving the efficiency of free energy calculations is important for many biological and materials design applications, such as protein-ligand binding affinities in drug design, partitioning between immiscible liquids, and determining molecular association in soft materials. We show that for any pair potential, moderately accurate estimation of the radial distribution function for a solute molecule is sufficient to accurately estimate the statistical variance of a sampling along a free energy pathway. This allows inexpensive analytical identification of low statistical error free energy pathways. We employ a variety of methods to estimate the radial distribution function (RDF) and find that the computationally cheap two-body "dilute gas" limit performs as well or better than 3D-RISM theory and other approximations for identifying low variance free energy pathways. With a RDF estimate in hand, we can search for pairwise interaction potentials that produce low variance. We give an example of a search minimizing statistical variance of solvation free energy over the entire parameter space of a generalized "soft core" potential. The free energy pathway arising from this optimization procedure has lower curvature in the variance and reduces the total variance by at least 50% compared to the traditional soft core solvation pathway. We also demonstrate that this optimized pathway allows free energies to be estimated with fewer intermediate states due to its low curvature. This free energy variance optimization technique is generalizable to solvation in any homogeneous fluid and for any type of pairwise potential and can be performed in minutes to hours, depending on the method used to estimate g(r).

[1]  T. Straatsma,et al.  Separation‐shifted scaling, a new scaling method for Lennard‐Jones interactions in thermodynamic integration , 1994 .

[2]  M. Rami Reddy,et al.  Free energy calculations in rational drug design , 2001 .

[3]  Michael R. Shirts,et al.  Comparison of efficiency and bias of free energies computed by exponential averaging, the Bennett acceptance ratio, and thermodynamic integration. , 2005, The Journal of chemical physics.

[4]  H. C. Andersen Rattle: A “velocity” version of the shake algorithm for molecular dynamics calculations , 1983 .

[5]  P. Kollman,et al.  Settle: An analytical version of the SHAKE and RATTLE algorithm for rigid water models , 1992 .

[6]  Ruhong Zhou,et al.  Hydrophobic aided replica exchange: an efficient algorithm for protein folding in explicit solvent. , 2006, The journal of physical chemistry. B.

[7]  B. Roux,et al.  Computations of standard binding free energies with molecular dynamics simulations. , 2009, The journal of physical chemistry. B.

[8]  Jed W. Pitera,et al.  A Comparison of Non-Bonded Scaling Approaches for Free Energy Calculations , 2002 .

[9]  Ulrich H E Hansmann,et al.  Efficient sampling of protein structures by model hopping. , 2005, Physical review letters.

[10]  A. Mark,et al.  Avoiding singularities and numerical instabilities in free energy calculations based on molecular simulations , 1994 .

[11]  David L Mobley,et al.  Small molecule hydration free energies in explicit solvent: An extensive test of fixed-charge atomistic simulations. , 2009, Journal of chemical theory and computation.

[12]  Wilfred F van Gunsteren,et al.  Multiple free energies from a single simulation: extending enveloping distribution sampling to nonoverlapping phase-space distributions. , 2008, The Journal of chemical physics.

[13]  B. Montgomery Pettitt,et al.  A site-site theory for finite concentration saline solutions , 1992 .

[14]  Charles H. Bennett,et al.  Efficient estimation of free energy differences from Monte Carlo data , 1976 .

[15]  Anthony Nicholls,et al.  SAMPL2 challenge: prediction of solvation energies and tautomer ratios , 2010, J. Comput. Aided Mol. Des..

[16]  Thomas Simonson,et al.  Free energy of particle insertion , 1993 .

[17]  Xiao-Li Meng,et al.  Simulating Normalizing Constants: From Importance Sampling to Bridge Sampling to Path Sampling , 1998 .

[18]  Fumio Hirata,et al.  Ab initio study of water. II. Liquid structure, electronic and thermodynamic properties over a wide range of temperature and density , 1999 .

[19]  Fernando A Escobedo,et al.  Variance minimization of free energy estimates from optimized expanded ensembles. , 2008, The journal of physical chemistry. B.

[20]  Michael P Eastwood,et al.  Minimizing thermodynamic length to select intermediate states for free-energy calculations and replica-exchange simulations. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[21]  A. Lyubartsev,et al.  New approach to Monte Carlo calculation of the free energy: Method of expanded ensembles , 1992 .

[22]  Mark E. Tuckerman,et al.  Measure-preserving integrators for molecular dynamics in the isothermal–isobaric ensemble derived from the Liouville operator , 2010 .

[23]  Ken A Dill,et al.  Use of the Weighted Histogram Analysis Method for the Analysis of Simulated and Parallel Tempering Simulations. , 2007, Journal of chemical theory and computation.

[24]  F. Hirata,et al.  Three-dimensional density profiles of water in contact with a solute of arbitrary shape: a RISM approach , 1998 .

[25]  S. Takada,et al.  On the Hamiltonian replica exchange method for efficient sampling of biomolecular systems: Application to protein structure prediction , 2002 .

[26]  Benoît Roux,et al.  Computation of binding free energy with molecular dynamics and grand canonical Monte Carlo simulations. , 2008, The Journal of chemical physics.

[27]  Kenneth M. Merz,et al.  Drug Design : Structure-and Ligand-Based Approaches , 2017 .

[28]  Wilfred F van Gunsteren,et al.  Enveloping distribution sampling: a method to calculate free energy differences from a single simulation. , 2007, The Journal of chemical physics.

[29]  B. Montgomery Pettitt,et al.  Integral equation predictions of liquid state structure for waterlike intermolecular potentials , 1982 .

[30]  Jürgen Bajorath,et al.  Integration of virtual and high-throughput screening , 2002, Nature Reviews Drug Discovery.

[31]  Michael R. Shirts,et al.  Statistically optimal analysis of samples from multiple equilibrium states. , 2008, The Journal of chemical physics.

[32]  Seokmin Shin,et al.  Replica-exchange method using the generalized effective potential. , 2003, Physical review letters.

[33]  Benoît Roux,et al.  An Integral Equation To Describe the Solvation of Polar Molecules in Liquid Water , 1997 .

[34]  Michael R. Shirts,et al.  Solvation free energies of amino acid side chain analogs for common molecular mechanics water models. , 2005, The Journal of chemical physics.

[35]  David L Mobley,et al.  Nonlinear scaling schemes for Lennard-Jones interactions in free energy calculations. , 2007, The Journal of chemical physics.

[36]  R. Swendsen,et al.  THE weighted histogram analysis method for free‐energy calculations on biomolecules. I. The method , 1992 .

[37]  Wilfred F. van Gunsteren,et al.  Basic ingredients of free energy calculations: A review , 2009, J. Comput. Chem..

[38]  G. Parisi,et al.  Simulated tempering: a new Monte Carlo scheme , 1992, hep-lat/9205018.

[39]  Carlos Simmerling,et al.  Three-dimensional molecular theory of solvation coupled with molecular dynamics in Amber. , 2010, Journal of chemical theory and computation.

[40]  J. Essex,et al.  Hit identification and binding mode predictions by rigorous free energy simulations. , 2008, Journal of medicinal chemistry.

[41]  Ruhong Zhou,et al.  Replica exchange with solute tempering: efficiency in large scale systems. , 2007, The journal of physical chemistry. B.

[42]  D. Henderson Exact and approximate values for the radial distribution function at low densities for the 6:12 potential , 1965 .

[43]  Michael R. Shirts,et al.  Extremely precise free energy calculations of amino acid side chain analogs: Comparison of common molecular mechanics force fields for proteins , 2003 .

[44]  B. Montgomery Pettitt,et al.  A dielectrically consistent interaction site theory for solvent—electrolyte mixtures , 1992 .

[45]  Wilfred F van Gunsteren,et al.  Simple, Efficient, and Reliable Computation of Multiple Free Energy Differences from a Single Simulation: A Reference Hamiltonian Parameter Update Scheme for Enveloping Distribution Sampling (EDS). , 2009, Journal of chemical theory and computation.

[46]  Arnaud Blondel,et al.  Ensemble variance in free energy calculations by thermodynamic integration: Theory, optimal “Alchemical” path, and practical solutions , 2004, J. Comput. Chem..

[47]  Wei Yang,et al.  Simulated scaling method for localized enhanced sampling and simultaneous "alchemical" free energy simulations: a general method for molecular mechanical, quantum mechanical, and quantum mechanical/molecular mechanical simulations. , 2007, The Journal of chemical physics.

[48]  David L Mobley,et al.  Predicting small-molecule solvation free energies: an informal blind test for computational chemistry. , 2008, Journal of medicinal chemistry.

[49]  D. Beveridge,et al.  Free energy via molecular simulation: applications to chemical and biomolecular systems. , 1989, Annual review of biophysics and biophysical chemistry.

[50]  M. Tuckerman,et al.  A Liouville-operator derived measure-preserving integrator for molecular dynamics simulations in the isothermal–isobaric ensemble , 2006 .

[51]  Y. Sugita,et al.  Multidimensional replica-exchange method for free-energy calculations , 2000, cond-mat/0009120.

[52]  Shankar Kumar,et al.  Multidimensional free‐energy calculations using the weighted histogram analysis method , 1995, J. Comput. Chem..

[53]  W. C. Swope,et al.  A computer simulation method for the calculation of equilibrium constants for the formation of physi , 1981 .