The SAMPL5 challenge for embedded-cluster integral equation theory: solvation free energies, aqueous pKa, and cyclohexane–water log D

We predict cyclohexane–water distribution coefficients (log D7.4) for drug-like molecules taken from the SAMPL5 blind prediction challenge by the “embedded cluster reference interaction site model” (EC-RISM) integral equation theory. This task involves the coupled problem of predicting both partition coefficients (log P) of neutral species between the solvents and aqueous acidity constants (pKa) in order to account for a change of protonation states. The first issue is addressed by calibrating an EC-RISM-based model for solvation free energies derived from the “Minnesota Solvation Database” (MNSOL) for both water and cyclohexane utilizing a correction based on the partial molar volume, yielding a root mean square error (RMSE) of 2.4 kcal mol−1 for water and 0.8–0.9 kcal mol−1 for cyclohexane depending on the parametrization. The second one is treated by employing on one hand an empirical pKa model (MoKa) and, on the other hand, an EC-RISM-derived regression of published acidity constants (RMSE of 1.5 for a single model covering acids and bases). In total, at most 8 adjustable parameters are necessary (2–3 for each solvent and two for the pKa) for training solvation and acidity models. Applying the final models to the log D7.4 dataset corresponds to evaluating an independent test set comprising other, composite observables, yielding, for different cyclohexane parametrizations, 2.0–2.1 for the RMSE with the first and 2.2–2.8 with the combined first and second SAMPL5 data set batches. Notably, a pure log P model (assuming neutral species only) performs statistically similarly for these particular compounds. The nature of the approximations and possible perspectives for future developments are discussed.

[1]  Jacopo Tomasi,et al.  Continuum solvation models: A new approach to the problem of solute’s charge distribution and cavity boundaries , 1997 .

[2]  L. E. Chirlian,et al.  Atomic charges derived from electrostatic potentials: A detailed study , 1987 .

[3]  W. Guida,et al.  Accurate Prediction of Acidity Constants in Aqueous Solution via Density Functional Theory and Self-Consistent Reaction Field Methods , 2002 .

[4]  C. Cramer,et al.  Adding explicit solvent molecules to continuum solvent calculations for the calculation of aqueous acid dissociation constants. , 2006, The journal of physical chemistry. A.

[5]  Loriano Storchi,et al.  New and Original pKa Prediction Method Using Grid Molecular Interaction Fields , 2007, J. Chem. Inf. Model..

[6]  David S. Palmer,et al.  Solvation thermodynamics of organic molecules by the molecular integral equation theory: approaching chemical accuracy. , 2015, Chemical reviews.

[7]  Jochen Heil,et al.  Quantum chemistry in solution by combining 3D integral equation theory with a cluster embedding approach. , 2008, The journal of physical chemistry. B.

[8]  Stefan M. Kast,et al.  Prediction of tautomer ratios by embedded-cluster integral equation theory , 2010, J. Comput. Aided Mol. Des..

[9]  D. Marx,et al.  Design principles for high-pressure force fields: Aqueous TMAO solutions from ambient to kilobar pressures. , 2016, The Journal of chemical physics.

[10]  Donald G Truhlar,et al.  Generalized Born Solvation Model SM12. , 2013, Journal of chemical theory and computation.

[11]  Donald G Truhlar,et al.  SM6:  A Density Functional Theory Continuum Solvation Model for Calculating Aqueous Solvation Free Energies of Neutrals, Ions, and Solute-Water Clusters. , 2005, Journal of chemical theory and computation.

[12]  Andreas Klamt,et al.  Towards a first principles prediction of pK a: COSMO-RS and the cluster-continuum approach , 2010 .

[13]  Andriy Kovalenko,et al.  SAMPL5: 3D-RISM partition coefficient calculations with partial molar volume corrections and solute conformational sampling , 2016, Journal of Computer-Aided Molecular Design.

[14]  Stefan M. Kast,et al.  Solvation effects on chemical shifts by embedded cluster integral equation theory. , 2014, The journal of physical chemistry. A.

[15]  Andreas Klamt,et al.  First Principles Calculations of Aqueous pKa Values for Organic and Inorganic Acids Using COSMO-RS Reveal an Inconsistency in the Slope of the pKa Scale. , 2003, The journal of physical chemistry. A.

[16]  Benoît Roux,et al.  An Integral Equation To Describe the Solvation of Polar Molecules in Liquid Water , 1997 .

[17]  Andrew T. Fenley,et al.  Analytical electrostatics for biomolecules: beyond the generalized Born approximation. , 2006, The Journal of chemical physics.

[18]  C. Cramer,et al.  Universal solvation model based on solute electron density and on a continuum model of the solvent defined by the bulk dielectric constant and atomic surface tensions. , 2009, The journal of physical chemistry. B.

[19]  Junmei Wang,et al.  Development and testing of a general amber force field , 2004, J. Comput. Chem..

[20]  S. Kast,et al.  Three-Dimensional RISM Integral Equation Theory for Polarizable Solute Models. , 2013, Journal of chemical theory and computation.

[21]  Fumio Hirata,et al.  Ab initio study of water. II. Liquid structure, electronic and thermodynamic properties over a wide range of temperature and density , 1999 .

[22]  S. Kast,et al.  Acidity in DMSO from the embedded cluster integral equation quantum solvation model , 2014, Journal of Molecular Modeling.

[23]  B. Montgomery Pettitt,et al.  A site-site theory for finite concentration saline solutions , 1992 .

[24]  F. Hirata,et al.  Three-dimensional density profiles of water in contact with a solute of arbitrary shape: a RISM approach , 1998 .

[25]  Hirofumi Sato A modern solvation theory: quantum chemistry and statistical chemistry. , 2013, Physical chemistry chemical physics : PCCP.

[26]  Jochen Heil,et al.  3D RISM theory with fast reciprocal-space electrostatics. , 2015, The Journal of chemical physics.

[27]  M. Levesque,et al.  Solvation free-energy pressure corrections in the three dimensional reference interaction site model. , 2015, The Journal of chemical physics.

[28]  S. Ten-no,et al.  Ab initio study of water: self-consistent determination of electronic structure and liquid state properties , 1997 .

[29]  H. Kalbitzer,et al.  The Chemical Shift Baseline for High-Pressure NMR Spectra of Proteins. , 2016, Angewandte Chemie.

[30]  B. Montgomery Pettitt,et al.  A dielectrically consistent interaction site theory for solvent—electrolyte mixtures , 1992 .

[31]  Andreas Klamt,et al.  Accurate prediction of basicity in aqueous solution with COSMO‐RS , 2006, J. Comput. Chem..

[32]  Wilfred F. van Gunsteren,et al.  An improved GROMOS96 force field for aliphatic hydrocarbons in the condensed phase , 2001, J. Comput. Chem..

[33]  David S. Palmer,et al.  Hydration Free Energies of Molecular Ions from Theory and Simulation. , 2016, The journal of physical chemistry. B.

[34]  S. Kast,et al.  Integral equation theory for correcting truncation errors in molecular simulations , 2003 .

[35]  T. Straatsma,et al.  THE MISSING TERM IN EFFECTIVE PAIR POTENTIALS , 1987 .

[36]  S. Kast,et al.  Structure and thermodynamics of nondipolar molecular liquids and solutions from integral equation theory , 2016 .

[37]  S. Kast,et al.  Closed-form expressions of the chemical potential for integral equation closures with certain bridge functions. , 2008, The Journal of chemical physics.

[38]  C. Cramer,et al.  Self-Consistent Reaction Field Model for Aqueous and Nonaqueous Solutions Based on Accurate Polarized Partial Charges. , 2007, Journal of chemical theory and computation.

[39]  Carsten Kutzner,et al.  GROMACS 4:  Algorithms for Highly Efficient, Load-Balanced, and Scalable Molecular Simulation. , 2008, Journal of chemical theory and computation.