Fast and General Method To Predict the Physicochemical Properties of Druglike Molecules Using the Integral Equation Theory of Molecular Liquids.

We report a method to predict physicochemical properties of druglike molecules using a classical statistical mechanics based solvent model combined with machine learning. The RISM-MOL-INF method introduced here provides an accurate technique to characterize solvation and desolvation processes based on solute-solvent correlation functions computed by the 1D reference interaction site model of the integral equation theory of molecular liquids. These functions can be obtained in a matter of minutes for most small organic and druglike molecules using existing software (RISM-MOL) (Sergiievskyi, V. P.; Hackbusch, W.; Fedorov, M. V. J. Comput. Chem. 2011, 32, 1982-1992). Predictions of caco-2 cell permeability and hydration free energy obtained using the RISM-MOL-INF method are shown to be more accurate than the state-of-the-art tools for benchmark data sets. Due to the importance of solvation and desolvation effects in biological systems, it is anticipated that the RISM-MOL-INF approach will find many applications in biophysical and biomedical property prediction.

[1]  Fumio Hirata,et al.  Hydration free energy of hydrophobic solutes studied by a reference interaction site model with a repulsive bridge correction and a thermodynamic perturbation method , 2000 .

[2]  Maxim V. Fedorov,et al.  Wavelet algorithm for solving integral equations of molecular liquids. A test for the reference interaction site model , 2004, J. Comput. Chem..

[3]  Jason Crain,et al.  Improved estimates for hydration free energy obtained by the reference interaction site model , 2007 .

[4]  Yuko Okamoto,et al.  Calculation of solvation free energy using RISM theory for peptide in salt solution , 1998, J. Comput. Chem..

[5]  W. Shiu,et al.  Determination of air-water Henry's law constants for hydrophobic pollutants , 1979 .

[6]  Miguel Jorge,et al.  1-Octanol/Water Partition Coefficients of n-Alkanes from Molecular Simulations of Absolute Solvation Free Energies. , 2009, Journal of chemical theory and computation.

[7]  Andriy Kovalenko,et al.  Calculation of local water densities in biological systems: a comparison of molecular dynamics simulations and the 3D-RISM-KH molecular theory of solvation. , 2011, The journal of physical chemistry. B.

[8]  Mark S. Gordon,et al.  General atomic and molecular electronic structure system , 1993, J. Comput. Chem..

[9]  Donald G Truhlar,et al.  Performance of SM6, SM8, and SMD on the SAMPL1 test set for the prediction of small-molecule solvation free energies. , 2009, The journal of physical chemistry. B.

[10]  David A. Freedman,et al.  Statistical Models: Theory and Practice: References , 2005 .

[11]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[12]  Yuko Okamoto,et al.  Calculation of hydration free energy for a solute with many atomic sites using the RISM theory: A robust and efficient algorithm , 1997 .

[13]  G. K. Dedzo,et al.  Molecule–Surface Recognition between Heterocyclic Aromatic Compounds and Kaolinite in Toluene Investigated by Molecular Theory of Solvation and Thermodynamic and Kinetic Experiments , 2014 .

[14]  Y. Abramov Major Source of Error in QSPR Prediction of Intrinsic Thermodynamic Solubility of Drugs: Solid vs Nonsolid State Contributions? , 2015, Molecular pharmaceutics.

[15]  G. Chuev,et al.  Extraction of atom–atom bridge and direct correlation functions from molecular simulations: A test for ambient water , 2013 .

[16]  Maxim V Fedorov,et al.  An accurate prediction of hydration free energies by combination of molecular integral equations theory with structural descriptors. , 2010, The journal of physical chemistry. B.

[17]  N. Matubayasi,et al.  An approach to the solvation free energy in terms of the distribution functions of the solute–solvent interaction energy , 2005 .

[18]  David Chandler,et al.  Optimized Cluster Expansions for Classical Fluids. II. Theory of Molecular Liquids , 1972 .

[19]  T. Straatsma,et al.  THE MISSING TERM IN EFFECTIVE PAIR POTENTIALS , 1987 .

[20]  Rodrigo L. Silveira,et al.  Supramolecular Interactions in Secondary Plant Cell Walls: Effect of Lignin Chemical Composition Revealed with the Molecular Theory of Solvation. , 2015, The journal of physical chemistry letters.

[21]  F. Lombardo,et al.  Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings. , 2001, Advanced drug delivery reviews.

[22]  Maxim V Fedorov,et al.  Hydration thermodynamics using the reference interaction site model: speed or accuracy? , 2011, The journal of physical chemistry. B.

[23]  F. Hirata,et al.  Theoretical study for volume changes associated with the helix-coil transition of peptides. , 2001, Biopolymers.

[24]  P. Artursson,et al.  Correlation between oral drug absorption in humans and apparent drug permeability coefficients in human intestinal epithelial (Caco-2) cells. , 1991, Biochemical and biophysical research communications.

[25]  W. L. Jorgensen,et al.  Development and Testing of the OPLS All-Atom Force Field on Conformational Energetics and Properties of Organic Liquids , 1996 .

[26]  Fumio Hirata,et al.  Potential of Mean Force between Two Molecular Ions in a Polar Molecular Solvent: A Study by the Three-Dimensional Reference Interaction Site Model , 1999 .

[27]  Maxim V. Fedorov,et al.  Multigrid solver for the reference interaction site model of molecular liquids theory , 2011, J. Comput. Chem..

[28]  R. Conradi,et al.  Caco-2 Cell Monolayers as a Model for Drug Transport Across the Intestinal Mucosa , 1990, Pharmaceutical Research.

[29]  G. Maggiora,et al.  Solvation thermodynamics of polar molecules in aqueous solution by the XRISM method , 1993 .

[30]  I. R. Mcdonald,et al.  Theory of simple liquids , 1998 .

[31]  D. Chandler,et al.  Excess electrons in simple fluids. I. General equilibrium theory for classical hard sphere solvents , 1984 .

[32]  Robert C. Glen,et al.  Random Forest Models To Predict Aqueous Solubility , 2007, J. Chem. Inf. Model..

[33]  Jan H. Jensen,et al.  Prediction and rationalization of protein pKa values using QM and QM/MM methods. , 2005, The journal of physical chemistry. A.

[34]  John B. O. Mitchell,et al.  Is experimental data quality the limiting factor in predicting the aqueous solubility of druglike molecules? , 2014, Molecular pharmaceutics.

[35]  Seiichiro Ten-no,et al.  Free energy of solvation for the reference interaction site model: Critical comparison of expressions , 2001 .

[36]  Maxim V Fedorov,et al.  Accurate calculations of the hydration free energies of druglike molecules using the reference interaction site model. , 2010, The Journal of chemical physics.

[37]  B. Montgomery Pettitt,et al.  A dielectrically consistent interaction site theory for solvent—electrolyte mixtures , 1992 .

[38]  Bernard Pettitt,et al.  A Site‐Site Theory for Finite Concentration Saline Solutions. , 1993 .

[39]  Yuichi Harano,et al.  Theoretical study for partial molar volume of amino acids and polypeptides by the three-dimensional reference interaction site model , 2001 .

[40]  Maxim V Fedorov,et al.  Combination of RISM and Cheminformatics for Efficient Predictions of Hydration Free Energy of Polyfragment Molecules: Application to a Set of Organic Pollutants. , 2011, Journal of chemical theory and computation.

[41]  Tingjun Hou,et al.  ADME evaluation in drug discovery , 2002, Journal of molecular modeling.

[42]  M. Fedorov,et al.  3DRISM Multigrid Algorithm for Fast Solvation Free Energy Calculations. , 2012, Journal of chemical theory and computation.

[43]  Tingjun Hou,et al.  ADME Evaluation in Drug Discovery. 5. Correlation of Caco-2 Permeation with Simple Molecular Properties , 2004, J. Chem. Inf. Model..

[44]  Fumio Hirata,et al.  Ligand mapping on protein surfaces by the 3D-RISM theory: toward computational fragment-based drug design. , 2009, Journal of the American Chemical Society.

[45]  John B. O. Mitchell,et al.  Simultaneous feature selection and parameter optimisation using an artificial ant colony: case study of melting point prediction , 2008, Chemistry Central journal.

[46]  D. Blankschtein,et al.  Liquid-state theory of hydrocarbon-water systems: application to methane, ethane, and propane , 1992 .

[47]  F. Hirata,et al.  Salt Effect on Stability and Solvation Structure of Peptide: An Integral Equation Study. , 2000 .

[48]  John B. O. Mitchell,et al.  First-Principles Calculation of the Intrinsic Aqueous Solubility of Crystalline Druglike Molecules. , 2012, Journal of chemical theory and computation.

[49]  Yanli Wang,et al.  PubChem: Integrated Platform of Small Molecules and Biological Activities , 2008 .

[50]  Seiichiro Ten-no,et al.  Comparative study on solvation free energy expressions in reference interaction site model integral equation theory. , 2005, The journal of physical chemistry. B.

[51]  A. Kovalenko,et al.  Octanol-Water Partition Coefficient from 3D-RISM-KH Molecular Theory of Solvation with Partial Molar Volume Correction. , 2015, The journal of physical chemistry. B.

[52]  J. Andrew Grant,et al.  SAMPL2 and continuum modeling , 2010, J. Comput. Aided Mol. Des..

[53]  Stefan M. Kast,et al.  Prediction of tautomer ratios by embedded-cluster integral equation theory , 2010, J. Comput. Aided Mol. Des..

[54]  Fumio Hirata,et al.  Self-consistent description of a metal–water interface by the Kohn–Sham density functional theory and the three-dimensional reference interaction site model , 1999 .

[55]  A. Kovalenko,et al.  Spatial decomposition of solvation free energy based on the 3D integral equation theory of molecular liquid: application to miniproteins. , 2011, The journal of physical chemistry. B.

[56]  Fumio Hirata,et al.  Molecular Theory of Solvation , 2004 .

[57]  Maxim V Fedorov,et al.  Towards a universal method for calculating hydration free energies: a 3D reference interaction site model with partial molar volume correction , 2010, Journal of physics. Condensed matter : an Institute of Physics journal.

[58]  Arieh Ben-Naim,et al.  Solvation thermodynamics of nonionic solutes , 1984 .

[59]  Peter Dalgaard,et al.  R Development Core Team (2010): R: A language and environment for statistical computing , 2010 .

[60]  Fumio Hirata,et al.  Water molecules in a protein cavity detected by a statistical-mechanical theory. , 2005, Journal of the American Chemical Society.

[61]  David S. Palmer,et al.  In silico screening of bioactive and biomimetic solutes using Integral Equation Theory. , 2011, Current pharmaceutical design.

[62]  J. Kirkwood,et al.  The Statistical Mechanical Theory of Solutions. I , 1951 .

[63]  W. L. Jorgensen The Many Roles of Computation in Drug Discovery , 2004, Science.

[64]  N. Matubayasi Free-energy analysis of solvation with the method of energy representation. , 2009, Frontiers in bioscience.

[65]  David S. Palmer,et al.  Solvent Binding Analysis and Computational Alanine Scanning of the Bovine Chymosin-Bovine κ-Casein Complex Using Molecular Integral Equation Theory. , 2013, Journal of chemical theory and computation.

[66]  John B. O. Mitchell,et al.  Predicting intrinsic aqueous solubility by a thermodynamic cycle. , 2008, Molecular pharmaceutics.

[67]  David Chandler,et al.  Free energy functions in the extended RISM approximation , 1985 .

[68]  Vicente Romero Zaldivar,et al.  Total and Local Quadratic Indices of the “Molecular Pseudograph’s Atom Adjacency Matrix”. Application to Prediction of Caco-2 Permeability of Drugs , 2003 .

[69]  Jean-François Truchon,et al.  Predictions of hydration free energies from continuum solvent with solute polarizable models: the SAMPL2 blind challenge , 2010, J. Comput. Aided Mol. Des..

[70]  A. Bauer-Brandl,et al.  Thermodynamics of solubility, sublimation and solvation processes of parabens. , 2005, European journal of pharmaceutical sciences : official journal of the European Federation for Pharmaceutical Sciences.

[71]  Ron Wehrens,et al.  The pls Package: Principal Component and Partial Least Squares Regression in R , 2007 .

[72]  Fumio Hirata,et al.  Combination of molecular dynamics method and 3D‐RISM theory for conformational sampling of large flexible molecules in solution , 2008, J. Comput. Chem..

[73]  Fumio Hirata,et al.  An extended rism equation for molecular polar fluids , 1981 .

[74]  Florian Nigsch,et al.  Why Are Some Properties More Difficult To Predict than Others? A Study of QSPR Models of Solubility, Melting Point, and Log P , 2008, J. Chem. Inf. Model..

[75]  Maxim V Fedorov,et al.  Toward a universal model to calculate the solvation thermodynamics of druglike molecules: the importance of new experimental databases. , 2011, Molecular pharmaceutics.

[76]  R. Docherty,et al.  Low solubility in drug development: de‐convoluting the relative importance of solvation and crystal packing , 2015, The Journal of pharmacy and pharmacology.

[77]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .

[78]  David S. Palmer,et al.  Solvation thermodynamics of organic molecules by the molecular integral equation theory: approaching chemical accuracy. , 2015, Chemical reviews.

[79]  Rodrigo L. Silveira,et al.  Plant biomass recalcitrance: effect of hemicellulose composition on nanoscale forces that control cell wall strength. , 2013, Journal of the American Chemical Society.

[80]  Scott Boyer,et al.  Choosing Feature Selection and Learning Algorithms in QSAR , 2014, J. Chem. Inf. Model..

[81]  C. Brooks Computer simulation of liquids , 1989 .