Revisiting the General Solubility Equation: In Silico Prediction of Aqueous Solubility Incorporating the Effect of Topographical Polar Surface Area

The General Solubility Equation (GSE) is a QSPR model based on the melting point and log P of a chemical substance. It is used to predict the aqueous solubility of nonionizable chemical compounds. However, its reliance on experimentally derived descriptors, particularly melting point, limits its applicability to virtual compounds. The studies presented show that the GSE is able to predict, to within 1 log unit, the experimental aqueous solubility (log S) for 81% of the compounds in a data set of 1265 diverse chemical structures (-8.48 < log S < 1.58). However, the predictive ability of the GSE is reduced to 75% when applied to a subset of the data (1160 compounds -6.00 < log S < 0.00), which discounts those compounds occupying the sparsely populated regions of data space. This highlights how sparsely populated extremities of data sets can significantly skew results for linear regression-based models. Replacing the melting point descriptor of the GSE with a descriptor which accounts for topographical polar surface area (TPSA) produces a model of comparable quality to the GSE (the solubility of 81% of compounds in the full data set predicted accurately). As such, we propose an alternative simple model for predicting aqueous solubility which replaces the melting point descriptor of the GSE with TPSA and hence can be applied to virtual compounds. In addition, incorporating TPSA into the GSE in addition to log P and melting point gives a three descriptor model that improves accurate prediction of aqueous solubility over the GSE by 5.1% for the full and 6.6% for the reduced data set, respectively.

[1]  S. Yalkowsky,et al.  Estimation of the aqueous solubility I: application to organic nonelectrolytes. , 2001, Journal of pharmaceutical sciences.

[2]  J. Dearden,et al.  How not to develop a quantitative structure–activity or structure–property relationship (QSAR/QSPR) , 2009, SAR and QSAR in environmental research.

[3]  Junmei Wang,et al.  Recent advances on aqueous solubility prediction. , 2011, Combinatorial chemistry & high throughput screening.

[4]  D. E. Clark,et al.  Rapid calculation of polar molecular surface area and its application to the prediction of transport phenomena. 2. Prediction of blood-brain barrier penetration. , 1999, Journal of pharmaceutical sciences.

[5]  F. Lombardo,et al.  Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings , 1997 .

[6]  Kristina Luthman,et al.  Polar Molecular Surface Properties Predict the Intestinal Absorption of Drugs in Humans , 1997, Pharmaceutical Research.

[7]  A. Beresford,et al.  The emerging importance of predictive ADME simulation in drug discovery. , 2002, Drug discovery today.

[8]  J. Dearden In silico prediction of aqueous solubility , 2006, Expert opinion on drug discovery.

[9]  Samuel H Yalkowsky,et al.  Prediction of aqueous solubility from SCRATCH. , 2010, International journal of pharmaceutics.

[10]  T.W. Schultz,et al.  Selection of data sets for qsars: Analyses of tetrahymena toxicity from aromatic compounds , 2003, SAR and QSAR in environmental research.

[11]  S. Yalkowsky,et al.  Estimation of aqueous solubility of organic compounds by using the general solubility equation. , 2002, Chemosphere.

[12]  Singh,et al.  Quantitative structure-property relationships in pharmaceutical research - Part 1. , 2000, Pharmaceutical science & technology today.

[13]  C. Hansch,et al.  Linear free-energy relationship between partition coefficients and the aqueous solubility of organic liquids , 1968 .

[14]  P. Selzer,et al.  Fast calculation of molecular polar surface area as a sum of fragment-based contributions and its application to the prediction of drug transport properties. , 2000, Journal of medicinal chemistry.

[15]  J. Delaney Predicting aqueous solubility from structure. , 2005, Drug discovery today.

[16]  P. Ertl,et al.  Computational approaches to determine drug solubility. , 2007, Advanced drug delivery reviews.

[17]  A. Kaplan,et al.  A Beginner's Guide to Partial Least Squares Analysis , 2004 .

[18]  Grover,et al.  Quantitative structure-property relationships in pharmaceutical research - Part 2. , 2000, Pharmaceutical science & technology today.