Prediction of drug solubility from molecular structure using a drug-like training set

Using a training set of 191 drug-like compounds extracted from the AQUASOL database, a quantitative structure-property relationship (QSPR) study was conducted employing a set of simple structural and physicochemical properties to predict aqueous solubility. The resultant regression model comprised five parameters (ClogP, molecular weight, an indicator variable for aliphatic amine groups, number of rotatable bonds and number of aromatic rings) and demonstrated acceptable statistics (r² = 0.87, s = 0.51, F = 243.6, n = 191). The model was applied to two test sets: a set of drug-like compounds (r² = 0.80, s = 0.68, n = 174) and a set of agrochemicals (r² = 0.88, s = 0.65, n = 200). Applying the established general solubility equation (GSE) to the training set and the drug-like test set gave poorer results than the QSPR model. The agrochemical test set was predicted with equal accuracy by the GSE and the QSPR equation. The results of this study suggest that increasing molecular size, rigidity and lipophilicity decrease solubility, whereas increasing conformational flexibility and the presence of a non-conjugated amine group increase the solubility of drug-like compounds. The proposed structural parameters make physical sense and provide simple guidelines for modifying solubility during lead optimisation.
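The abstract compares the QSPR model against the GSE without stating either equation. As a rough illustration only, the sketch below implements the commonly cited form of the GSE, log S = 0.5 - 0.01(MP - 25) - log P, with MP in degrees Celsius and S in mol/L. That form and its constants are taken from the general literature rather than from this abstract and should be checked against the original GSE publication. The helper `r_squared` assumes r² means the coefficient of determination, which the abstract does not specify, and the function names and input values are hypothetical. The coefficients of the five-parameter QSPR model are not given in the abstract, so the model itself is not reproduced here.

```python
def gse_log_s(log_p: float, melting_point_c: float) -> float:
    """Estimate log10 molar aqueous solubility with the commonly cited GSE form.

    Assumption: log S = 0.5 - 0.01 * (MP - 25) - log P, with MP in deg C.
    """
    # Compounds that melt at or below 25 deg C are treated as liquids,
    # so the melting-point (crystal lattice) term is set to zero.
    mp_term = max(melting_point_c - 25.0, 0.0)
    return 0.5 - 0.01 * mp_term - log_p


def r_squared(observed: list[float], predicted: list[float]) -> float:
    """Coefficient of determination, 1 - SS_res / SS_tot (assumed definition)."""
    mean_obs = sum(observed) / len(observed)
    ss_res = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)
    return 1.0 - ss_res / ss_tot


if __name__ == "__main__":
    # Hypothetical compound: log P = 2.0, melting point = 125 deg C.
    print(gse_log_s(2.0, 125.0))
```

Because the GSE needs a measured or estimated melting point, it cannot be applied from structure alone, whereas the five descriptors named in the abstract can all be computed from the molecular structure.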
