ADME Evaluation in Drug Discovery. 4. Prediction of Aqueous Solubility Based on Atom Contribution Approach

A novel method for the estimation of aqueous solubility was solely based on simple atom contribution. Each atom in a molecule has its own contribution to aqueous solubility and was developed. Altogether 76 atom types were used to classify atoms with different chemical environments. Moreover, two correction factors, including hydrophobic carbon and square of molecular weight, were used to account for the inter-/intramolecular hydrophobic interactions and bulkiness effect. The contribution coefficients of different atom types and correction factors were generated based on a multiple linear regression using a learning set consisting of 1290 organic compounds. The obtained linear regression model possesses good statistical significance with an overall correlation coefficient (r) of 0.96, a standard deviation (s) of 0.61, and an unsigned mean error (UME) of 0.48. The actual prediction potential of the model was validated through an external test set with 21 pharmaceutically and environmentally interesting compounds. For the test set, a predictive r=0.94, s=0.84, and UME=0.52 were achieved. Comparisons among eight procedures of solubility calculation for those 21 molecules demonstrate that our model bears very good accuracy and is comparable to or even better than most reported techniques based on molecular descriptors. Moreover, we compared the performance of our model to a test set of 120 molecules with a popular group contribution method developed by Klopman et al. For this test set, our model gives a very effective prediction (r=0.96, s=0.79, UME=0.57), which is obviously superior to the predicted results (r=0.96, s=0.84, UME=0.70) given by the Klopman's group contribution approach. Because of the adoption of atoms as the basic units, our addition model does not contain a "missing fragment" problem and thus may be more simple and universal than the group contribution models and can give predictions for any organic molecules. A program, drug-LOGS, had been developed to identify the occurrence of atom types and estimate the aqueous solubility of a molecule.

[1]  Ruifeng Liu,et al.  Development of Quantitative Structure-Property Relationship Models for Early ADME Evaluation in Drug Discovery. 1. Aqueous Solubility , 2001, J. Chem. Inf. Comput. Sci..

[2]  Ola Engkvist,et al.  High-Throughput, In Silico Prediction of Aqueous Solubility Based on One- and Two-Dimensional Descriptors , 2002, J. Chem. Inf. Comput. Sci..

[3]  Sujit Banerjee,et al.  Aqueous solubility : methods of estimation for organic compounds , 1992 .

[4]  Samuel H. Yalkowsky,et al.  Aqueous functional group activity coefficients (AQUAFAC) 4: Applications to complex organic compounds , 1996 .

[5]  James W. McFarland,et al.  Estimating the Water Solubilities of Crystalline Compounds from Their Chemical Structures Alone , 2001, J. Chem. Inf. Comput. Sci..

[6]  Peter C. Jurs,et al.  Prediction of Aqueous Solubility of Heteroatom-Containing Organic Compounds from Molecular Structure , 2001, J. Chem. Inf. Comput. Sci..

[7]  Igor V. Tetko,et al.  Estimation of Aqueous Solubility of Chemical Compounds Using E-State Indices , 2001, J. Chem. Inf. Comput. Sci..

[8]  Samuel H. Yalkowsky,et al.  Comment on “Prediction of Aqueous Solubility of Organic Chemicals Based on Molecular Structure. 2. Application to PNAs, PCBs, PCDDs, etc.” , 1989 .

[9]  Johann Gasteiger,et al.  Prediction of Aqueous Solubility of Organic Compounds Based on a 3D Structure Representation , 2003, J. Chem. Inf. Comput. Sci..

[10]  Shaomeng Wang,et al.  Estimation of aqueous solubility of organic molecules by the group contribution approach. Application to the study of biodegradation , 1992, J. Chem. Inf. Comput. Sci..

[11]  Takahiro Suzuki,et al.  Development of an automatic estimation system for both the partition coefficient and aqueous solubility , 1991, J. Comput. Aided Mol. Des..

[12]  A. Ghose,et al.  Atomic Physicochemical Parameters for Three‐Dimensional Structure‐Directed Quantitative Structure‐Activity Relationships I. Partition Coefficients as a Measure of Hydrophobicity , 1986 .

[13]  Samuel H. Yalkowsky,et al.  Estimation of the aqueous solubility of complex organic compounds , 1993 .

[14]  Thomas A. Halgren Merck molecular force field. I. Basis, form, scope, parameterization, and performance of MMFF94 , 1996, J. Comput. Chem..

[15]  Hao Zhu,et al.  Estimation of the Aqueous Solubility of Organic Molecules by the Group Contribution Approach , 2001, J. Chem. Inf. Comput. Sci..

[16]  Darko Butina,et al.  Modeling Aqueous Solubility , 2003, J. Chem. Inf. Comput. Sci..

[17]  Richard E. Speece Reply to comments on "Preditction of aqueous solubility of organic chemicals based on molecular structure. 2. Application to PNAs, PCBs, PCDDs etc." , 1990 .

[18]  Tingjun Hou,et al.  Recent development and application of virtual screening in drug discovery: an overview. , 2004, Current pharmaceutical design.

[19]  Samuel H. Yalkowsky,et al.  Prediction of Drug Solubility by the General Solubility Equation (GSE) , 2001, J. Chem. Inf. Comput. Sci..

[20]  S. Yalkowsky,et al.  Estimation of the aqueous solubility I: application to organic nonelectrolytes. , 2001, Journal of pharmaceutical sciences.

[21]  R E Speece,et al.  Prediction of aqueous solubility of organic chemicals based on molecular structure. , 1988, Environmental science & technology.

[22]  T. Halgren Merck molecular force field. I. Basis, form, scope, parameterization, and performance of MMFF94 , 1996, J. Comput. Chem..

[23]  Andreas Zell,et al.  Prediction of Aqueous Solubility and Partition Coefficient Optimized by a Genetic Algorithm Based Descriptor Selection Method , 2003, J. Chem. Inf. Comput. Sci..

[24]  Tingjun Hou,et al.  ADME Evaluation in Drug Discovery. 2. Prediction of Partition Coefficient by Atom-Additive Approach Based on Atom-Weighted Solvent Accessible Surface Areas , 2003, J. Chem. Inf. Comput. Sci..

[25]  Peter C. Jurs,et al.  Prediction of Aqueous Solubility of Heteroatom‐Containing Organic Compounds from Molecular Structure. , 2001 .

[26]  Ruifeng Liu,et al.  Development of Quantitative Structure—Property Relationship Models for Early ADME Evaluation in Drug Discovery. Part 2. Blood‐Brain Barrier Penetration. , 2002 .

[27]  Ralph Kühne,et al.  Group contribution methods to estimate water solubility of organic chemicals , 1995 .

[28]  Tingjun Hou,et al.  ADME evaluation in drug discovery , 2002, Journal of molecular modeling.

[29]  David Weininger,et al.  SMILES, a chemical language and information system. 1. Introduction to methodology and encoding rules , 1988, J. Chem. Inf. Comput. Sci..

[30]  Jyrki Taskinen,et al.  Aqueous Solubility Prediction of Drugs Based on Molecular Topology and Neural Network Modeling , 1998, J. Chem. Inf. Comput. Sci..

[31]  Tingjun Hou,et al.  ADME Evaluation in Drug Discovery. 3. Modeling Blood-Brain Barrier Partitioning Using Simple Molecular Descriptors , 2003, J. Chem. Inf. Comput. Sci..