New QSPR study for the prediction of aqueous solubility of drug-like compounds.

Solubility has become one of the key physicochemical screens at early stages of the drug development process. Solubility prediction through Quantitative Structure-Property Relationships (QSPR) modeling is a growing area of modern pharmaceutical research, being compatible with both High Throughput Screening technologies and limited compound availability characteristic of early stages of drug development. We resort to the QSPR theory for analyzing the aqueous solubility exhibited by 145 diverse drug-like organic compounds (0.781 being the average Tanimoto distances between all possible pairs of compounds in the training set). An accurate and generally applicable model is derived, consisting on a linear regression equation that involves three DRAGON molecular descriptors selected from more than a thousand available. Alternatively, we apply the linear QSPR to other 21 commonly employed validation compounds, leading to solubility estimations that compare fairly well with the performance achieved by previously reported Group Contribution Methods.

[1]  S. Hirono,et al.  Simple Method of Calculating Octanol/Water Partition Coefficient. , 1992 .

[2]  Paola Gramatica,et al.  Structure/Response Correlations and Similarity/Diversity Analysis by GETAWAY Descriptors, 2. Application of the Novel 3D Molecular Descriptors to QSAR/QSPR Studies , 2002, J. Chem. Inf. Comput. Sci..

[3]  H. Kubinyi Variable Selection in QSAR Studies. II. A Highly Efficient Combination of Systematic Search and Evolution , 1994 .

[4]  Johann Gasteiger,et al.  Prediction of Aqueous Solubility of Organic Compounds Based on a 3D Structure Representation , 2003, J. Chem. Inf. Comput. Sci..

[5]  M Karplus,et al.  Evolutionary optimization in quantitative structure-activity relationship: an application of genetic neural networks. , 1996, Journal of medicinal chemistry.

[6]  Han van de Waterbeemd,et al.  Chemometric methods in molecular design , 1995 .

[7]  Li Di,et al.  Biological assay challenges from compound solubility: strategies for bioassay optimization. , 2006, Drug discovery today.

[8]  M. Fernández,et al.  QSAR for non-nucleoside inhibitors of HIV-1 reverse transcriptase. , 2006, Bioorganic & medicinal chemistry.

[9]  Milan Randic,et al.  Orthogonal molecular descriptors , 1991 .

[10]  Tingjun Hou,et al.  ADME Evaluation in Drug Discovery. 4. Prediction of Aqueous Solubility Based on Atom Contribution Approach , 2004, J. Chem. Inf. Model..

[11]  J. J. Goodwin Rationale and benefit of using high throughput solubility screens in drug discovery. , 2006, Drug discovery today. Technologies.

[12]  C. Hansch,et al.  The relative toxicity of compounds in mainstream cigarette smoke condensate. , 2000, Food and chemical toxicology : an international journal published for the British Industrial Biological Research Association.

[13]  A. Talevi,et al.  Application of descriptors based on Lipinski's rules in the QSPR study of aqueous solubilities. , 2007, Bioorganic & medicinal chemistry.

[14]  Takahiro Suzuki,et al.  Development of an automatic estimation system for both the partition coefficient and aqueous solubility , 1991, J. Comput. Aided Mol. Des..

[15]  Roberto Todeschini,et al.  Handbook of Molecular Descriptors , 2002 .

[16]  Maykel Pérez González,et al.  A new search algorithm for QSPR/QSAR theories: Normal boiling points of some organic molecules , 2005 .

[17]  Chon-lin Lee,et al.  Use of oyster, Crassostrea gigas, and ambient water to assess metal pollution status of the charting coastal area, Taiwan, after the 1986 green oyster incident , 1996 .

[18]  Stephen R. Johnson,et al.  Molecular properties that influence the oral bioavailability of drug candidates. , 2002, Journal of medicinal chemistry.

[19]  Samuel H. Yalkowsky,et al.  Aqueous functional group activity coefficients (AQUAFAC) 4: Applications to complex organic compounds , 1996 .

[20]  Shaomeng Wang,et al.  Estimation of aqueous solubility of organic molecules by the group contribution approach. Application to the study of biodegradation , 1992, J. Chem. Inf. Comput. Sci..

[21]  Hao Zhu,et al.  Estimation of the Aqueous Solubility of Organic Molecules by the Group Contribution Approach , 2001, J. Chem. Inf. Comput. Sci..

[22]  F. Lombardo,et al.  Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings , 1997 .

[23]  Jarmo Huuskonen,et al.  Estimation of Aqueous Solubility for a Diverse Set of Organic Compounds Based on Molecular Topology , 2000, J. Chem. Inf. Comput. Sci..

[24]  N. Draper,et al.  Applied Regression Analysis , 1966 .

[25]  Frank Harary,et al.  Graph Theory , 2016 .

[26]  J. Delaney Predicting aqueous solubility from structure. , 2005, Drug discovery today.

[27]  W Patrick Walters,et al.  Prediction of 'drug-likeness'. , 2002, Advanced drug delivery reviews.

[28]  Alban Arrault,et al.  Managing, profiling and analyzing a library of 2.6 million compounds gathered from 32 chemical providers , 2006, Molecular Diversity.

[29]  I. Tetko,et al.  In silico approaches to prediction of aqueous and DMSO solubility of drug-like compounds: trends, problems and solutions. , 2006, Current medicinal chemistry.

[30]  Shobha N. Bhattachar,et al.  Solubility: it's not just for physical chemists. , 2006, Drug discovery today.

[31]  A. Tropsha,et al.  Beware of q2! , 2002, Journal of molecular graphics & modelling.

[32]  W. Patrick Walters,et al.  Filtering databases and chemical libraries , 2004, Molecular Diversity.

[33]  Maykel Pérez González,et al.  Application of the replacement method as a novel variable selection strategy in QSAR. 1. Carcinogenic potential , 2006 .

[34]  Manfred Kansy,et al.  High throughput solubility measurement in drug discovery and development. , 2007, Advanced drug delivery reviews.

[35]  Stephen R. Johnson,et al.  Recent progress in the computational prediction of aqueous solubility and absorption , 2006, The AAPS Journal.

[36]  Milan Randic,et al.  Resolution of ambiguities in structure-property studies by use of orthogonal descriptors , 1991, J. Chem. Inf. Comput. Sci..

[37]  C. Laggner,et al.  Why drugs fail--a study on side effects in new chemical entities. , 2005 .

[38]  S. Stegemann,et al.  When poor solubility becomes an issue: from early stage to proof of concept. , 2007, European journal of pharmaceutical sciences : official journal of the European Federation for Pharmaceutical Sciences.

[39]  Douglas M. Hawkins,et al.  Assessing Model Fit by Cross-Validation , 2003, J. Chem. Inf. Comput. Sci..

[40]  Ralph Kühne,et al.  Group contribution methods to estimate water solubility of organic chemicals , 1995 .