Estimation of Aqueous Solubility of Chemical Compounds Using E-State Indices

The molecular weight and electrotopological E-state indices were used to estimate by Artificial Neural Networks aqueous solubility for a diverse set of 1291 organic compounds. The neural network with 33-4-1 neurons provided highly predictive results with r(2) = 0.91 and RMS = 0.62. The used parameters included several combinations of E-state indices with similar properties. The calculated results were similar to those published for these data by Huuskonen (2000). However, in the current study only E-state indices were used without need of additional indices (the molecular connectivity, shape, flexibility and indicator indices) also considered in the previous study. In addition, the present neural network contained three times less hidden neurons. Smaller neural networks and use of one homogeneous set of parameters provides a more robust model for prediction of aqueous solubility of chemical compounds. Limitations of the developed method for prediction of large compounds are discussed. The developed approach is available online at http://www.lnh.unil.ch/~itetko/logp.

[1]  Igor V. Tetko,et al.  Neural network studies, 1. Comparison of overfitting and overtraining , 1995, J. Chem. Inf. Comput. Sci..

[2]  L. Lai,et al.  Calculating partition coefficient by atom-additive method , 2000 .

[3]  N. Bodor,et al.  A new method for the estimation of the aqueous solubility of organic compounds. , 1992, Journal of pharmaceutical sciences.

[4]  W. Meylan,et al.  Atom/fragment contribution method for estimating octanol-water partition coefficients. , 1995, Journal of pharmaceutical sciences.

[5]  David Weininger,et al.  SMILES, a chemical language and information system. 1. Introduction to methodology and encoding rules , 1988, J. Chem. Inf. Comput. Sci..

[6]  Ralph Kühne,et al.  Group contribution methods to estimate water solubility of organic chemicals , 1995 .

[7]  Igor V. Tetko,et al.  Neural Network Studies, 2. Variable Selection , 1996, J. Chem. Inf. Comput. Sci..

[8]  Peter C. Jurs,et al.  Prediction of Aqueous Solubility for a Diverse Set of Heteroatom-Containing Organic Compounds Using a Quantitative Structure-Property Relationship , 1996, J. Chem. Inf. Comput. Sci..

[9]  L. Hall,et al.  Molecular Structure Description: The Electrotopological State , 1999 .

[10]  Igor V. Tetko,et al.  Prediction of n-Octanol/Water Partition Coefficients from PHYSPROP Database Using Artificial Neural Networks and E-State Indices , 2001, J. Chem. Inf. Comput. Sci..

[11]  M. Abraham,et al.  The correlation and prediction of the solubility of compounds in water using an amended solvation energy relationship. , 1999, Journal of pharmaceutical sciences.

[12]  Peter C. Jurs,et al.  Prediction of Aqueous Solubility of Organic Compounds , 1994, J. Chem. Inf. Comput. Sci..

[13]  S. Unger Molecular Connectivity in Structure–activity Analysis , 1987 .

[14]  Samuel H. Yalkowsky,et al.  Aqueous functional group activity coefficients (AQUAFAC) 4: Applications to complex organic compounds , 1996 .

[15]  G. S. Patil Prediction of aqueous solubility and octanol—water partition coefficient for pesticides based on their molecular structure , 1994 .

[16]  Igor V. Tetko,et al.  Neural Network Studies. 3. Variable Selection in the Cascade-Correlation Learning Architecture , 1998, J. Chem. Inf. Comput. Sci..

[17]  Igor V. Tetko,et al.  Efficient Partition of Learning Data Sets for Neural Network Training , 1997, Neural Networks.

[18]  R E Speece,et al.  Prediction of aqueous solubility of organic chemicals based on molecular structure. , 1988, Environmental science & technology.

[19]  Corwin Hansch,et al.  Role of hydrophobic effects in mechanistic QSAR , 1999 .

[20]  A. Leo CALCULATING LOG POCT FROM STRUCTURES , 1993 .

[21]  Samuel H. Yalkowsky,et al.  Prediction of Drug Solubility by the General Solubility Equation (GSE) , 2001, J. Chem. Inf. Comput. Sci..

[22]  Samuel H. Yalkowsky,et al.  AQUAFAC 3: aqueous functional group activity coefficients; application to the estimation of aqueous solubility , 1995 .

[23]  Shaomeng Wang,et al.  Estimation of aqueous solubility of organic molecules by the group contribution approach. Application to the study of biodegradation , 1992, J. Chem. Inf. Comput. Sci..

[24]  T. A. Andrea,et al.  Applications of neural networks in quantitative structure-activity relationships of dihydrofolate reductase inhibitors. , 1991, Journal of medicinal chemistry.

[25]  H Ichikawa,et al.  Neural networks applied to quantitative structure-activity relationship analysis. , 1990, Journal of medicinal chemistry.

[26]  Igor V. Tetko,et al.  Internet Software for the Calculation of the Lipophilicity and Aqueous Solubility of Chemical Compounds , 2001, J. Chem. Inf. Comput. Sci..

[27]  Alan R. Katritzky,et al.  Correlation of the Aqueous Solubility of Hydrocarbons and Halogenated Hydrocarbons with Molecular Structure , 1998, J. Chem. Inf. Comput. Sci..