Computation of Octanol-Water Partition Coefficients by Guiding an Additive Model with Knowledge

We have developed a new method, i.e., XLOGP3, for logP computation. XLOGP3 predicts the logP value of a query compound by using the known logP value of a reference compound as a starting point. The difference in the logP values of the query compound and the reference compound is then estimated by an additive model. The additive model implemented in XLOGP3 uses a total of 87 atom/group types and two correction factors as descriptors. It is calibrated on a training set of 8199 organic compounds with reliable logP data through a multivariate linear regression analysis. For a given query compound, the compound showing the highest structural similarity in the training set will be selected as the reference compound. Structural similarity is quantified based on topological torsion descriptors. XLOGP3 has been tested along with its predecessor, i.e., XLOGP2, as well as several popular logP methods on two independent test sets: one contains 406 small-molecule drugs approved by the FDA and the other contains 219 oligopeptides. On both test sets, XLOGP3 produces more accurate predictions than most of the other methods with average unsigned errors of 0.24-0.51 units. Compared to conventional additive methods, XLOGP3 does not rely on an extensive classification of fragments and correction factors in order to improve accuracy. It is also able to utilize the ever-increasing experimentally measured logP data more effectively.

[1]  Sudhir A. Kulkarni,et al.  Three-Dimensional QSAR Using the k-Nearest Neighbor Method and Its Interpretation , 2006, J. Chem. Inf. Model..

[2]  Gordon M. Crippen,et al.  Prediction of Physicochemical Parameters by Atomic Contributions , 1999, J. Chem. Inf. Comput. Sci..

[3]  C. Hansch,et al.  p-σ-π Analysis. A Method for the Correlation of Biological Activity and Chemical Structure , 1964 .

[4]  Vijay K. Gombar,et al.  Assessment of n-Octanol/Water Partition Coefficient: When Is the Assessment Reliable? , 1996, J. Chem. Inf. Comput. Sci..

[5]  I. Tetko,et al.  Application of ALOGPS to predict 1-octanol/water distribution coefficients, logP, and logD, of AstraZeneca in-house database. , 2004, Journal of pharmaceutical sciences.

[6]  Marvin Johnson,et al.  Concepts and applications of molecular similarity , 1990 .

[7]  Luhua Lai,et al.  A New Atom-Additive Method for Calculating Partition Coefficients , 1997, J. Chem. Inf. Comput. Sci..

[8]  A. Leo,et al.  Partition coefficients and their uses , 1971 .

[9]  A. Petrauskas,et al.  ACD/Log P method description , 2000 .

[10]  Arup K. Ghose,et al.  Atomic physicochemical parameters for three dimensional structure directed quantitative structure-activity relationships. 4. Additional parameters for hydrophobic and dispersive interactions and their application for an automated superposition of certain naturally occurring nucleoside antibiotics , 1989, J. Chem. Inf. Comput. Sci..

[11]  I Kövesdi,et al.  Reliability of logP predictions based on calculated molecular descriptors: a critical review. , 2002, Current medicinal chemistry.

[12]  Ralph Kühne,et al.  Model Selection Based on Structural Similarity-Method Description and Application to Water Solubility Prediction , 2006, J. Chem. Inf. Model..

[13]  W. Meylan,et al.  Atom/fragment contribution method for estimating octanol-water partition coefficients. , 1995, Journal of pharmaceutical sciences.

[14]  L. Lai,et al.  Calculating partition coefficient by atom-additive method , 2000 .

[15]  Luhua Lai,et al.  Calculating Partition Coefficients of Peptides by the Addition Method , 1999 .

[16]  F. Lombardo,et al.  Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings , 1997 .

[17]  Igor V. Tetko,et al.  Application of Associative Neural Networks for Prediction of Lipophilicity in ALOGPS 2.1 Program , 2002, J. Chem. Inf. Comput. Sci..

[18]  David S. Wishart,et al.  DrugBank: a comprehensive resource for in silico drug discovery and exploration , 2005, Nucleic Acids Res..

[19]  Arup K. Ghose,et al.  Estimating aqueous solvation and lipophilicity of small organic molecules: A comparative overview of atom/group contribution methods , 2000 .

[20]  Ramaswamy Nilakantan,et al.  Topological torsion: a new molecular descriptor for SAR applications. Comparison with other descriptors , 1987, J. Chem. Inf. Comput. Sci..

[21]  Bernard Testa,et al.  Computational Approaches to Lipophilicity: Methods and Applications , 2007 .

[22]  Corwin Hansch,et al.  Role of hydrophobic effects in mechanistic QSAR , 1999 .

[23]  Matthew Segall,et al.  In silico prediction of ADME properties: are we making progress? , 2004, Current opinion in drug discovery & development.

[24]  C. Hansch,et al.  A NEW SUBSTITUENT CONSTANT, PI, DERIVED FROM PARTITION COEFFICIENTS , 1964 .

[25]  Gilles Klopman,et al.  A Structural Analogue Approach to the Prediction of the Octanol-Water Partition Coefficient , 2006, J. Chem. Inf. Model..

[26]  Matthew Walker,et al.  Training ACD/LogP with Experimental Data , 2004 .

[27]  H. van de Waterbeemd,et al.  ADMET in silico modelling: towards prediction paradise? , 2003, Nature reviews. Drug discovery.

[28]  A. Ghose,et al.  Atomic Physicochemical Parameters for Three‐Dimensional Structure‐Directed Quantitative Structure‐Activity Relationships I. Partition Coefficients as a Measure of Hydrophobicity , 1986 .

[29]  Peter C. Jurs,et al.  Computer-Assisted Computation of Partition Coefficients from Molecular Structures Using Fragment Constants , 1979, J. Chem. Inf. Comput. Sci..

[30]  Hao Zhu,et al.  A New Group Contribution Approach to the Calculation of LogP , 2005 .

[31]  Alexander Tropsha,et al.  k Nearest Neighbors QSAR Modeling as a Variational Problem: Theory and Applications , 2005, J. Chem. Inf. Model..

[32]  Philip H. Howard,et al.  Estimating log P with atom/fragments and water solubility with log P , 2000 .

[33]  Raimund Mannhold,et al.  Substructure versus Whole‐molecule Approaches for Calculating Log P , 2003 .

[34]  Glen Eugene Kellogg,et al.  HINT: A new method of empirical hydrophobic field calculation for CoMFA , 1991, J. Comput. Aided Mol. Des..

[35]  Andrew M Davis,et al.  Predictive ADMET studies, the challenges and the opportunities. , 2004, Current opinion in chemical biology.

[36]  A. Ghose,et al.  Prediction of Hydrophobic (Lipophilic) Properties of Small Organic Molecules Using Fragmental Methods: An Analysis of ALOGP and CLOGP Methods , 1998 .

[37]  Peter D J Grootenhuis,et al.  Progress in computational methods for the prediction of ADMET properties. , 2002, Current opinion in drug discovery & development.

[38]  D. Hoekman Exploring QSAR Fundamentals and Applications in Chemistry and Biology, Volume 1. Hydrophobic, Electronic and Steric Constants, Volume 2 J. Am. Chem. Soc. 1995, 117, 9782 , 1996 .