Prediction of tissue-air partition coefficients: A comparison of structure-based and property-based methods

Three linear regression methods were used to develop models for predicting rat tissue-air partition coefficients (P). In general, ridge regression (RR) was found to be superior to principal component regression (PCR) and partial least squares regression (PLS). A set of 46 diverse low-molecular-weight volatile chemicals was used to model fat-air, liver-air and muscle-air partition coefficients for male Fischer 344 rats. Models developed using descriptors based solely on molecular structure were compared with models developed using experimental properties, including saline-air and olive oil-air partition coefficients, as independent variables. The comparison indicated that the structure-property correlations are comparable to the property-property correlations. Multiple structure-based models were developed using classes of structural descriptors ordered by level of complexity, i.e., topostructural (TS), topochemical (TC) and 3-dimensional (3D) descriptors, together with the calculated octanol-water partition coefficient. In most cases, the structure-based models developed using only the TC descriptors were superior to those developed using the other structural descriptor classes. Haloalkane subgroups were modeled separately for comparison; although the models based on these congeneric compounds were superior, the models developed on the complete sets of diverse compounds were acceptable. The types of descriptors important for partitioning were also compared across the various media.
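As an illustration of the ridge regression approach named above, the following is a minimal sketch, not the authors' implementation. It assumes the closed-form ridge estimator on mean-centered data and uses leave-one-out cross-validated q² as the figure of merit; the abstract does not state which criterion was used. The data are synthetic (46 hypothetical compounds, three hypothetical descriptors, two of them nearly collinear), and all function names are made up for this example.

```python
import random


def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(A)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            f = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= f * M[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        s = sum(M[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (M[i][n] - s) / M[i][i]
    return x


def ridge_fit(X, y, lam):
    """Return (intercept, coefficients); solves (Xc'Xc + lam*I) w = Xc'yc."""
    n, p = len(X), len(X[0])
    x_mean = [sum(row[j] for row in X) / n for j in range(p)]
    y_mean = sum(y) / n
    Xc = [[row[j] - x_mean[j] for j in range(p)] for row in X]
    yc = [v - y_mean for v in y]
    A = [[sum(Xc[i][a] * Xc[i][b] for i in range(n)) + (lam if a == b else 0.0)
          for b in range(p)] for a in range(p)]
    rhs = [sum(Xc[i][a] * yc[i] for i in range(n)) for a in range(p)]
    w = solve(A, rhs)
    b0 = y_mean - sum(w[j] * x_mean[j] for j in range(p))
    return b0, w


def predict(model, row):
    """Predict the response for one descriptor row."""
    b0, w = model
    return b0 + sum(wj * xj for wj, xj in zip(w, row))


def loo_q2(X, y, lam):
    """Leave-one-out cross-validated q^2 = 1 - PRESS / SS_total."""
    n = len(X)
    press = 0.0
    for i in range(n):
        model = ridge_fit(X[:i] + X[i + 1:], y[:i] + y[i + 1:], lam)
        press += (y[i] - predict(model, X[i])) ** 2
    y_mean = sum(y) / n
    ss_tot = sum((v - y_mean) ** 2 for v in y)
    return 1.0 - press / ss_tot


def make_synthetic_data(seed=0, n=46):
    """Synthetic stand-in for descriptors and log P; not data from the study."""
    rng = random.Random(seed)
    X, y = [], []
    for _ in range(n):
        x1 = rng.uniform(-1.0, 1.0)
        x2 = x1 + rng.gauss(0.0, 0.05)  # nearly collinear with x1
        x3 = rng.uniform(-1.0, 1.0)
        X.append([x1, x2, x3])
        y.append(2.0 * x1 - 1.0 * x3 + rng.gauss(0.0, 0.1))
    return X, y


if __name__ == "__main__":
    X, y = make_synthetic_data()
    for lam in (0.01, 0.1, 1.0, 10.0):
        print(f"lambda={lam:<5} LOO q2={loo_q2(X, y, lam):.3f}")
```

The penalty `lam` stabilizes the coefficient estimates when descriptors are strongly intercorrelated, as computed structural descriptors often are. In practice it would be chosen by cross-validation rather than fixed in advance.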
