Assessment of Methods To Define the Applicability Domain of Structural Alert Models

It is important that in silico models for use in chemical safety legislation, such as REACH, are compliant with the OECD Principles for the Validation of (Q)SARs. Structural alert models can be useful under these circumstances but lack an adequately defined applicability domain. This paper examines several methods of domain definition for structural alert models with the aim of assessing which were the most useful. Specifically, these methods were the use of fragments, chemical descriptor ranges, structural similarity, and specific applicability domain definition software. Structural alerts for mutagenicity in Derek for Windows (DfW) were used as examples, and Ames test data were used to define and test the domain of chemical space where the alerts produce reliable results. The usefulness of each domain was assessed on the criterion that confidence in the correctness of predictions should be greater inside the domain than outside it. By using a combination of structural similarity and chemical fragments a domain was produced where the majority of correct positive predictions for mutagenicity were within the domain and a large proportion of the incorrect positive predictions outside it. However this was not found for the negative predictions; there was little difference between the percentage of true and false predictions for inactivity which were found as either within or outside the applicability domain. A hypothesis for the occurrence of this difference between positive and negative predictions is that differences in structure between training and test compounds are more likely to remove the toxic potential of a compound containing a structural alert than to add an unknown mechanism of action (structural alert) to a molecule which does not already contain an alert. This could be especially true for well studied end points such as the Ames assay where the majority of mechanisms of action are likely to be known.

[1]  N. Nikolova,et al.  International Union of Pure and Applied Chemistry, LUMO energy ± The Lowest Unoccupied Molecular Orbital (LUMO) , 2022 .

[2]  Horvath Dragos,et al.  Predicting the predictability: a unified approach to the applicability domain problem of QSAR models. , 2009, Journal of chemical information and modeling.

[3]  Judith C. Madden,et al.  Formation of Structural Categories to Allow for Read‐Across for Teratogenicity , 2009 .

[4]  Gergana Dimitrova,et al.  A Stepwise Approach for Defining the Applicability Domain of SAR and QSAR Models , 2005, J. Chem. Inf. Model..

[5]  John D. Walker,et al.  Use of structural alerts to develop rules for identifying chemical substances with skin irritation or skin corrosion potential , 2005 .

[6]  Tatiana I Netzeva,et al.  QSARs for the aquatic toxicity of aromatic aldehydes from Tetrahymena data. , 2005, Chemosphere.

[7]  Rolf Altenburger,et al.  Structural alerts--a new classification model to discriminate excess toxicity from narcotic effect levels of organic compounds in the acute daphnid assay. , 2005, Chemical research in toxicology.

[8]  D Mackay,et al.  The physicochemical basis of QSARs for baseline toxicity , 2009, SAR and QSAR in environmental research.

[9]  Darren R. Flower,et al.  On the Properties of Bit String-Based Measures of Chemical Similarity , 1998, J. Chem. Inf. Comput. Sci..

[10]  J C Madden,et al.  Integrating (Q)SAR models, expert systems and read-across approaches for the prediction of developmental toxicity. , 2010, Reproductive toxicology.

[11]  G Patlewicz,et al.  An evaluation of selected global (Q)SARs/expert systems for the prediction of skin sensitisation potential , 2007, SAR and QSAR in environmental research.

[12]  L Maslankiewicz,et al.  Can chemical structure predict reproductive toxicity , 2005 .

[13]  G M Pearl,et al.  Integration of computational analysis as a sentinel tool in toxicological assessments. , 2001, Current topics in medicinal chemistry.

[14]  Ralph Kühne,et al.  Model Selection Based on Structural Similarity-Method Description and Application to Water Solubility Prediction , 2006, J. Chem. Inf. Model..

[15]  A P Worth,et al.  The role of the European centre for the validation of alternative methods (ECVAM) in the validation of (Q)SARs , 2004, SAR and QSAR in environmental research.

[16]  M. Hewitt,et al.  Assessing Applicability Domains of Toxicological QSARs: Definition, Confidence in Predicted Values, and the Role of Mechanisms of Action , 2007 .

[17]  Scott D. Kahn,et al.  Current Status of Methods for Defining the Applicability Domain of (Quantitative) Structure-Activity Relationships , 2005, Alternatives to laboratory animals : ATLA.

[18]  Nina Nikolova-Jeliazkova,et al.  An Approach to Determining Applicability Domains for QSAR Group Contribution Models: An Analysis of SRC KOWWIN , 2005, Alternatives to laboratory animals : ATLA.

[19]  Gerald M. Maggiora,et al.  On Outliers and Activity Cliffs-Why QSAR Often Disappoints , 2006, J. Chem. Inf. Model..

[20]  J. Kazius,et al.  Derivation and validation of toxicophores for mutagenicity prediction. , 2005, Journal of medicinal chemistry.

[21]  Eva Schlede,et al.  Development and Prevalidation of a List of Structure–Activity Relationship Rules to be Used in Expert Systems for Prediction of the Skin-sensitising Properties of Chemicals , 2004, Alternatives to laboratory animals : ATLA.

[22]  T. Schultz,et al.  Verification of the structural alerts for Michael acceptors. , 2007, Chemical research in toxicology.

[23]  R. Venkataraghavan,et al.  Atom pairs as molecular features in structure-activity studies: definition and applications , 1985, J. Chem. Inf. Comput. Sci..

[24]  Nina Nikolova-Jeliazkova,et al.  QSAR Applicability Domain Estimation by Projection of the Training Set in Descriptor Space: A Review , 2005, Alternatives to laboratory animals : ATLA.

[25]  R Posthumus,et al.  Validity and validation of expert (Q)SAR systems. , 2005, SAR and QSAR in environmental research.

[26]  Paola Gramatica,et al.  Methods for reliability and uncertainty assessment and for applicability evaluations of classification- and regression-based QSARs. , 2003, Environmental health perspectives.

[27]  Philip Judson,et al.  Definition of the Applicability Domains of Knowledge-based Predictive Toxicology Expert Systems by Using a Structural Fragment-based Approach , 2009, Alternatives to laboratory animals : ATLA.

[28]  Paola Gramatica,et al.  Principles of QSAR models validation: internal and external , 2007 .

[29]  E. Hulzebos,et al.  (Q)SARS: gatekeepers against risk on chemicals? , 2003, SAR and QSAR in environmental research.

[30]  Neal F. Cariello,et al.  Comparison of the computer programs DEREK and TOPKAT to predict bacterial mutagenicity. Deductive Estimate of Risk from Existing Knowledge. Toxicity Prediction by Komputer Assisted Technology. , 2002, Mutagenesis.