Evaluating the applicability domain in the case of classification predictive models for carcinogenicity based on the counter propagation artificial neural network

The applicability domain (AD) of models developed for regulatory use has attached great attention recently. The AD of quantitative structure–activity relationship (QSAR) models is the response and chemical structure space in which the model makes predictions with a given reliability. The evaluation of AD of regressions QSAR models for congeneric sets of chemicals can be find in many papers and books while the issue about metrics for the evaluation of an AD for the non-linear models (like neural networks) for the diverse set of chemicals represents the new field of investigations in QSAR studies. The scientific society is standing before the challenge to find out reliable way for the evaluation of an AD of non linear models. The new metrics for the evaluation of the AD of the counter propagation artificial neural network (CP ANN) models are discussed in the article: the Euclidean distances between an object (molecule) and the corresponding excited neuron of the neural network and between an object (molecule) and the representative object (vector of average values of descriptors). The investigation of the training and test sets chemicals coverage in the descriptors space was made with the respect to false predicted chemicals. The leverage approach was used to compare non linear (CP ANN) models with linear ones.

[1]  Gallegos Saliner Ana,et al.  A Similarity Based Approach for Chemical Category Classification , 2005 .

[2]  Paola Gramatica,et al.  Introduction General Considerations , 2022 .

[3]  C Helma,et al.  Predictive Models for Carcinogenicity and Mutagenicity: Frameworks, State-of-the-Art, and Perspectives , 2009, Journal of environmental science and health. Part C, Environmental carcinogenesis & ecotoxicology reviews.

[4]  Nina Nikolova-Jeliazkova,et al.  QSAR Applicability Domain Estimation by Projection of the Training Set in Descriptor Space: A Review , 2005, Alternatives to laboratory animals : ATLA.

[5]  Gergana Dimitrova,et al.  A Stepwise Approach for Defining the Applicability Domain of SAR and QSAR Models , 2005, J. Chem. Inf. Model..

[6]  Emilio Benfenati,et al.  New public QSAR model for carcinogenicity , 2010, Chemistry Central journal.

[7]  Paola Gramatica,et al.  Principles of QSAR models validation: internal and external , 2007 .

[8]  Simon Haykin,et al.  Neural Networks: A Comprehensive Foundation , 1998 .

[9]  David G. Stork,et al.  Pattern Classification , 1973 .

[10]  Igor V. Tetko,et al.  Critical Assessment of QSAR Models of Environmental Toxicity against Tetrahymena pyriformis: Focusing on Applicability Domain and Overfitting by Variable Selection , 2008, J. Chem. Inf. Model..

[11]  Johann Gasteiger,et al.  Neural networks in chemistry and drug design , 1999 .

[12]  Joanna Jaworska,et al.  Improving Opportunities for Regulatory Acceptance of QSARs: The Importance of Model Domain, Uncertainty, Validity and Predictability , 2003 .

[13]  Scott D. Kahn,et al.  Current Status of Methods for Defining the Applicability Domain of (Quantitative) Structure-Activity Relationships , 2005, Alternatives to laboratory animals : ATLA.

[14]  P Barbieri,et al.  Application of counterpropagation artificial neural network for modelling properties of fish antibiotics , 2004, SAR and QSAR in environmental research.

[15]  Romualdo Benigni,et al.  The Benigni / Bossa rulebase for mutagenicity and carcinogenicity - a module of Toxtree , 2008 .

[16]  N. Nikolova,et al.  International Union of Pure and Applied Chemistry, LUMO energy ± The Lowest Unoccupied Molecular Orbital (LUMO) , 2022 .

[17]  Paola Gramatica,et al.  The Importance of Being Earnest: Validation is the Absolute Essential for Successful Application and Interpretation of QSPR Models , 2003 .

[18]  Ralph Kühne,et al.  Quantitative and qualitative models for carcinogenicity prediction for non-congeneric chemicals using CP ANN method for regulatory uses , 2010, Molecular Diversity.

[19]  Paola Gramatica,et al.  Predicting the NO3 radical tropospheric degradability of organic pollutants by theoretical molecular descriptors , 2003 .