Toxmatch--a chemical classification and activity prediction tool based on similarity measures.

Chemical similarity forms the underlying basis for the development of (Quantitative) Structure-Activity Relationships ((Q)SARs), expert systems and chemical groupings. Recently a new software tool to facilitate chemical similarity calculations named Toxmatch was developed. Toxmatch encodes a number of similarity indices to help in the systematic development of chemical groupings, including endpoint specific groupings and read-across, and the comparison of model training and test sets. Two rule-based classification schemes were additionally implemented, namely: the Verhaar scheme for assigning mode of action for aquatic toxicants and the BfR rulebase for skin irritation and corrosion. In this study, a variety of different descriptor-based similarity indices were used to evaluate and compare the BfR training set with respect to its test set. The descriptors utilised in this comparison were the same as those used to derive the original BfR rules i.e. the descriptors selected were relevant for skin irritation/corrosion. The Euclidean distance index was found to be the most predictive of the indices in assessing the performance of the rules.

[1]  M. Uslenghi,et al.  The World Space Observatory (WSO-UV) - Current status , 2008, 0801.2080.

[2]  Manuela Pavan,et al.  The Characterisation of (Quantitative) Structure-Activity Relationships: Preliminary Guidance , 2005 .

[3]  Michael M. Cone,et al.  Molecular structure comparison program for the identification of maximal common substructures , 1977 .

[4]  J Jaworska,et al.  How can structural similarity analysis help in category formation? , 2007, SAR and QSAR in environmental research.

[5]  Robert C. Glen,et al.  Novel Methods for the Prediction of logP, pKa, and logD , 2002, J. Chem. Inf. Comput. Sci..

[6]  M. Pavan,et al.  Evaluation of SARs for the prediction of skin irritation/corrosion potential–structural inclusion rules in the BfR decision support system , 2007, SAR and QSAR in environmental research.

[7]  Ramon Carbo,et al.  How similar is a molecule to another? An electron density measure of similarity between two molecular structures , 1980 .

[8]  P. Botham,et al.  Skin Irritation / Corrosion , 2004 .

[9]  Edward E. Hodgkin,et al.  Molecular similarity based on electrostatic potential and electric field , 1987 .

[10]  Petra S. Kern,et al.  Skin Sensitization: Modeling Based on Skin Metabolism Simulation and Formation of Protein Conjugates , 2005, International journal of toxicology.

[11]  Andrew C. Good,et al.  Utilization of Gaussian functions for the rapid evaluation of molecular similarity , 1992, J. Chem. Inf. Comput. Sci..

[12]  David Robert,et al.  A Formal Comparison between Molecular Quantum Similarity Measures and Indices , 1998, J. Chem. Inf. Comput. Sci..

[13]  J. Ashby Fundamental structural alerts to potential carcinogenicity or noncarcinogenicity. , 1985, Environmental mutagenesis.

[14]  F. McLafferty,et al.  Computer-aided interpretation of mass spectra. 20. Molecular structure comparison program for the identification of maximal common substructures , 1977 .

[15]  J. Hermens,et al.  Classifying environmental pollutants: Part 3. External validation of the classification system. , 2000, Chemosphere.

[16]  Scott D. Kahn,et al.  Current Status of Methods for Defining the Applicability Domain of (Quantitative) Structure-Activity Relationships , 2005, Alternatives to laboratory animals : ATLA.

[17]  David W Roberts,et al.  Mechanistic applicability domains for nonanimal-based prediction of toxicological end points: general principles and application to reactive toxicity. , 2006, Chemical research in toxicology.

[18]  P. Willett,et al.  A Fast Algorithm For Selecting Sets Of Dissimilar Molecules From Large Chemical Databases , 1995 .

[19]  Hugo Kubinyi,et al.  Chemical similarity and biological activities , 2002 .

[20]  Eva Schlede,et al.  Development and Prevalidation of a List of Structure–Activity Relationship Rules to be Used in Expert Systems for Prediction of the Skin-sensitising Properties of Chemicals , 2004, Alternatives to laboratory animals : ATLA.

[21]  R. Carbó,et al.  Molecular quantum similarity measures and N-dimensional representation of quantum objects. I. Theoretical foundations† , 1992 .

[22]  John D. Walker,et al.  Use of Physicochemical Property Limits to Develop Rules for Identifying Chemical Substances with no Skin Irritation or Corrosion Potential , 2004 .

[23]  Julius T. Tou,et al.  Pattern Recognition Principles , 1974 .

[24]  N. Nikolova,et al.  International Union of Pure and Applied Chemistry, LUMO energy ± The Lowest Unoccupied Molecular Orbital (LUMO) , 2022 .

[25]  M. Pavan,et al.  The role of the European Chemicals Bureau in promoting the regulatory use of (Q)SAR methods , 2007, SAR and QSAR in environmental research.

[26]  John D. Walker,et al.  Use of QSARs in international decision-making frameworks to predict health effects of chemical substances. , 2003, Environmental health perspectives.

[27]  R A Ford,et al.  Estimation of toxic hazard--a decision tree approach. , 1978, Food and cosmetics toxicology.

[28]  Andreas Bender,et al.  Molecular Similarity Searching Using Atom Environments, Information-Based Feature Selection, and a Naïve Bayesian Classifier , 2004, J. Chem. Inf. Model..

[29]  J. Hermens,et al.  Classifying environmental pollutants , 1992 .

[30]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[31]  R Posthumus,et al.  Validity and validation of expert (Q)SAR systems. , 2005, SAR and QSAR in environmental research.

[32]  Worth Andrew,et al.  The Use of Computational Methods in the Grouping and Assessment of Chemicals - Preliminary Investigations , 2007 .

[33]  Gallegos Saliner Ana Mini-Review on Chemical Similarity and Prediction of Toxicity , 2006 .

[34]  G Patlewicz,et al.  Toxmatch–a new software tool to aid in the development and evaluation of chemically similar groups , 2008, SAR and QSAR in environmental research.

[35]  David Robert,et al.  Analyzing the Triple Density Molecular Quantum Similarity Measures with the INDSCAL Model , 1998, J. Chem. Inf. Comput. Sci..

[36]  John D. Walker,et al.  The Skin Irritation Corrosion Rules Estimation Tool (SICRET) , 2005 .

[37]  Egon L. Willighagen,et al.  The Chemistry Development Kit (CDK): An Open-Source Java Library for Chemo-and Bioinformatics , 2003, J. Chem. Inf. Comput. Sci..

[38]  M. P. Payne,et al.  Structure-activity relationships for skin sensitization potential: Development of structural alerts for use in knowledge-based toxicity prediction systems , 1994, J. Chem. Inf. Comput. Sci..

[39]  Andrew P. Worth,et al.  Review of Literature-Based Models for Skin and Eye Irritation and Corrosion , 2006 .

[40]  John D. Walker,et al.  Use of structural alerts to develop rules for identifying chemical substances with skin irritation or skin corrosion potential , 2005 .

[41]  Silvia Lanteri,et al.  Topics in Current Chemistry, 151, 93-143 (1987) , 1987 .