Computational Approaches to Chemical Hazard Assessment

Summary Computational prediction of toxicity has reached new heights as a result of decades of growth in the magnitude and diversity of biological data. Public packages for statistics and machine learning make model creation faster. New theory in machine learning and cheminformatics enables integration of chemical structure, toxicogenomics, simulated and physical data in the prediction of chemical health hazards, and other toxicological information. Our earlier publications have characterized a toxicological dataset of unprecedented scale resulting from the European REACH legislation (Registration Evaluation Authorisation and Restriction of Chemicals). These publications dove into potential use cases for regulatory data and some models for exploiting this data. This article analyzes the options for the identification and categorization of chemicals, moves on to the derivation of descriptive features for chemicals, discusses different kinds of targets modeled in computational toxicology, and ends with a high-level perspective of the algorithms used to create computational toxicology models.

[1]  Daniel P. Russo,et al.  Global Analysis of Publicly Available Safety Data for 9,801 Substances Registered under REACH from 2008–2014 , 2016, ALTEX.

[2]  J. Belden,et al.  Challenges in regulating pesticide mixtures , 2004 .

[3]  Ronan Bureau,et al.  Introduction of Jumping Fragments in Combination with QSARs for the Assessment of Classification in Ecotoxicology , 2010, J. Chem. Inf. Model..

[4]  Xin Wen,et al.  BindingDB: a web-accessible database of experimentally determined protein–ligand binding affinities , 2006, Nucleic Acids Res..

[5]  Bin Chen,et al.  Assessing Drug Target Association Using Semantic Linked Data , 2012, PLoS Comput. Biol..

[6]  Hao Zhu,et al.  Analysis of Draize Eye Irritation Testing and its Prediction by Mining Publicly Available 2008–2014 REACH Data , 2016, ALTEX.

[7]  H. Kubinyi Comparative Molecular Field Analysis (CoMFA) , 2002 .

[8]  Alexander Tropsha,et al.  Trust, But Verify: On the Importance of Chemical Structure Curation in Cheminformatics and QSAR Modeling Research , 2010, J. Chem. Inf. Model..

[9]  Florian Nigsch,et al.  Computational toxicology: an overview of the sources of data and of modelling methods. , 2009, Expert opinion on drug metabolism & toxicology.

[10]  I. Gutman Degree-Based Topological Indices , 2013 .

[11]  Thomas Hartung,et al.  Food for thought ... on alternative methods for cosmetics safety testing. , 2008, ALTEX.

[12]  M. Randic Characterization of molecular branching , 1975 .

[13]  Luis G Valerio,et al.  Characterization and validation of an in silico toxicology model to predict the mutagenic potential of drug impurities. , 2012, Toxicology and applied pharmacology.

[14]  Kunal Roy,et al.  Advances in QSAR Modeling: Applications in Pharmaceutical, Chemical, Food, Agricultural and Environmental Sciences , 2017 .

[15]  D. Blacker Food for thought. , 2013, JAMA neurology.

[16]  Richard A Becker,et al.  Read-across approaches--misconceptions, promises and challenges ahead. , 2014, ALTEX.

[17]  Christoph Steinbeck,et al.  The ChEBI reference database and ontology for biologically relevant chemistry: enhancements for 2013 , 2012, Nucleic Acids Res..

[18]  Andrew Worth,et al.  Applying Adverse Outcome Pathways (AOPs) to support Integrated Approaches to Testing and Assessment (IATA). , 2014, Regulatory toxicology and pharmacology : RTP.

[19]  Volkmar Schröder,et al.  UN-GHS — Physical hazard classifications of chemicals: A critical review of combinations of hazard classes , 2017 .

[20]  Alexandra Maertens,et al.  Analysis of Publically Available Skin Sensitization Data from REACH Registrations 2008–2014 , 2016, ALTEX.

[21]  Chris Winder,et al.  The development of the globally harmonized system (GHS) of classification and labelling of hazardous chemicals. , 2005, Journal of hazardous materials.

[22]  Kenichi Fukui,et al.  A Molecular Orbital Theory of Reactivity in Aromatic Hydrocarbons , 1952 .

[23]  Bin Chen,et al.  Predicting drug target interactions using meta-path-based semantic network analysis , 2016, BMC Bioinformatics.

[24]  A Mani-Varnosfaderani,et al.  Therapeutic index modeling and predictive QSAR of novel thiazolidin-4-one analogs against Toxoplasma gondii. , 2015, European journal of pharmaceutical sciences : official journal of the European Federation for Pharmaceutical Sciences.

[25]  Bin Chen,et al.  Chem2Bio2RDF: a semantic framework for linking and data mining chemogenomic and systems chemical biology data , 2010, BMC Bioinformatics.

[26]  John M. Barnard,et al.  Clustering of chemical structures on the basis of two-dimensional similarity measures , 1992, J. Chem. Inf. Comput. Sci..

[27]  Igor Linkov,et al.  From "Weight of Evidence" to Quantitative Data Integration using Multicriteria Decision Analysis and Bayesian Methods , 2015, ALTEX.

[28]  Gang Fu,et al.  PubChem Substance and Compound databases , 2015, Nucleic Acids Res..

[29]  M Liebsch,et al.  Regulatory use of (Q)SARs in toxicological hazard assessment strategies , 2004, SAR and QSAR in environmental research.

[30]  Desire L. Massart,et al.  Prediction of gas chromatographic retention indexes with topological, physicochemical, and quantum chemical parameters , 1983 .

[31]  Hilda Witters,et al.  A European perspective on alternatives to animal testing for environmental hazard identification and risk assessment. , 2013, Regulatory toxicology and pharmacology : RTP.

[32]  N. Nikolova,et al.  International Union of Pure and Applied Chemistry, LUMO energy ± The Lowest Unoccupied Molecular Orbital (LUMO) , 2022 .

[33]  Manuela Pavan,et al.  The Characterisation of (Quantitative) Structure-Activity Relationships: Preliminary Guidance , 2005 .

[34]  Paola Gramatica,et al.  Introduction General Considerations , 2022 .

[35]  Maik Moeller,et al.  An Introduction To Chemoinformatics , 2016 .

[36]  E. Matthews,et al.  An in silico expert system for the identification of eye irritants , 2015, SAR and QSAR in environmental research.

[37]  Thomas Hartung,et al.  Integrated Testing Strategies (ITS) for safety assessment. , 2015, ALTEX.

[38]  Julie B. Zimmerman,et al.  Toward molecular design for hazard reduction—fundamental relationships between chemical properties and toxicity , 2010 .

[39]  M. Karelson,et al.  Quantum-Chemical Descriptors in QSAR/QSPR Studies. , 1996, Chemical reviews.

[40]  Lourdes Santana,et al.  A QSAR model for in silico screening of MAO-A inhibitors. Prediction, synthesis, and biological assay of novel coumarins. , 2006, Journal of medicinal chemistry.

[41]  Alexandra Maertens,et al.  Integrated testing strategies for safety assessments. , 2013, ALTEX.

[42]  Gergana Dimitrova,et al.  A Stepwise Approach for Defining the Applicability Domain of SAR and QSAR Models , 2005, J. Chem. Inf. Model..

[43]  Anna Tsantili-Kakoulidou,et al.  Applying quantitative structure-activity relationship (QSAR) methodology for modeling postmortem redistribution of benzodiazepines and tricyclic antidepressants. , 2014, Journal of analytical toxicology.

[44]  Mohanad El-Harbawi,et al.  Development of a novel mathematical model using a group contribution method for prediction of ionic liquid toxicities. , 2011, Chemosphere.

[45]  T. Hartung Evolution of toxicological science: the need for change , 2017 .

[46]  Valérie Zuang,et al.  First alternative method validated by a retrospective weight-of-evidence approach to replace the Draize eye test for the identification of non-irritant substances for a defined applicability domain. , 2010, ALTEX.

[47]  R. J. Doerksen,et al.  Topological polar surface area: a useful descriptor in 2D-QSAR. , 2009, Current medicinal chemistry.

[48]  Tao Jiang,et al.  ChemmineR: a compound mining framework for R , 2008, Bioinform..

[49]  Chris Morley,et al.  Open Babel: An open chemical toolbox , 2011, J. Cheminformatics.

[50]  Michael J. Keiser,et al.  Relating protein pharmacology by ligand chemistry , 2007, Nature Biotechnology.

[51]  V. Poroikov,et al.  Directions in QSAR Modeling for Regulatory Uses in OECD Member Countries, EU and in Russia , 2008, Journal of environmental science and health. Part C, Environmental carcinogenesis & ecotoxicology reviews.

[52]  M. Cronin,et al.  Computational methods for the prediction of drug toxicity. , 2000, Current opinion in drug discovery & development.

[53]  David Weininger,et al.  SMILES, a chemical language and information system. 1. Introduction to methodology and encoding rules , 1988, J. Chem. Inf. Comput. Sci..

[54]  Charles W. Schmidt,et al.  TSCA 2.0: A New Era in Chemical Risk Management , 2016, Environmental health perspectives.

[55]  Lingxiao Zhou,et al.  Combining spatial and chemical information for clustering pharmacophores , 2014, BMC Bioinformatics.

[56]  Walter Thiel,et al.  Semiempirical quantum–chemical methods , 2014 .

[57]  Joanna Jaworska Integrated Testing Strategies for Skin Sensitization Hazard and Potency Assessment—State of the Art and Challenges , 2016 .

[58]  H. Shiraishi,et al.  Application of chemical reaction mechanistic domains to an ecotoxicity QSAR model, the KAshinhou Tool for Ecotoxicity (KATE) , 2011, SAR and QSAR in environmental research.

[59]  Thomas Hartung,et al.  Making big sense from big data in toxicology by read-across. , 2016, ALTEX.

[60]  Gavin Maxwell,et al.  From pathways to people: applying the adverse outcome pathway (AOP) for skin sensitization to risk assessment. , 2013, ALTEX.

[61]  Alexandra Maertens,et al.  Analysis of Public Oral Toxicity Data from REACH Registrations 2008–2014 , 2016, ALTEX.

[62]  T Hartung,et al.  Toward an evidence-based toxicology , 2006, Human & experimental toxicology.

[63]  Alexandra Maertens,et al.  Probabilistic hazard assessment for skin sensitization potency by dose–response modeling using feature elimination instead of quantitative structure–activity relationships , 2015, Journal of applied toxicology : JAT.

[64]  D. Weed Weight of Evidence: A Review of Concept and Methods , 2005, Risk analysis : an official publication of the Society for Risk Analysis.

[65]  Ivonne M C M Rietjens,et al.  Promises and pitfalls of quantitative structure-activity relationship approaches for predicting metabolism and toxicity. , 2008, Chemical research in toxicology.

[66]  J C Madden,et al.  Structure-based modelling in reproductive toxicology: (Q)SARs for the placental barrier , 2007, SAR and QSAR in environmental research.

[67]  Ellen K Silbergeld,et al.  Regulating chemicals: law, science, and the unbearable burdens of regulation. , 2015, Annual review of public health.

[68]  Timothy F. Malloy,et al.  Leveraging the new predictive toxicology paradigm: alternative testing strategies in regulatory decision-making , 2016 .

[69]  Egon L. Willighagen,et al.  The Chemistry Development Kit (CDK) v2.0: atom typing, depiction, molecular formulas, and substructure searching , 2017, Journal of Cheminformatics.

[70]  R. Cramer,et al.  Comparative molecular field analysis (CoMFA). 1. Effect of shape on binding of steroids to carrier proteins. , 1988, Journal of the American Chemical Society.

[71]  Alexandra Maertens,et al.  t4 report: Toward Good Read-Across Practice (GRAP) Guidance , 2016, ALTEX.

[72]  Stephen R. Heller,et al.  InChI, the IUPAC International Chemical Identifier , 2015, Journal of Cheminformatics.

[73]  J E Ridings,et al.  Computer prediction of possible toxic action from chemical structure: an update on the DEREK system. , 1996, Toxicology.

[74]  J. Dearden,et al.  QSAR modeling: where have you been? Where are you going to? , 2014, Journal of medicinal chemistry.

[75]  Naomi L Kruhlak,et al.  Identification of structure-activity relationships for adverse effects of pharmaceuticals in humans: Part C: use of QSAR and an expert system for the estimation of the mechanism of action of drug-induced hepatobiliary and urinary tract toxicities. , 2009, Regulatory toxicology and pharmacology : RTP.

[76]  Tomasz Puzyn,et al.  Calculation of Quantum-Mechanical Descriptors for QSPR at the DFT Level: Is It Necessary? , 2008, J. Chem. Inf. Model..

[77]  Jan A. Kors,et al.  Consistency of systematic chemical identifiers within and between small-molecule databases , 2012, Journal of Cheminformatics.

[78]  John P. Overington,et al.  ChEMBL: a large-scale bioactivity database for drug discovery , 2011, Nucleic Acids Res..

[79]  Sabcho D Dimitrov,et al.  A systematic approach to simulating metabolism in computational toxicology. I. The TIMES heuristic modelling framework. , 2004, Current pharmaceutical design.

[80]  Thomas Hartung,et al.  Food for thought... on evidence-based toxicology. , 2009, ALTEX.

[81]  Francois Busquet,et al.  The need for strategic development of safety sciences. , 2017, ALTEX.

[82]  Judy Strickland,et al.  Bayesian integrated testing strategy (ITS) for skin sensitization potency assessment: a decision support system for quantitative weight of evidence and adaptive testing strategy , 2015, Archives of Toxicology.

[83]  I Kimber,et al.  Contact allergenic potency: correlation of human and local lymph node assay data. , 2001, American journal of contact dermatitis : official journal of the American Contact Dermatitis Society.

[84]  Arthur Dalby,et al.  Description of several chemical structure file formats used by computer programs developed at Molecular Design Limited , 1992, J. Chem. Inf. Comput. Sci..

[85]  Evan Bolton,et al.  An overview of the PubChem BioAssay resource , 2009, Nucleic Acids Res..

[86]  Evan Bolton,et al.  Similar compounds versus similar conformers: complementarity between PubChem 2-D and 3-D neighboring sets , 2016, Journal of Cheminformatics.

[87]  Thomas Hartung,et al.  Food for thought...on alternative methods for chemical safety testing. , 2010, ALTEX.

[88]  Nicole Kleinstreuer,et al.  Supporting read-across using biological data. , 2016, ALTEX.

[89]  E Benfenati,et al.  Automatic knowledge extraction from chemical structures: the case of mutagenicity prediction , 2013, SAR and QSAR in environmental research.

[90]  Ivan Rusyn,et al.  Predicting drug-induced hepatotoxicity using QSAR and toxicogenomics approaches. , 2011, Chemical research in toxicology.

[91]  Grace Patlewicz,et al.  Computational Methods to Predict Drug Safety , 2006 .

[92]  Igor V. Tetko,et al.  Applicability Domains for Classification Problems: Benchmarking of Distance to Models for Ames Mutagenicity Set , 2010, J. Chem. Inf. Model..