Generalized Read-Across (GenRA): A workflow implemented into the EPA CompTox Chemicals Dashboard.

Generalized Read-Across (GenRA) is a data driven approach which makes read-across predictions on the basis of a similarity weighted activity of source analogues (nearest neighbors). GenRA has been described in more detail in the literature (Shah et al., 2016; Helman et al., 2018). Here we present its implementation within the EPA's CompTox Chemicals Dashboard to provide public access to a GenRA module structured as a read-across workflow. GenRA assists researchers in identifying source analogues, evaluating their validity and making predictions of in vivo toxicity effects for a target substance. Predictions are presented as binary outcomes reflecting presence or absence of toxicity together with quantitative measures of uncertainty. The approach allows users to identify analogues in different ways, quickly assess the availability of relevant in vivo data for those analogues and visualize these in a data matrix to evaluate the consistency and concordance of the available experimental data for those analogues before making a GenRA prediction. Predictions can be exported into a tab-separated value (TSV) or Excel file for additional review and analysis (e.g., doses of analogues associated with production of toxic effects).  GenRA offers a new capability of making reproducible read-across predictions in an easy-to use-interface.

[1]  Antony J. Williams,et al.  The CompTox Chemistry Dashboard: a community data resource for environmental chemistry , 2017, Journal of Cheminformatics.

[2]  Antony J. Williams,et al.  ToxCast Chemical Landscape: Paving the Road to 21st Century Toxicology. , 2016, Chemical research in toxicology.

[3]  Imran Shah,et al.  Navigating through the minefield of read-across frameworks: A commentary perspective , 2018 .

[4]  Nicole Kleinstreuer,et al.  Supporting read-across using biological data. , 2016, ALTEX.

[5]  David Rogers,et al.  Extended-Connectivity Fingerprints , 2010, J. Chem. Inf. Model..

[6]  Johann Gasteiger,et al.  New Publicly Available Chemical Query Language, CSRML, To Support Chemotype Representations for Application to Data Mining and Modeling , 2015, J. Chem. Inf. Model..

[7]  Imran Shah,et al.  Systematically evaluating read-across prediction and performance using a local validity approach characterized by chemical structure and bioactivity information. , 2016, Regulatory toxicology and pharmacology : RTP.

[8]  Alexander Golbraikh,et al.  Integrative chemical-biological read-across approach for chemical hazard classification. , 2013, Chemical research in toxicology.

[9]  Terry W Schultz,et al.  Lessons learned from read-across case studies for repeated-dose toxicity. , 2017, Regulatory toxicology and pharmacology : RTP.

[10]  Imran Shah,et al.  Navigating through the minefield of read-across tools: A review of in silico tools for grouping , 2017 .

[11]  Ramaswamy Nilakantan,et al.  Topological torsion: a new molecular descriptor for SAR applications. Comparison with other descriptors , 1987, J. Chem. Inf. Comput. Sci..

[12]  Imran Shah,et al.  Extending the Generalised Read-Across approach (GenRA): A systematic analysis of the impact of physicochemical property information on read-across performance. , 2018, Computational toxicology.