Collaborative development of predictive toxicology applications

OpenTox provides an interoperable, standards-based Framework for the support of predictive toxicology data management, algorithms, modelling, validation and reporting. It is relevant to satisfying the chemical safety assessment requirements of the REACH legislation as it supports access to experimental data, (Quantitative) Structure-Activity Relationship models, and toxicological information through an integrating platform that adheres to regulatory requirements and OECD validation principles. Initial research defined the essential components of the Framework including the approach to data access, schema and management, use of controlled vocabularies and ontologies, architecture, web service and communications protocols, and selection and integration of algorithms for predictive modelling. OpenTox provides end-user oriented tools to non-computational specialists, risk assessors, and toxicological experts in addition to Application Programming Interfaces (APIs) for developers of new applications. OpenTox actively supports public standards for data representation, interfaces, vocabularies and ontologies, Open Source approaches to core platform components, and community-based collaboration approaches, so as to progress system interoperability goals.The OpenTox Framework includes APIs and services for compounds, datasets, features, algorithms, models, ontologies, tasks, validation, and reporting which may be combined into multiple applications satisfying a variety of different user needs. OpenTox applications are based on a set of distributed, interoperable OpenTox API-compliant REST web services. The OpenTox approach to ontology allows for efficient mapping of complementary data coming from different datasets into a unifying structure having a shared terminology and representation.Two initial OpenTox applications are presented as an illustration of the potential impact of OpenTox for high-quality and consistent structure-activity relationship modelling of REACH-relevant endpoints: ToxPredict which predicts and reports on toxicities for endpoints for an input chemical structure, and ToxCreate which builds and validates a predictive toxicity model based on an input toxicology dataset. Because of the extensible nature of the standardised Framework design, barriers of interoperability between applications and content are removed, as the user may combine data, models and validation from multiple sources in a dependable and time-effective way.

[1]  Igor V. Tetko,et al.  Combinatorial QSAR Modeling of Chemical Toxicants Tested against Tetrahymena pyriformis , 2008, J. Chem. Inf. Model..

[2]  Carl Bedingfield Review of "Spinning the semantic web: Bringing the world wide web to its full potential" edited by Dieter Fensel, James Hendler, Henry Lieberman, and Wolfgang Wahlster, The MIT press , 2003, UBIQ.

[3]  S. Anzali,et al.  Discriminating between drugs and nondrugs by prediction of activity spectra for substances (PASS). , 2001, Journal of medicinal chemistry.

[4]  Jiawei Han,et al.  gSpan: graph-based substructure pattern mining , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[5]  G. Patlewicz,et al.  An evaluation of the implementation of the Cramer classification scheme in the Toxtree software , 2008, SAR and QSAR in environmental research.

[6]  Indira Ghosh,et al.  Developing an Antituberculosis Compounds Database and Data Mining in the Search of a Motif Responsible for the Activity of a Diverse Class of Antituberculosis Agents , 2006, J. Chem. Inf. Model..

[7]  V. Cogliano International Agency for Research on Cancer (IARC) , 2018, The Grants Register 2019.

[8]  J. G. Hengstler,et al.  Alternative methods to safety studies in experimental animals: role in the risk assessment of chemicals under the new European Chemicals Legislation (REACH) , 2008, Archives of Toxicology.

[9]  CChem FRSC,et al.  Guidelines for the testing of chemicals for mutagenicity. Committee on Mutagenicity of Chemicals in Food, Consumer Products and the Environment. , 1989, Reports on health and social subjects.

[10]  Katharina Jahn,et al.  Optimizing gSpan for Molecular Datasets , 2005 .

[11]  Egon L. Willighagen,et al.  The Chemistry Development Kit (CDK): An Open-Source Java Library for Chemo-and Bioinformatics , 2003, J. Chem. Inf. Comput. Sci..

[12]  Roger A. Smith,et al.  Combinatorial Chemistry and High‐throughput Screening , 2006 .

[13]  Vicki Dellarco,et al.  Use of mechanism-based structure-activity relationships analysis in carcinogenic potential ranking for drinking water disinfection by-products. , 2002, Environmental health perspectives.

[14]  Guidance on information requirements and chemical safety assessment , 2008 .

[15]  Roy Fielding,et al.  Architectural Styles and the Design of Network-based Software Architectures"; Doctoral dissertation , 2000 .

[16]  Sean C. Sweetman,et al.  Martindale: The Complete Drug Reference , 1999 .

[17]  R. Biswas,et al.  Metagraph-Based Substructure Pattern Mining , 2008, 2008 International Conference on Advanced Computer Theory and Engineering.

[18]  Nigel Shadbolt,et al.  Resource Description Framework (RDF) , 2009 .

[19]  Thomas Hartung,et al.  Chemical regulators have overreached , 2009, Nature.

[20]  Deborah L. McGuinness,et al.  OWL Web ontology language overview , 2004 .

[21]  Ann M Richard,et al.  Distributed structure-searchable toxicity (DSSTox) public database network: a proposal. , 2002, Mutation research.

[22]  3rd World Congress on Alternatives and Animal Use in the Life Sciences , 1999 .

[23]  Gobinda G. Chowdhury,et al.  Spinning the Semantic Web: Bringing the World Wide Web to Its Full Potential , 2004 .

[24]  H WittenIan,et al.  The WEKA data mining software , 2009 .

[25]  I. Gutman,et al.  Simplified molecular input line entry system (SMILES) as an alternative for constructing quantitative structure-property relationships (QSPR) , 2005 .

[26]  R. Cooper,et al.  Nature of the binding interaction for 50 structurally diverse chemicals with rat estrogen receptors. , 2006, Toxicological sciences : an official journal of the Society of Toxicology.

[27]  Romualdo Benigni,et al.  The Benigni / Bossa rulebase for mutagenicity and carcinogenicity - a module of Toxtree , 2008 .

[28]  Ann Richard,et al.  ACToR--Aggregated Computational Toxicology Resource. , 2008, Toxicology and applied pharmacology.

[29]  S Jacobi,et al.  REPDOSE: A database on repeated dose toxicity studies of commercial chemicals--A multifunctional tool. , 2006, Regulatory toxicology and pharmacology : RTP.

[30]  Ian Kimber,et al.  Compilation of Historical Local Lymph Node Data for Evaluation of Skin Sensitization Alternative Methods , 2005, Dermatitis : contact, atopic, occupational, drug.

[31]  Martin H. Abramson,et al.  Complete Drug Reference , 1996 .

[32]  Jeff Z. Pan,et al.  Resource Description Framework , 2020, Definitions.

[33]  Christopher K. I. Williams,et al.  Gaussian Processes for Machine Learning (Adaptive Computation and Machine Learning) , 2005 .

[34]  Vladimir Poroikov,et al.  Why relevant chemical information cannot be exchanged without disclosing structures , 2005, J. Comput. Aided Mol. Des..

[35]  Timothy E. McMahon,et al.  National library of medicine web site: (http: //www.nlm.nih.gov/.) National Institutes of Health, U.S. Department of Health and Human Services. Reviewed in July 1998 , 1999, Gov. Inf. Q..

[36]  Chihae Yang,et al.  Toxicity Data Informatics: Supporting a New Paradigm for Toxicity Prediction , 2008, Toxicology mechanisms and methods.

[37]  H M Schoolman,et al.  The United States National Library of Medicine. , 1989, Seminars in dermatology.

[38]  W. Russell,et al.  Ethical and Scientific Considerations Regarding Animal Testing and Research , 2011, PloS one.

[39]  Huan Liu,et al.  Chi2: feature selection and discretization of numeric attributes , 1995, Proceedings of 7th IEEE International Conference on Tools with Artificial Intelligence.

[40]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[41]  Deborah L. McGuinness,et al.  Ontologies Come of Age , 2003, Spinning the Semantic Web.