Chemical-Induced Phenotypes at CTD Help Inform the Predisease State and Construct Adverse Outcome Pathways

The Comparative Toxicogenomics Database (CTD; http://ctdbase.org) is a public resource that manually curates the scientific literature to provide content that illuminates the molecular mechanisms by which environmental exposures affect human health. We introduce our new chemical-phenotype module that describes how chemicals can affect molecular, cellular, and physiological phenotypes. At CTD, we operationally distinguish between phenotypes and diseases, wherein a phenotype refers to a nondisease biological event: for example, decreased cell cycle arrest (phenotype) versus liver cancer (disease), or increased fat cell proliferation (phenotype) versus morbid obesity (disease). Chemical-phenotype interactions are expressed in a formal structured notation using controlled terms for chemicals, phenotypes, taxa, and anatomical descriptors. Combining this information with CTD's chemical-disease module allows inferences to be made between phenotypes and diseases, yielding potential insight into the predisease state. Integration of all 4 CTD modules furnishes unique opportunities for toxicologists to generate computationally predictive adverse outcome pathways, linking chemical-gene molecular initiating events with phenotypic key events, adverse disease outcomes, and population-level health outcomes. As examples, we present 3 diverse case studies: discerning the effect of vehicle emissions on altered leukocyte migration, the role of cadmium in influencing phenotypes preceding Alzheimer disease, and the connection of arsenic-induced glucose metabolic phenotypes with diabetes. To date, CTD contains over 165 000 interactions that connect more than 6400 chemicals to 3900 phenotypes for 760 anatomical terms in 215 species, from over 19 000 scientific articles. To our knowledge, this is the first comprehensive set of manually curated, literature-based, contextualized, chemical-induced, nondisease phenotype data provided to the public.
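The phenotype-disease inference described above can be pictured as a join on shared chemicals: a phenotype and a disease become linked whenever the same chemical is associated with both. The following is a minimal sketch of that idea only, not CTD's actual implementation or data format. The record lists, the term strings, and the function name `infer_phenotype_disease` are all hypothetical and merely echo the case studies named in the abstract.

```python
from collections import defaultdict

# Hypothetical toy records for illustration; not actual CTD content or file formats.
chemical_phenotype = [
    ("arsenic", "glucose import"),
    ("arsenic", "insulin secretion"),
    ("cadmium", "neuron apoptotic process"),
]
chemical_disease = [
    ("arsenic", "diabetes mellitus"),
    ("cadmium", "Alzheimer disease"),
]


def infer_phenotype_disease(chem_pheno, chem_disease):
    """Link each phenotype to each disease through the chemicals they share.

    Returns a dict mapping (phenotype, disease) to the set of chemicals
    that support the inferred link.
    """
    # Index diseases by chemical so each phenotype record is a single lookup.
    diseases_by_chem = defaultdict(set)
    for chem, disease in chem_disease:
        diseases_by_chem[chem].add(disease)

    inferred = defaultdict(set)
    for chem, pheno in chem_pheno:
        for disease in diseases_by_chem.get(chem, ()):
            inferred[(pheno, disease)].add(chem)
    return dict(inferred)


inferences = infer_phenotype_disease(chemical_phenotype, chemical_disease)
for (pheno, disease), chems in sorted(inferences.items()):
    print(f"{pheno} -> {disease} via {sorted(chems)}")
```

Keeping the set of supporting chemicals for each pair, rather than a bare yes/no link, makes it possible to rank inferred pairs by how many chemicals support them; whether and how CTD ranks its inferences is not stated here.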
