The Comparative Toxicogenomics Database's 10th year anniversary: update 2015

Ten years ago, the Comparative Toxicogenomics Database (CTD; http://ctdbase.org/) was developed out of a need to formalize, harmonize and centralize the information on numerous genes and proteins responding to environmental toxic agents across diverse species. CTD's initial approach was to facilitate comparisons of nucleotide and protein sequences of toxicologically significant genes by curating these sequences and electronically annotating them with chemical terms from their associated references. Since then, however, CTD has vastly expanded its scope to robustly represent a triad of chemical–gene, chemical–disease and gene–disease interactions that are manually curated from the scientific literature by professional biocurators using controlled vocabularies, ontologies and structured notation. Today, CTD includes 24 million toxicogenomic connections relating chemicals/drugs, genes/proteins, diseases, taxa, phenotypes, Gene Ontology annotations, pathways and interaction modules. In this 10th year anniversary update, we outline the evolution of CTD, including our increased data content, new ‘Pathway View’ visualization tool, enhanced curation practices, pilot chemical–phenotype results and impending exposure data set. The prototype database originally described in our first report has transformed into a sophisticated resource used actively today to help scientists develop and test hypotheses about the etiologies of environmentally influenced diseases.
