Comparative Toxicogenomics Database: a knowledgebase and discovery tool for chemical–gene–disease networks

The Comparative Toxicogenomics Database (CTD) is a curated database that promotes understanding of the effects of environmental chemicals on human health. Biocurators at CTD manually curate chemical–gene interactions, chemical–disease relationships and gene–disease relationships from the literature. Because all three relationship types are captured, the data can be integrated to construct chemical–gene–disease networks. CTD is unique in numerous respects: curation focuses on environmental chemicals; interactions are manually curated; interactions are constructed using controlled vocabularies and hierarchies; additional gene attributes (such as Gene Ontology, taxonomy and KEGG pathways) are integrated; data can be viewed from the perspective of a chemical, gene or disease; results and batch queries can be downloaded and saved; and, most importantly, CTD acts as both a knowledgebase (by reporting data) and a discovery tool (by generating novel inferences). More than 116 000 interactions between 3900 chemicals and 13 300 genes have been curated from 270 species, and 5900 gene–disease and 2500 chemical–disease direct relationships have been captured. By integrating these data, 350 000 gene–disease relationships and 77 000 chemical–disease relationships can be inferred. This wealth of chemical–gene–disease information yields testable hypotheses for understanding the effects of environmental chemicals on human health. CTD is freely available at http://ctd.mdibl.org.
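
The inference step described above can be illustrated with a minimal sketch. It assumes that a chemical–disease relationship is inferred whenever a chemical interacts with a gene that is itself directly associated with a disease; the abstract does not spell out the exact rule, so this is an illustration of the general idea rather than CTD's implementation. The function name `infer_chemical_disease` and all identifiers in the toy data (`chemical_A`, `gene_1`, `disease_X`, and so on) are hypothetical placeholders, not CTD records.

```python
from collections import defaultdict


def infer_chemical_disease(chem_gene, gene_disease):
    """Infer chemical-disease pairs that share at least one gene.

    chem_gene: iterable of (chemical, gene) curated interactions.
    gene_disease: iterable of (gene, disease) curated relationships.
    Returns a dict mapping (chemical, disease) to the set of genes
    that bridge the two (the evidence for the inference).
    """
    # Index diseases by gene so each chemical-gene pair is a single lookup.
    diseases_by_gene = defaultdict(set)
    for gene, disease in gene_disease:
        diseases_by_gene[gene].add(disease)

    inferred = defaultdict(set)
    for chemical, gene in chem_gene:
        for disease in diseases_by_gene.get(gene, ()):
            inferred[(chemical, disease)].add(gene)
    return dict(inferred)


# Toy, hypothetical data (not taken from CTD).
chem_gene = [
    ("chemical_A", "gene_1"),
    ("chemical_A", "gene_2"),
    ("chemical_B", "gene_3"),
]
gene_disease = [
    ("gene_1", "disease_X"),
    ("gene_2", "disease_X"),
    ("gene_3", "disease_Y"),
    ("gene_4", "disease_Z"),  # no chemical touches gene_4, so nothing is inferred
]

inferred = infer_chemical_disease(chem_gene, gene_disease)
for (chemical, disease), genes in sorted(inferred.items()):
    print(chemical, disease, sorted(genes))
```

Keeping the bridging genes alongside each inferred pair preserves the evidence behind the inference, which is what makes the result a testable hypothesis rather than a bare association. Gene–disease inferences could be sketched the same way by joining on shared chemicals instead of shared genes.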
