Chapter 3: Small Molecules and Disease

“Big” molecules such as proteins and genes still continue to capture the imagination of most biologists, biochemists and bioinformaticians. “Small” molecules, on the other hand, are the molecules that most biologists, biochemists and bioinformaticians prefer to ignore. However, it is becoming increasingly apparent that small molecules such as amino acids, lipids and sugars play a far more important role in all aspects of disease etiology and disease treatment than we realized. This particular chapter focuses on an emerging field of bioinformatics called “chemical bioinformatics” – a discipline that has evolved to help address the blended chemical and molecular biological needs of toxicogenomics, pharmacogenomics, metabolomics and systems biology. In the following pages we will cover several topics related to chemical bioinformatics. First, a brief overview of some of the most important or useful chemical bioinformatic resources will be given. Second, a more detailed overview will be given on those particular resources that allow researchers to connect small molecules to diseases. This section will focus on describing a number of recently developed databases or knowledgebases that explicitly relate small molecules – either as the treatment, symptom or cause – to disease. Finally a short discussion will be provided on newly emerging software tools that exploit these databases as a means to discover new biomarkers or even new treatments for disease.

[1]  David S. Wishart,et al.  Nucleic Acids Research Polysearch: a Web-based Text Mining System for Extracting Relationships between Human Diseases, Genes, Mutations, Drugs Polysearch: a Web-based Text Mining System for Extracting Relationships between Human Diseases, Genes, Mutations, Drugs and Metabolites , 2008 .

[2]  Takao Shimizu,et al.  Basic analytical systems for lipidomics by mass spectrometry in Japan. , 2007, Methods in enzymology.

[3]  David S Wishart,et al.  DrugBank and its relevance to pharmacogenomics. , 2008, Pharmacogenomics.

[4]  John L Markley,et al.  Metabolite identification via the Madison Metabolomics Consortium Database , 2008, Nature Biotechnology.

[5]  D. Wishart Proteomics and the Human Metabolome Project , 2007, Expert review of proteomics.

[6]  Alexander R. Pico,et al.  WikiPathways: Pathway Editing for the People , 2008, PLoS biology.

[7]  Thomas C. Wiegers,et al.  Comparative Toxicogenomics Database: a knowledgebase and discovery tool for chemical–gene–disease networks , 2008, Nucleic Acids Res..

[8]  F. Brown Chapter 35 – Chemoinformatics: What is it and How does it Impact Drug Discovery. , 1998 .

[9]  Yves Gibon,et al.  GMD@CSB.DB: the Golm Metabolome Database , 2005, Bioinform..

[10]  Peter D. Karp,et al.  Eco Cyc: encyclopedia of Escherichia coli genes and metabolism , 1999, Nucleic Acids Res..

[11]  Michael Darsow,et al.  ChEBI: a database and ontology for chemical entities of biological interest , 2007, Nucleic Acids Res..

[12]  David S. Wishart,et al.  DrugBank: a comprehensive resource for in silico drug discovery and exploration , 2005, Nucleic Acids Res..

[13]  Robert B. Russell,et al.  SuperTarget and Matador: resources for exploring drug-target relationships , 2007, Nucleic Acids Res..

[14]  Liam J. McGuffin,et al.  The PSIPRED protein structure prediction server , 2000, Bioinform..

[15]  David S. Wishart,et al.  SMPDB: The Small Molecule Pathway Database , 2009, Nucleic Acids Res..

[16]  Eoin Fahy,et al.  LIPID MAPS online tools for lipid research , 2007, Nucleic Acids Res..

[17]  Michel Schneider,et al.  UniProtKB/Swiss-Prot. , 2007, Methods in molecular biology.

[18]  C L Hatfield,et al.  Quality of consumer drug information provided by four Web sites. , 1999, American journal of health-system pharmacy : AJHP : official journal of the American Society of Health-System Pharmacists.

[19]  Anthony J Williams,et al.  Public chemical compound databases. , 2008, Current opinion in drug discovery & development.

[20]  Lu Huang,et al.  Update of TTD: Therapeutic Target Database , 2009, Nucleic Acids Res..

[21]  T. N. Bhat,et al.  The Protein Data Bank: unifying the archive , 2002, Nucleic Acids Res..

[22]  R. Abagyan,et al.  METLIN: A Metabolite Mass Spectral Database , 2005, Therapeutic drug monitoring.

[23]  Qingming Luo,et al.  Mass spectrometry in systems biology: an overview. , 2008, Mass spectrometry reviews.

[24]  S. Linder,et al.  Target specificity and off-target effects as determinants of cancer drug efficacy. , 2008, Expert opinion on drug metabolism & toxicology.

[25]  Yanli Wang,et al.  PubChem: a public information system for analyzing bioactivities of small molecules , 2009, Nucleic Acids Res..

[26]  D. Wishart Applications of Metabolomics in Drug Discovery and Development , 2008, Drugs in R&D.

[27]  Peter D. Karp,et al.  EcoCyc: Encyclopedia of Escherichia coli genes and metabolism , 1998, Nucleic Acids Res..

[28]  David Weininger,et al.  SMILES, a chemical language and information system. 1. Introduction to methodology and encoding rules , 1988, J. Chem. Inf. Comput. Sci..

[29]  David S. Wishart,et al.  T3DB: a comprehensively annotated database of common toxins and their targets , 2009, Nucleic Acids Res..

[30]  Zhaohui S. Qin,et al.  A second generation human haplotype map of over 3.1 million SNPs , 2007, Nature.

[31]  Pablo Tamayo,et al.  Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[32]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[33]  Peter D. Karp,et al.  Querying and computing with BioCyc databases , 2005, Bioinform..

[34]  S. Kanaya,et al.  KNApSAcK: A Comprehensive Species-Metabolite Relationship Database , 2006 .

[35]  Lincoln Stein,et al.  Reactome: a knowledgebase of biological pathways , 2004, Nucleic Acids Res..

[36]  David L. Wheeler,et al.  GenBank , 2015, Nucleic Acids Res..

[37]  S. Amladi,et al.  Online Mendelian Inheritance in Man 'OMIM'. , 2003, Indian journal of dermatology, venereology and leprology.

[38]  Kiyoko F. Aoki-Kinoshita,et al.  From genomics to chemical genomics: new developments in KEGG , 2005, Nucleic Acids Res..

[39]  David S. Wishart,et al.  MSEA: a web-based tool to identify biologically meaningful patterns in quantitative metabolomic data , 2010, Nucleic Acids Res..

[40]  Ann Richard,et al.  ACToR--Aggregated Computational Toxicology Resource. , 2008, Toxicology and applied pharmacology.

[41]  John Milner,et al.  Nutrigenomics, proteomics, metabolomics, and the practice of dietetics. , 2006, Journal of the American Dietetic Association.

[42]  Alex E. Lash,et al.  Gene Expression Omnibus: NCBI gene expression and hybridization array data repository , 2002, Nucleic Acids Res..

[43]  Ulrike Schmidt,et al.  SuperToxic: a comprehensive database of toxic compounds , 2008, Nucleic Acids Res..

[44]  R. Altman,et al.  PharmGKB: understanding the effects of individual genetic variants. , 2008, Drug metabolism reviews.

[45]  Lincoln Stein,et al.  The SNP Consortium website: past, present and future , 2003, Nucleic Acids Res..

[46]  David S. Wishart,et al.  HMDB: a knowledgebase for the human metabolome , 2008, Nucleic Acids Res..

[47]  Kevin A Clauson,et al.  Ability of online drug databases to assist in clinical decision-making with infectious disease therapies , 2008, BMC infectious diseases.

[48]  Miron Livny,et al.  BioMagResBank , 2007, Nucleic Acids Res..