Introduction to cheminformatics.

Cheminformatics is a relatively new field of information technology that focuses on the collection, storage, analysis, and manipulation of chemical data. The chemical data of interest typically includes information on small molecule formulas, structures, properties, spectra, and activities (biological or industrial). Cheminformatics originally emerged as a vehicle to help the drug discovery and development process, however cheminformatics now plays an increasingly important role in many areas of biology, chemistry, and biochemistry. The intent of this unit is to give readers some introduction into the field of cheminformatics and to show how cheminformatics not only shares many similarities with the field of bioinformatics, but that it can also enhance much of what is currently done in bioinformatics.

[1]  Igor V Tetko,et al.  The WWW as a tool to obtain molecular parameters. , 2003, Mini reviews in medicinal chemistry.

[2]  Ying Zhang,et al.  HMDB: the Human Metabolome Database , 2007, Nucleic Acids Res..

[3]  Kiyoko F. Aoki-Kinoshita,et al.  From genomics to chemical genomics: new developments in KEGG , 2005, Nucleic Acids Res..

[4]  F. Brown Chapter 35 – Chemoinformatics: What is it and How does it Impact Drug Discovery. , 1998 .

[5]  J F Gibrat,et al.  Surprising similarities in structure comparison. , 1996, Current opinion in structural biology.

[6]  George W. A. Milne,et al.  National Cancer Institute Drug Information System 3D Database , 1994, J. Chem. Inf. Comput. Sci..

[7]  Michael Lappe,et al.  A fully automatic evolutionary classification of protein folds: Dali Domain Dictionary version 3 , 2001, Nucleic Acids Res..

[8]  Peter D. Karp,et al.  MetaCyc: a multiorganism database of metabolic pathways and enzymes. , 2004, Nucleic acids research.

[9]  Catherine Brooksbank,et al.  The European Bioinformatics Institute's data resources: towards systems biology , 2004, Nucleic Acids Res..

[10]  S. B. Needleman,et al.  A general method applicable to the search for similarities in the amino acid sequence of two proteins. , 1970, Journal of molecular biology.

[11]  Brian K. Shoichet,et al.  ZINC - A Free Database of Commercially Available Compounds for Virtual Screening , 2005, J. Chem. Inf. Model..

[12]  T. N. Bhat,et al.  The Protein Data Bank: unifying the archive , 2002, Nucleic Acids Res..

[13]  X. Chen,et al.  TTD: Therapeutic Target Database , 2002, Nucleic Acids Res..

[14]  Adel Golovin,et al.  MSDsite: A database search and retrieval system for the analysis and viewing of bound ligands and active sites , 2004, Proteins.

[15]  J. Gasteiger,et al.  FROM ATOMS AND BONDS TO THREE-DIMENSIONAL ATOMIC COORDINATES : AUTOMATIC MODEL BUILDERS , 1993 .

[16]  C. Hansch,et al.  Quantitative structure-activity relationships of cytochrome P-450. , 1993, Drug metabolism reviews.

[17]  David Weininger,et al.  SMILES, a chemical language and information system. 1. Introduction to methodology and encoding rules , 1988, J. Chem. Inf. Comput. Sci..

[18]  Russ B. Altman,et al.  PharmGKB: the Pharmacogenetics Knowledge Base , 2002, Nucleic Acids Res..

[19]  Mark Watson,et al.  Optimizing the use of open-source software applications in drug discovery. , 2006, Drug discovery today.

[20]  Philip E. Bourne,et al.  A database and tools for 3-D protein structure comparison and alignment using the Combinatorial Extension (CE) algorithm , 2001, Nucleic Acids Res..

[21]  Hege S. Beard,et al.  Glide: a new approach for rapid, accurate docking and scoring. 2. Enrichment factors in database screening. , 2004, Journal of medicinal chemistry.

[22]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[23]  Zukang Feng,et al.  Ligand Depot: a data warehouse for ligands bound to macromolecules , 2004, Bioinform..

[24]  T Lengauer,et al.  CASP2 experiences with docking flexible ligands using FLEXX , 1997, Proteins.

[25]  Maria Jesus Martin,et al.  High-quality Protein Knowledge Resource: SWISS-PROT and TrEMBL , 2002, Briefings Bioinform..

[26]  I. Kuntz,et al.  Matching chemistry and shape in molecular docking. , 1993, Protein engineering.

[27]  R. Beger,et al.  Monitoring the health to disease continuum with global metabolic profiling and systems biology. , 2006, Pharmacogenomics.

[28]  Julian R. Ullmann,et al.  An Algorithm for Subgraph Isomorphism , 1976, J. ACM.

[29]  Johannes H. Voigt,et al.  Comparison of the NCI Open Database with Seven Large Chemical Structural Databases , 2001, J. Chem. Inf. Comput. Sci..

[30]  X Yang,et al.  A collaborative hit-to-lead investigation leveraging medicinal chemistry expertise with high throughput library design, synthesis and purification capabilities. , 2006, Combinatorial chemistry & high throughput screening.

[31]  H. Waterbeemd,et al.  Can the Internet help to meet the challenges in ADME and e-ADME? , 2002 .

[32]  Frank Oellien,et al.  Enhanced CACTVS Browser of the Open NCI Database , 2002, J. Chem. Inf. Comput. Sci..

[33]  Götz Schlotterbeck,et al.  Metabolic profiling technologies for biomarker discovery in biomedicine and drug development. , 2006, Pharmacogenomics.

[34]  Jaime Prilusky,et al.  GeneCards: a novel functional genomics compendium with automated data mining and query reformulation support , 1998, Bioinform..

[35]  Tingjun Hou,et al.  ADME Evaluation in Drug Discovery. 3. Modeling Blood-Brain Barrier Partitioning Using Simple Molecular Descriptors , 2003, J. Chem. Inf. Comput. Sci..

[36]  Cathy H. Wu,et al.  The Universal Protein Resource (UniProt) , 2004, Nucleic Acids Res..

[37]  David S. Wishart,et al.  DrugBank: a comprehensive resource for in silico drug discovery and exploration , 2005, Nucleic Acids Res..

[38]  Søren Brunak,et al.  Prediction methods and databases within chemoinformatics : Emphasis on drugs and drug candidates , 2005 .