Nucleic Acids Research Advance Access published October 18, 2007 ChemBank: a small-molecule screening and

ChemBank (http://chembank.broad.harvard.edu/) is a public, web-based informatics environment developed through a collaboration between the Chemical Biology Program and Platform at the Broad Institute of Harvard and MIT. This knowledge environment includes freely available data derived from small molecules and small-molecule screens and resources for studying these data. ChemBank is unique among small-molecule databases in its dedication to the storage of raw screening data, its rigorous definition of screening experiments in terms of statistical hypothesis testing, and its metadata-based organization of screening experiments into projects involving collections of related assays. ChemBank stores an increasingly varied set of measurements derived from cells and other biological assay systems treated with small molecules. Analysis tools are available and are continuously being developed that allow the relationships between small molecules, cell measurements, and cell states to be studied. Currently, ChemBank stores information on hundreds of thousands of small molecules and hundreds of biomedically relevant assays that have been performed at the Broad Institute by collaborators from the worldwide research community. The goal of ChemBank is to provide life scientists unfettered access to biomedically relevant data and tools heretofore available primarily in the private sector.

[1]  R. Strausberg,et al.  From Knowing to Controlling: A Path from Genomics to Drugs Using Small Molecule Probes , 2003, Science.

[2]  Paul A Clemons,et al.  Chemical genomic profiling of biological networks using graph theory and combinations of small molecule perturbations. , 2003, Journal of the American Chemical Society.

[3]  Kiyoko F. Aoki-Kinoshita,et al.  From genomics to chemical genomics: new developments in KEGG , 2005, Nucleic Acids Res..

[4]  Stuart L Schreiber,et al.  Synthesis and cellular profiling of diverse organosilicon small molecules. , 2007, Journal of the American Chemical Society.

[5]  Ji-Hu Zhang,et al.  Probing the Primary Screening Efficiency by Multiple Replicate Testing: A Quantitative Analysis of Hit Confirmation and False Screening Results of a Biochemical Assay , 2005, Journal of biomolecular screening.

[6]  Bert Gunter,et al.  Improved Statistical Methods for Hit Selection in High-Throughput Screening , 2003, Journal of biomolecular screening.

[7]  Paul A Clemons,et al.  High-throughput Identification of Phage-derived Imaging Agents , 2006, Molecular imaging.

[8]  Bert Gunter,et al.  Statistical and Graphical Methods for Quality Control Determination of High-Throughput Screening Data , 2003, Journal of biomolecular screening.

[9]  Gene Ontology Consortium,et al.  The Gene Ontology (GO) project in 2006 , 2005, Nucleic Acids Res..

[10]  Naoki Kimura,et al.  Methylation profiles of genes utilizing newly developed CpG island methylation microarray on colorectal cancer patients , 2005, Nucleic acids research.

[11]  David M. Rocke,et al.  Predicting ligand binding to proteins by affinity fingerprinting. , 1995, Chemistry & biology.

[12]  P. Clemons,et al.  Small molecules, big players: the National Cancer Institute's Initiative for Chemical Genetics. , 2006, Cancer research.

[13]  Catherine Brooksbank,et al.  The European Bioinformatics Institute's data resources: towards systems biology , 2004, Nucleic Acids Res..

[14]  J H Zhang,et al.  Confirmation of primary active substances from high throughput screening of chemical and biological populations: a statistical approach and practical considerations. , 2000, Journal of combinatorial chemistry.

[15]  David E Root,et al.  Detecting Spatial Patterns in Biological Array Experiments , 2003, Journal of biomolecular screening.

[16]  Peter Ertl,et al.  WWW-based chemical information system , 1997 .

[17]  Brian K. Shoichet,et al.  ZINC - A Free Database of Commercially Available Compounds for Virtual Screening , 2005, J. Chem. Inf. Model..

[18]  P. R. Bevington,et al.  Data Reduction and Error Analysis for the Physical Sciences , 1969 .

[19]  A. Fliri,et al.  Biological spectra analysis: Linking biological activity profiles to molecular structure. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[20]  Paul A Clemons,et al.  A pipeline for ligand discovery using small-molecule microarrays. , 2007, Current opinion in chemical biology.

[21]  Xin Wen,et al.  BindingDB: a web-accessible database of experimentally determined protein–ligand binding affinities , 2006, Nucleic Acids Res..

[22]  Paul A Clemons,et al.  Relationship of stereochemical and skeletal diversity of small molecules to cellular measurement space. , 2004, Journal of the American Chemical Society.

[23]  D. Zaharevitz,et al.  COMPARE: a web accessible tool for investigating mechanisms of cell growth inhibition. , 2002, Journal of molecular graphics & modelling.

[24]  Andrew I Su,et al.  An efficient rapid system for profiling the cellular activities of molecular libraries. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[25]  Thomas D. Y. Chung,et al.  A Simple Statistical Parameter for Use in Evaluation and Validation of High Throughput Screening Assays , 1999, Journal of biomolecular screening.

[26]  David S. Wishart,et al.  DrugBank: a comprehensive resource for in silico drug discovery and exploration , 2005, Nucleic Acids Res..

[27]  Tatiana A. Tatusova,et al.  Entrez Gene: gene-centered information at NCBI , 2004, Nucleic Acids Res..

[28]  David E Root,et al.  A flexible data analysis tool for chemical genetic screens. , 2004, Chemistry & biology.