Chemical compound navigator: A web‐based chem‐BLAST, chemical taxonomy‐based search engine for browsing compounds

A novel technique to annotate, query, and analyze chemical compounds has been developed and is illustrated by using the inhibitor data on HIV protease‐inhibitor complexes. In this method, all chemical compounds are annotated in terms of standard chemical structural fragments. These standard fragments are defined by using criteria, such as chemical classification; structural, chemical, or functional groups; and commercial, scientific or common names or synonyms. These fragments are then organized into a data tree based on their chemical substructures. Search engines have been developed to use this data tree to enable query on inhibitors of HIV protease (http://xpdb.nist.gov/hivsdb/hivsdb.html). These search engines use a new novel technique, Chemical Block Layered Alignment of Substructure Technique (Chem‐BLAST) to search on the fragments of an inhibitor to look for its chemical structural neighbors. This novel technique to annotate and query compounds lays the foundation for the use of the Semantic Web concept on chemical compounds to allow end users to group, sort, and search structural neighbors accurately and efficiently. During annotation, it enables the attachment of “meaning” (i.e., semantics) to data in a manner that far exceeds the current practice of associating “metadata” with data by creating a knowledge base (or ontology) associated with compounds. Intended users of the technique are the research community and pharmaceutical industry, for which it will provide a new tool to better identify novel chemical structural neighbors to aid drug discovery. Proteins 2006. © 2006 Wiley‐Liss, Inc.

[1]  Alexander Wlodawer,et al.  Application of InChI to curate, index, and query 3‐D structures , 2005, Proteins.

[2]  Michael M. Hann,et al.  RECAP-Retrosynthetic Combinatorial Analysis Procedure: A Powerful New Technique for Identifying Privileged Molecular Fragments with Useful Applications in Combinatorial Chemistry , 1998, J. Chem. Inf. Comput. Sci..

[3]  T. Bhat,et al.  Structure of HIV-1 protease with KNI-272, a tight-binding transition-state analog containing allophenylnorstatine. , 1995, Structure.

[4]  D. Joseph-McCarthy Computational approaches to structure-based ligand design. , 1999, Pharmacology & therapeutics.

[5]  I. Kuntz Structure-Based Strategies for Drug Design and Discovery , 1992, Science.

[6]  T L Blundell,et al.  Protein structure--based drug design. , 1994, Annual review of biophysics and biomolecular structure.

[7]  J. Drews Drug discovery: a historical perspective. , 2000, Science.

[8]  Michael F. Lynch,et al.  Strategic Considerations in the Design of a Screening System for Substructure Searches of Chemical Structure Files , 1973 .

[9]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[10]  T. N. Bhat,et al.  The PDB data uniformity project , 2001, Nucleic Acids Res..

[11]  T. N. Bhat,et al.  The Protein Data Bank: unifying the archive , 2002, Nucleic Acids Res..

[12]  J. Frank,et al.  Identification of the versatile scaffold protein RACK1 on the eukaryotic ribosome by cryo-EM , 2004, Nature Structural &Molecular Biology.

[13]  George W. A. Milne,et al.  An Interactive Substructure Search System , 1977, J. Chem. Inf. Comput. Sci..

[14]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[15]  G. Bemis,et al.  The properties of known drugs. 1. Molecular frameworks. , 1996, Journal of medicinal chemistry.

[16]  Maria Jesus Martin,et al.  The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003 , 2003, Nucleic Acids Res..

[17]  A. Wlodawer,et al.  Structure-based inhibitors of HIV-1 protease. , 1993, Annual review of biochemistry.

[18]  David L. Wheeler,et al.  GenBank , 2015, Nucleic Acids Res..

[19]  Alexander Wlodawer,et al.  HIVdb: A Database of the Structures of Human Immunodeficiency Virus Protease , 2002, Proteins.

[20]  T. Bhat,et al.  X-ray structure of a tethered dimer for HIV-1 protease. , 1995, Advances in experimental medicine and biology.

[21]  D. Adam Chemists synthesize a single naming system , 2002, Nature.