Recovering the true targets of specific ligands by virtual screening of the protein data bank

The Protein Data Bank (PDB) has been processed to extract a screening protein library (sc‐PDB) of 2148 entries. A knowledge‐based detection algorithm has been applied to 18,000 PDB files to find regular expressions corresponding to either protein, ions, co‐factors, solvent, or ligand atoms. The sc‐PDB database comprises high‐resolution X‐ray structures of proteins for which (i) a well‐defined active site exists, (ii) the bound‐ligand is a small molecular weight molecule. The database has been screened by an inverse docking tool derived from the GOLD program to recover the known target of four unrelated ligands. Both the database and the inverse screening procedures are accurate enough to rank the true target of the four investigated ligands among the top 1% scorers, with 70–100 fold enrichment with respect to random screening. Applying the proposed screening procedure to a small‐sized generic ligand was much less accurate suggesting that inverse screening shall be reserved to rather selective compounds. Proteins 2004;54:000–000. © 2004 Wiley‐Liss, Inc.

[1]  J M Blaney,et al.  A geometric approach to macromolecule-ligand interactions. , 1982, Journal of molecular biology.

[2]  J. Bolin,et al.  Crystal structures of Escherichia coli and Lactobacillus casei dihydrofolate reductase refined at 1.7 A resolution. I. General features and binding of methotrexate. , 1982, The Journal of biological chemistry.

[3]  H Schriefers,et al.  Antagonistic action of estrogens, flutamide, and human growth hormone on androgen-induced changes in the activities of some enzymes of hepatic steroid metabolism in the rat. , 1983, Endocrinology.

[4]  J. Wendoloski,et al.  Structural origins of high-affinity biotin binding to streptavidin. , 1989, Science.

[5]  R. Cramer,et al.  Validation of the general purpose tripos 5.2 force field , 1989 .

[6]  F A Quiocho,et al.  Atomic structure of adenosine deaminase complexed with a transition-state analog: understanding catalysis and immunodeficiency mutations. , 1991, Science.

[7]  B. Cronstein,et al.  The antiinflammatory mechanism of methotrexate. Increased adenosine release at inflamed sites diminishes leukocyte accumulation in an in vivo model of inflammation. , 1993, The Journal of clinical investigation.

[8]  R. Laskowski SURFNET: a program for visualizing molecular surfaces, cavities, and intermolecular interactions. , 1995, Journal of molecular graphics.

[9]  MDL Information Systems, Inc. , 1995, Environmental science & technology.

[10]  A. Alhomida,et al.  Kinetics for camel (Camelus dromedarius) retina acetylcholinesterase inhibition by methotrexate in vitro. , 1996, Japanese journal of pharmacology.

[11]  P Willett,et al.  Development and validation of a genetic algorithm for flexible docking. , 1997, Journal of molecular biology.

[12]  M Hendlich,et al.  LIGSITE: automatic and efficient detection of potential small molecule-binding sites in proteins. , 1997, Journal of molecular graphics & modelling.

[13]  J. Kraut,et al.  Loop and subdomain movements in the mechanism of Escherichia coli dihydrofolate reductase: crystallographic evidence. , 1997, Biochemistry.

[14]  David A. Agard,et al.  The Structural Basis of Estrogen Receptor/Coactivator Recognition and the Antagonism of This Interaction by Tamoxifen , 1998, Cell.

[15]  Thomas Lengauer,et al.  Evaluation of the FLEXX incremental construction algorithm for protein–ligand docking , 1999, Proteins.

[16]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[17]  G. Habrioux,et al.  Effects of phytoestrogens on aromatase, 3β and 17β-hydroxysteroid dehydrogenase activities and human breast cancer cells. , 2000 .

[18]  Amos Bairoch,et al.  The ENZYME database in 2000 , 2000, Nucleic Acids Res..

[19]  Y.Z. Chen,et al.  Ligand–protein inverse docking and its potential use in the computer search of protein targets of a small molecule , 2001, Proteins.

[20]  C L Brooks,et al.  Ligand-protein database: linking protein-ligand complex structures to binding data. , 2001, Journal of medicinal chemistry.

[21]  R Abagyan,et al.  High-throughput docking for lead generation. , 2001, Current opinion in chemical biology.

[22]  Roman A. Laskowski,et al.  PDBsum: summaries and analyses of PDB structures , 2001, Nucleic Acids Res..

[23]  Gisbert Schneider,et al.  Virtual screening and fast automated docking methods. , 2002, Drug discovery today.

[24]  Robin Taylor,et al.  A new test set for validating predictions of protein–ligand interaction , 2002, Proteins.

[25]  Valentin A. Ilyin,et al.  LigBase: a database of families of aligned ligand binding sites in known protein sequences and structures , 2002, Bioinform..

[26]  1L1400 Protein Ligand Interaction Database PrlLINT and Its Application to QSAR Analysis , 2002 .

[27]  Some Coumarins and Triphenylethene Derivatives as Inhibitors of Human Testes Microsomal 17β-hydroxysteroid Dehydrogenase (17β-HSD Type 3): Further Studies with Tamoxifen on the Rat Testes Microsomal Enzyme , 2002, Journal of enzyme inhibition and medicinal chemistry.

[28]  Ruth Nussinov,et al.  Principles of docking: An overview of search algorithms and a guide to scoring functions , 2002, Proteins.

[29]  Didier Rognan,et al.  ConsDock: A new program for the consensus analysis of protein–ligand interactions , 2002, Proteins.

[30]  Gerhard Klebe,et al.  Relibase: design and development of a database for comprehensive analysis of protein-ligand interactions. , 2003, Journal of molecular biology.

[31]  Didier Rognan,et al.  Protein‐based virtual screening of chemical databases. II. Are homology models of g‐protein coupled receptors suitable targets? , 2002, Proteins.

[32]  E. Heiss,et al.  Mechanism-based in vitro screening of potential cancer chemopreventive agents. , 2003, Mutation research.

[33]  Maria Jesus Martin,et al.  The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003 , 2003, Nucleic Acids Res..