Critical evaluation of search algorithms for automated molecular docking and database screening

The DOCK program explores possible orientations of a molecule within a macromolecular active site by superimposing atoms onto precomputed site points. Here we compare a number of different search methods, including an exhaustive matching algorithm based on a single docking graph. We evaluate the performance of each method by screening a small database of molecules to a variety of macromolecular targets. By varying the amount of sampling, we can monitor the time convergence of scores and rankings. We not only show that the site point–directed search is tenfold faster than a random search, but that the single graph matching algorithm boosts the speed of database screening up to 60‐fold. The new algorithm, in fact, outperforms the bipartite graph matching algorithm currently used in DOCK. The results indicate that a critical issue for rapid database screening is the extent to which a search method biases run time toward the highest‐ranking molecules. The single docking graph matching algorithm will be incorporated into DOCK version 4.0. © 1997 John Wiley & Sons, Inc. J Comput Chem 18: 1175–1189

[1]  A. Rich,et al.  Molecular structure of the A-tract DNA dodecamer d(CGCAAATTTGCG) complexed with the minor groove binding drug netropsin. , 1993, Biochemistry.

[2]  M Karplus,et al.  HOOK: A program for finding novel molecular architectures that satisfy the chemical and steric requirements of a macromolecule binding site , 1994, Proteins.

[3]  D Fischer,et al.  Molecular surface representations by sparse critical points , 1994, Proteins.

[4]  Ruben Abagyan,et al.  ICM—A new method for protein modeling and design: Applications to docking and structure prediction from the distorted native conformation , 1994, J. Comput. Chem..

[5]  A Aitken,et al.  Expression and structural analysis of 14-3-3 proteins. , 1995, Journal of molecular biology.

[6]  M. Karplus,et al.  Functionality maps of binding sites: A multiple copy simultaneous search method , 1991, Proteins.

[7]  Daniel A. Gschwend,et al.  Orientational sampling and rigid-body minimization in molecular docking revisited: On-the-fly optimization and degeneracy removal , 1996, J. Comput. Aided Mol. Des..

[8]  Robert P. Sheridan,et al.  Using CONCORD to construct a large database of three-dimensional coordinates from connection tables , 1989, J. Chem. Inf. Comput. Sci..

[9]  J. Hermans,et al.  A different best rigid-body molecular fit routine , 1977 .

[10]  M. L. Mason,et al.  Three‐dimensional structure of a fluorescein–Fab complex crystallized in 2‐methyl‐2,4‐pentanediol , 1989, Proteins.

[11]  D. Norbeck,et al.  Design, activity, and 2.8 A crystal structure of a C2 symmetric inhibitor complexed to HIV-1 protease. , 1990, Science.

[12]  R. Nussinov,et al.  A geometry-based suite of molecular docking processes. , 1995, Journal of molecular biology.

[13]  Thomas Lengauer,et al.  Placement of medium-sized molecular fragments into active sites of proteins , 1996, J. Comput. Aided Mol. Des..

[14]  A. Leach,et al.  Ligand docking to proteins with discrete side-chain flexibility. , 1994, Journal of molecular biology.

[15]  I. Kuntz,et al.  Automated docking with grid‐based energy evaluation , 1992 .

[16]  Andrew Smellie,et al.  Fast drug-receptor mapping by site-directed distances: a novel method of predicting new pharmacological leads , 1991, J. Chem. Inf. Comput. Sci..

[17]  Brian K. Shoichet,et al.  Molecular docking using shape descriptors , 1992 .

[18]  I. Kuntz,et al.  Conformational analysis of flexible ligands in macromolecular receptor sites , 1992 .

[19]  W. Howe,et al.  Computer design of bioactive molecules: A method for receptor‐based de novo ligand design , 1991, Proteins.

[20]  I. Kuntz Structure-Based Strategies for Drug Design and Discovery , 1992, Science.

[21]  E. Katchalski‐Katzir,et al.  Molecular surface recognition: determination of geometric fit between proteins and their ligands by correlation techniques. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[22]  I. Kuntz,et al.  Docking flexible ligands to macromolecular receptors by molecular shape. , 1986, Journal of medicinal chemistry.

[23]  J M Blaney,et al.  A geometric approach to macromolecule-ligand interactions. , 1982, Journal of molecular biology.

[24]  Robert P. Sheridan,et al.  FLOG: A system to select ‘quasi-flexible’ ligands complementary to a receptor of known three-dimensional structure , 1994, J. Comput. Aided Mol. Des..

[25]  C. Bron,et al.  Algorithm 457: finding all cliques of an undirected graph , 1973 .

[26]  J. Scott Dixon,et al.  A shape- and chemistry-based docking method and its use in the design of HIV-1 protease inhibitors , 1994, J. Comput. Aided Mol. Des..

[27]  M. Lawrence,et al.  CLIX: A search algorithm for finding novel ligands capable of binding proteins of known three‐dimensional structure , 1992, Proteins.

[28]  J. Scott Dixon,et al.  Flexible ligand docking using a genetic algorithm , 1995, J. Comput. Aided Mol. Des..

[29]  A. Treasurywala,et al.  A genetic algorithm based method for docking flexible molecules , 1994 .

[30]  G J Williams,et al.  The Protein Data Bank: a computer-based archival file for macromolecular structures. , 1977, Journal of molecular biology.

[31]  Yuan-Ping Pang,et al.  Prediction of the binding site of 1-benzyl-4-[(5,6-dimethoxy-1-indanon-2-yl)methyl]piperidine in acetylcholinesterase by docking studies with the SYSDOC program , 1994, J. Comput. Aided Mol. Des..

[32]  D. Goodsell,et al.  Automated docking of substrates to proteins by simulated annealing , 1990, Proteins.

[33]  D. K. Friesen,et al.  A combinatorial algorithm for calculating ligand binding , 1984 .

[34]  S E Ealick,et al.  Application of crystallographic and modeling methods in the design of purine nucleoside phosphorylase inhibitors. , 1993, Proceedings of the National Academy of Sciences of the United States of America.

[35]  J. Bolin,et al.  Crystal structures of Escherichia coli and Lactobacillus casei dihydrofolate reductase refined at 1.7 A resolution. I. General features and binding of methotrexate. , 1982, The Journal of biological chemistry.

[36]  I. Kuntz,et al.  Molecular docking to ensembles of protein structures. , 1997, Journal of molecular biology.

[37]  Daniel A. Gschwend,et al.  Orientational sampling and rigid‐body minimization in molecular docking , 1993, Proteins.

[38]  Hans-Joachim Böhm,et al.  The computer program LUDI: A new method for the de novo design of enzyme inhibitors , 1992, J. Comput. Aided Mol. Des..

[39]  M. Mizutani,et al.  Rational automatic search method for stable docking models of protein and ligand. , 1994, Journal of molecular biology.

[40]  R. Glen,et al.  Molecular recognition of receptor sites using a genetic algorithm with a description of desolvation. , 1995, Journal of molecular biology.

[41]  Hans-Joachim Böhm,et al.  On the use of LUDI to search the Fine Chemicals Directory for ligands of proteins of known three-dimensional structure , 1994, J. Comput. Aided Mol. Des..

[42]  H J Berendsen,et al.  Molecular dynamics simulation of the docking of substrates to proteins , 1994, Proteins.

[43]  A Caflisch,et al.  Monte Carlo docking of oligopeptides to proteins , 1992, Proteins.

[44]  R C Wade,et al.  Identifying targets for bioreductive agents: using GRID to predict selective binding regions of proteins. , 1989, Journal of molecular graphics.