Knowledge-based design of target-focused libraries using protein-ligand interaction constraints.

Here we present a new strategy for designing and filtering potentially massive combinatorial libraries using structural information of a binding site. We have developed a variation of the structural interaction fingerprint (SIFt) named r-SIFt, which incorporates the binding interactions of variable fragments in a combinatorial library. This method takes into account the 3D structure of the active site of the target molecule and translates desirable ligand-target binding interactions into library filtering constraints. We show using the MAP kinase p38 as a test case that we can efficiently analyze and classify compounds on the basis of their abilities to interact with the target in the desired binding mode. On the basis of these classifications, decision tree models were generated using the molecular descriptors of the compounds as predictor variables. Our results suggest that r-SIFt coupled with the classification models should be a valuable tool for structure-based focusing of combinatorial chemical libraries.

[1]  P. Charifson,et al.  Improved scoring of ligand-protein interactions using OWFEG free energy grids. , 2001, Journal of medicinal chemistry.

[2]  Martyn G. Ford,et al.  Unsupervised Forward Selection: A Method for Eliminating Redundant Variables , 2000, J. Chem. Inf. Comput. Sci..

[3]  J. Gasteiger,et al.  Mining High-Throughput Screening Data of Combinatorial Libraries: Development of a Filter to Distinguish Hits from Nonhits. , 2004 .

[4]  Jim Austin,et al.  Chemical similarity searching using a neural graph matcher , 2005, ESANN.

[5]  Harren Jhoti,et al.  High-throughput crystallography for lead discovery in drug design , 2002, Nature Reviews Drug Discovery.

[6]  Mark A. Murcko,et al.  Virtual screening : an overview , 1998 .

[7]  D. Zaller,et al.  Structural basis for p38alpha MAP kinase quinazolinone and pyridol-pyrimidine inhibitor specificity. , 2003 .

[8]  Thomas Lengauer,et al.  A fast flexible docking method using an incremental construction algorithm. , 1996, Journal of molecular biology.

[9]  Susan S. Taylor,et al.  Three protein kinase structures define a common motif. , 1994, Structure.

[10]  Anil K. Jain,et al.  Clustering Methodologies in Exploratory Data Analysis , 1980, Adv. Comput..

[11]  Jennifer L. Miller,et al.  Combinatorial Library Design: Maximizing Model-Fitting Compounds within Matrix Synthesis Constraints , 2000, J. Chem. Inf. Comput. Sci..

[12]  M. Murcko,et al.  Consensus scoring: A method for obtaining improved hit rates from docking databases of three-dimensional structures into proteins. , 1999, Journal of medicinal chemistry.

[13]  Darren V. S. Green,et al.  PLUMS: a Program for the Rapid Optimization of Focused Libraries , 2000, J. Chem. Inf. Comput. Sci..

[14]  Valler,et al.  Diversity screening versus focussed screening in drug discovery. , 2000, Drug discovery today.

[15]  Trudi Wright,et al.  Optimizing the Size and Configuration of Combinatorial Libraries , 2003, J. Chem. Inf. Comput. Sci..

[16]  Huafeng Xu,et al.  Retrospect and prospect of virtual screening in drug discovery. , 2002, Current topics in medicinal chemistry.

[17]  D. Zaller,et al.  Structural basis for p38α MAP kinase quinazolinone and pyridol-pyrimidine inhibitor specificity , 2003, Nature Structural Biology.

[18]  Arup K. Ghose,et al.  Combinatorial Library Design and Evaluation: Principles, Software, Tools, and Applications in Drug Discovery , 2001 .

[19]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[20]  E. Goldsmith,et al.  Structural basis of inhibitor selectivity in MAP kinases. , 1998, Structure.

[21]  Y. Martin,et al.  A general and fast scoring function for protein-ligand interactions: a simplified potential approach. , 1999, Journal of medicinal chemistry.

[22]  G. Klebe,et al.  Knowledge-based scoring function to predict protein-ligand interactions. , 2000, Journal of molecular biology.

[23]  G. V. Paolini,et al.  Empirical scoring functions: I. The development of a fast empirical scoring function to estimate the binding affinity of ligands in receptor complexes , 1997, J. Comput. Aided Mol. Des..

[24]  Zhan Deng,et al.  Interaction profiles of protein kinase-inhibitor complexes and their application to virtual screening. , 2005, Journal of medicinal chemistry.

[25]  Z. Deng,et al.  Structural interaction fingerprint (SIFt): a novel method for analyzing three-dimensional protein-ligand binding interactions. , 2004, Journal of medicinal chemistry.

[26]  H. Jhoti High‐Throughput Crystallography , 2008 .

[27]  T. Hunter,et al.  The eukaryotic protein kinase superfamily: kinase (catalytic) domain structure and classification 1 , 1995, FASEB journal : official publication of the Federation of American Societies for Experimental Biology.

[28]  Marvin Waldman,et al.  Design of focused and restrained subsets from extremely large virtual libraries. , 2003, Journal of molecular graphics & modelling.

[29]  J. Adams,et al.  Recent progress towards the identification of selective inhibitors of serine/threonine protein kinases. , 1999, Current opinion in drug discovery & development.

[30]  P Willett,et al.  Development and validation of a genetic algorithm for flexible docking. , 1997, Journal of molecular biology.