Small molecule affinity fingerprinting. A tool for enzyme family subclassification, target identification, and inhibitor design.

Classifying proteins into functionally distinct families based only on primary sequence information remains a difficult task. We describe here a method to generate a large data set of small molecule affinity fingerprints for a group of closely related enzymes, the papain family of cysteine proteases. Binding data was generated for a library of inhibitors based on the ability of each compound to block active-site labeling of the target proteases by a covalent activity based probe (ABP). Clustering algorithms were used to automatically classify a reference group of proteases into subfamilies based on their small molecule affinity fingerprints. This approach was also used to identify cysteine protease targets modified by the ABP in complex proteomes by direct comparison of target affinity fingerprints with those of the reference library of proteases. Finally, experimental data were used to guide the development of a computational method that predicts small molecule inhibitors based on reported crystal structures. This method could ultimately be used with large enzyme families to aid in the design of selective inhibitors of targets based on limited structural/function information.

[1]  I D Kuntz,et al.  Structure-based design and combinatorial chemistry yield low nanomolar inhibitors of cathepsin D. , 1997, Chemistry & biology.

[2]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[3]  S Toba,et al.  Selective targeting of lysosomal cysteine proteases with radiolabeled electrophilic substrate analogs. , 2000, Chemistry & biology.

[4]  W. L. Jorgensen,et al.  Comparison of simple potential functions for simulating liquid water , 1983 .

[5]  A. Burlingame,et al.  Chemical Approaches for Functionally Probing the Proteome* , 2002, Molecular & Cellular Proteomics.

[6]  Malin M. Young,et al.  Design, docking, and evaluation of multiple libraries against multiple targets , 2001, Proteins.

[7]  D. Turk,et al.  E-64 analogues as inhibitors of cathepsin B. On the role of the absolute configuration of the epoxysuccinyl group. , 1997, Bioorganic & Medicinal Chemistry.

[8]  Xiayang Qiu,et al.  Crystal structure of human osteoclast cathepsin K complex with E-64 , 1997, Nature Structural Biology.

[9]  M. Bogyo,et al.  Global analysis of proteasomal substrate specificity using positional-scanning libraries of covalent inhibitors , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[10]  D. Rich,et al.  Mechanistic studies on the inactivation of papain by epoxysuccinyl inhibitors. , 1996, Journal of medicinal chemistry.

[11]  A. Barrett,et al.  L-trans-Epoxysuccinyl-leucylamido(4-guanidino)butane (E-64) and its analogues as inhibitors of cysteine proteinases including cathepsins B, H and L. , 1982, The Biochemical journal.

[12]  A. Burlingame,et al.  Epoxide electrophiles as activity-dependent cysteine protease profiling and discovery tools. , 2000, Chemistry & biology.

[13]  D. Eisenberg,et al.  Protein function in the post-genomic era , 2000, Nature.

[14]  Peter A. Kollman,et al.  AMBER, a package of computer programs for applying molecular mechanics, normal mode analysis, molecular dynamics and free energy calculations to simulate the structural and energetic properties of molecules , 1995 .

[15]  Piotr Cieplak,et al.  Molecular dynamics and free energy analyses of cathepsin D-inhibitor interactions: insight into structure-based ligand design. , 2002, Journal of medicinal chemistry.

[16]  S. Frye Structure-activity relationship homology (SARAH): a conceptual framework for drug discovery in the genomic era. , 1999, Chemistry & biology.

[17]  Todd J. A. Ewing,et al.  DOCK 4.0: Search strategies for automated molecular docking of flexible molecule databases , 2001, J. Comput. Aided Mol. Des..

[18]  I D Kuntz,et al.  Inhibitors of kinesin activity from structure-based computer screening. , 2000, Biochemistry.

[19]  R. Huber,et al.  Crystal structure of cathepsin B inhibited with CA030 at 2.0-A resolution: A basis for the design of specific epoxysuccinyl inhibitors. , 1995, Biochemistry.

[20]  J. Skolnick,et al.  Method for prediction of protein function from sequence using the sequence-to-structure-to-function paradigm with application to glutaredoxins/thioredoxins and T1 ribonucleases. , 1998, Journal of molecular biology.

[21]  R J Fletterick,et al.  A target within the target: probing cruzain's P1' site to define structural determinants for the Chagas' disease protease. , 2000, Structure.

[22]  E. Fluder,et al.  Latent semantic structure indexing (LaSSI) for defining chemical similarity. , 2001, Journal of medicinal chemistry.

[23]  Sunny Shin,et al.  Substrate binding and sequence preference of the proteasome revealed by active-site-directed affinity probes. , 1998, Chemistry & biology.

[24]  D. Turk,et al.  Revised Definition of Substrate Binding Sites of Papain-Like Cysteine Proteases , 1998, Biological chemistry.

[25]  P. Kollman,et al.  Calculating structures and free energies of complex molecules: combining molecular mechanics and continuum models. , 2000, Accounts of chemical research.

[26]  D. Turk,et al.  Crystal structure of MHC class II‐associated p41 Ii fragment bound to cathepsin L reveals the structural basis for differentiation between cathepsins L and S , 1999, The EMBO journal.

[27]  D. Botstein,et al.  Cluster analysis and display of genome-wide expression patterns. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[28]  J T Palmer,et al.  Crystal structure of human cathepsin S , 1998, Protein science : a publication of the Protein Society.

[29]  David M. Rocke,et al.  Predicting ligand binding to proteins by affinity fingerprinting. , 1995, Chemistry & biology.