Proteome-derived, database-searchable peptide libraries for identifying protease cleavage sites

We introduce human proteome–derived, database-searchable peptide libraries for characterizing sequence-specific protein interactions. To identify endoprotease cleavage sites, we used peptides in such libraries with protected primary amines to simultaneously determine sequence preferences on the N-terminal (nonprime P) and C-terminal (prime P′) sides of the scissile bond. Prime-side cleavage products were tagged with biotin, isolated and identified by tandem mass spectrometry, and the corresponding nonprime-side sequences were derived from human proteome databases using bioinformatics. Identification of hundreds to over 1,000 individual cleaved peptides allows the consensus protease cleavage site and subsite cooperativity to be readily determined from P6 to P6′. For the highly specific GluC protease, >95% of the 558 cleavage sites identified displayed the canonical selectivity. For the broad-specificity matrix metalloproteinase 2, >1,200 peptidic cleavage sites were identified. Profiling of HIV protease 1, caspase 3, caspase 7, cathepsins K and G, elastase and thrombin showed that this approach is broadly applicable to all mechanistic classes of endoproteases.

[1]  Lennart Martens,et al.  PRIDE: The proteomics identifications database , 2005, Proteomics.

[2]  Christopher M. Overall,et al.  Validating matrix metalloproteinases as drug targets and anti-targets for cancer therapy , 2006, Nature Reviews Cancer.

[3]  Dustin J Maly,et al.  Peptide microarrays for the determination of protease substrate specificity. , 2002, Journal of the American Chemical Society.

[4]  Jeffrey W. Smith,et al.  A Unique Substrate Recognition Profile for Matrix Metalloproteinase-2* , 2002, The Journal of Biological Chemistry.

[5]  F. Gomis-Rüth,et al.  Structural aspects of the metzincin clan of metalloendopeptidases , 2003, Molecular biotechnology.

[6]  C. López-Otín,et al.  Protease degradomics: A new challenge for proteomics , 2002, Nature Reviews Molecular Cell Biology.

[7]  Aaron Shipway,et al.  Activity profile of dust mite allergen extract using substrate libraries and functional proteomic microarrays. , 2004, Chemistry & biology.

[8]  H. Birkedal‐Hansen,et al.  Comparative sequence specificities of human 72- and 92-kDa gelatinases (type IV collagenases) and PUMP (matrilysin). , 1993, Biochemistry.

[9]  Neil D. Rawlings,et al.  MEROPS: the peptidase database , 2009, Nucleic Acids Res..

[10]  A. Stephens,et al.  Site-directed mutagenesis of the reactive center (serine 394) of antithrombin III. , 1988, The Journal of biological chemistry.

[11]  Steven P Gygi,et al.  Comparative evaluation of mass spectrometry platforms used in large-scale proteomics investigations , 2005, Nature Methods.

[12]  Jennifer L. Harris,et al.  PNA-encoded protease substrate microarrays. , 2004, Chemistry & biology.

[13]  B. Turk Targeting proteases: successes, failures and future prospects , 2006, Nature Reviews Drug Discovery.

[14]  C. Overall,et al.  Discovery of Chemokine Substrates for Matrix Metalloproteinases by Exosite Scanning: A New Tool for Degradomics , 2002, Biological chemistry.

[15]  L. Liotta,et al.  The activation of human type IV collagenase proenzyme. Sequence identification of the major conversion product following organomercurial activation. , 1989, The Journal of biological chemistry.

[16]  A. Eisen,et al.  Human skin fibroblast procollagenase: mechanisms of activation by organomercurials and trypsin. , 1983, Biochemistry.

[17]  S. Diamond Methods for mapping protease specificity. , 2007, Current opinion in chemical biology.

[18]  N. Thornberry,et al.  A combinatorial approach for determining protease specificities: application to interleukin-1beta converting enzyme (ICE). , 1997, Chemistry & biology.

[19]  D. Brömme,et al.  Biochemical properties and regulation of cathepsin K activity. , 2008, Biochimie.

[20]  A. Berger,et al.  On the size of the active site in proteases. I. Papain. , 1967, Biochemical and biophysical research communications.

[21]  C. Overall,et al.  Tissue inhibitor of metalloproteinases-4 inhibits but does not support the activation of gelatinase A via efficient inhibition of membrane type 1-matrix metalloproteinase. , 2001, Cancer research.

[22]  M. Haine,et al.  Van Damme A. , 1986 .

[23]  E. Birney,et al.  The International Protein Index: An integrated database for proteomics experiments , 2004, Proteomics.

[24]  R. Beynon,et al.  Positional proteomics: selective recovery and analysis of N-terminal proteolytic peptides , 2005, Nature Methods.

[25]  N. Thornberry,et al.  A Combinatorial Approach Defines Specificities of Members of the Caspase Family and Granzyme B , 1997, The Journal of Biological Chemistry.

[26]  James C. Whisstock,et al.  PoPS: a computational tool for modeling and predicting protease specificity , 2004, Proceedings. 2004 IEEE Computational Systems Bioinformatics Conference, 2004. CSB 2004..

[27]  C. Craik,et al.  Rapid and general profiling of protease specificity by using combinatorial fluorogenic substrate libraries. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[28]  Patrick S Daugherty,et al.  Protease specificity determination by using cellular libraries of peptide substrates (CLiPS) , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[29]  C. Overall,et al.  Inflammation dampened by gelatinase A cleavage of monocyte chemoattractant protein-3. , 2000, Science.

[30]  I. Schechter Mapping of the active site of proteases in the 1960s and rational design of inhibitors/drugs in the 1990s. , 2005, Current protein & peptide science.

[31]  G. Drapeau,et al.  Purification and properties of an extracellular protease of Staphylococcus aureus. , 1972, The Journal of biological chemistry.

[32]  Gary D. Stormo,et al.  Displaying the information contents of structural RNA alignments: the structure logos , 1997, Comput. Appl. Biosci..

[33]  Alexey I Nesvizhskii,et al.  Empirical statistical model to estimate the accuracy of peptide identifications made by MS/MS and database search. , 2002, Analytical chemistry.

[34]  S. Diamond,et al.  High Throughput Substrate Specificity Profiling of Serine and Cysteine Proteases Using Solution-phase Fluorogenic Peptide Microarrays* , 2005, Molecular & Cellular Proteomics.

[35]  J. Wells,et al.  Substrate phage: selection of protease substrates by monovalent phage display. , 1993, Science.

[36]  G. Salvesen,et al.  Identification of proteolytic cleavage sites by quantitative proteomics. , 2007, Journal of proteome research.

[37]  J. Whisstock,et al.  Elucidation of the Substrate Specificity of the C1s Protease of the Classical Complement Pathway* , 2005, Journal of Biological Chemistry.

[38]  D. N. Perkins,et al.  Probability‐based protein identification by searching sequence databases using mass spectrometry data , 1999, Electrophoresis.

[39]  T. Hardingham,et al.  The interglobular domain of cartilage aggrecan is cleaved by PUMP, gelatinases, and cathepsin B. , 1992, The Journal of biological chemistry.

[40]  X. Puente,et al.  Human and mouse proteases: a comparative genomic approach , 2003, Nature Reviews Genetics.

[41]  Christopher M Overall,et al.  Proteomics Discovery of Metalloproteinase Substrates in the Cellular Context by iTRAQ™ Labeling Reveals a Diverse MMP-2 Substrate Degradome*S , 2007, Molecular & Cellular Proteomics.

[42]  M. Stack,et al.  Membrane protease proteomics: Isotope-coded affinity tag MS identification of undescribed MT1-matrix metalloproteinase substrates. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[43]  T. D. Schneider,et al.  Sequence logos: a new way to display consensus sequences. , 1990, Nucleic acids research.

[44]  K. Gevaert,et al.  Caspase-specific and nonspecific in vivo protein processing during Fas-induced apoptosis , 2005, Nature Methods.

[45]  L. Cantley,et al.  Determination of protease cleavage site motifs using mixture-based oriented peptide libraries , 2001, Nature Biotechnology.

[46]  G. Fields,et al.  Human matrix metalloproteinase specificity studies using collagen sequence-based synthetic peptides. , 1996, Biopolymers.

[47]  Lennart Martens,et al.  Automated reprocessing pipeline for searching heterogeneous mass spectrometric data of the HUPO Brain Proteome Project pilot phase , 2006, Proteomics.

[48]  C. Overall,et al.  Identification of Candidate Angiogenic Inhibitors Processed by Matrix Metalloproteinase 2 (MMP-2) in Cell-Based Proteomic Screens: Disruption of Vascular Endothelial Growth Factor (VEGF)/Heparin Affin Regulatory Peptide (Pleiotrophin) and VEGF/Connective Tissue Growth Factor Angiogenic Inhibitory Co , 2007, Molecular and Cellular Biology.

[49]  C. Overall,et al.  Proteomic discovery of protease substrates. , 2007, Current opinion in chemical biology.

[50]  K. Maskos Crystal structures of MMPs in complex with physiological and pharmacological inhibitors. , 2005, Biochimie.