Structural determinants of limited proteolysis.

Limited or regulatory proteolysis plays a critical role in many important biological pathways like blood coagulation, cell proliferation, and apoptosis. A better understanding of mechanisms that control this process is required for discovering new proteolytic events and for developing inhibitors with potential therapeutic value. Two features that determine the susceptibility of peptide bonds to proteolysis are the sequence in the vicinity of the scissile bond and the structural context in which the bond is displayed. In this study, we assessed statistical significance and predictive power of individual structural descriptors and combination thereof for the identification of cleavage sites. The analysis was performed on a data set of >200 proteolytic events documented in CutDB for a variety of mammalian regulatory proteases and their physiological substrates with known 3D structures. The results confirmed the significance and provided a ranking within three main categories of structural features: exposure > flexibility > local interactions. Among secondary structure elements, the largest frequency of proteolytic cleavage was confirmed for loops and lower but significant frequency for helices. Limited proteolysis has lower albeit appreciable frequency of occurrence in certain types of β-strands, which is in contrast with some previous reports. Descriptors deduced directly from the amino acid sequence displayed only marginal predictive capabilities. Homology-based structural models showed a predictive performance comparable to protein substrates with experimentally established structures. Overall, this study provided a foundation for accurate automated prediction of segments of protein structure susceptible to proteolytic processing and, potentially, other post-translational modifications.

[1]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[2]  Christopher M Overall,et al.  Identification of proteolytic products and natural protein N-termini by Terminal Amine Isotopic Labeling of Substrates (TAILS). , 2011, Methods in molecular biology.

[3]  G. Salvesen,et al.  N-terminomics: a high-content screen for protease substrates and their cleavage sites. , 2011, Methods in molecular biology.

[4]  Ursula Pieper,et al.  Prediction of protease substrates using sequence and structure features , 2010, Bioinform..

[5]  N. Bovenschen,et al.  Proteomic profiling of proteases: tools for granzyme degradomics , 2010, Expert review of proteomics.

[6]  C. Overall,et al.  Multiplex N-terminome Analysis of MMP-2 and MMP-9 Substrate Degradomes by iTRAQ-TAILS Quantitative Proteomics* , 2010, Molecular & Cellular Proteomics.

[7]  Geoffrey I. Webb,et al.  Cascleave: towards more accurate prediction of caspase substrate cleavage sites , 2010, Bioinform..

[8]  Vasile Palade,et al.  A New Performance Measure for Class Imbalance Learning. Application to Bioinformatics Problems , 2009, 2009 International Conference on Machine Learning and Applications.

[9]  G. Salvesen,et al.  Structural and kinetic determinants of protease substrates , 2009, Nature Structural &Molecular Biology.

[10]  Ines Thiele,et al.  Three-Dimensional Structural View of the Central Metabolic Network of Thermotoga maritima , 2009, Science.

[11]  Haruki Nakamura,et al.  Data Deposition and Annotation at the Worldwide Protein Data Bank , 2009, Molecular biotechnology.

[12]  Yang Zhang Protein structure prediction: when is it useful? , 2009, Current opinion in structural biology.

[13]  Bartek Wilczynski,et al.  Biopython: freely available Python tools for computational molecular biology and bioinformatics , 2009, Bioinform..

[14]  S. Maurer-Stroh,et al.  Analysis of Protein Processing by N-terminal Proteomics Reveals Novel Species-specific Substrate Determinants of Granzyme B Orthologs *S , 2009, Molecular & Cellular Proteomics.

[15]  C. López-Otín,et al.  Proteases: Multifunctional Enzymes in Life and Disease* , 2008, Journal of Biological Chemistry.

[16]  David T. Barkan,et al.  Global Sequencing of Proteolytic Cleavage Sites in Apoptosis by Specific Labeling of Protein N Termini , 2008, Cell.

[17]  Oliver Schilling,et al.  Proteome-derived, database-searchable peptide libraries for identifying protease cleavage sites , 2008, Nature Biotechnology.

[18]  Yang Zhang Progress and challenges in protein structure prediction. , 2008, Current opinion in structural biology.

[19]  Adam Godzik,et al.  Between order and disorder in protein structures: analysis of "dual personality" fragments in proteins. , 2007, Structure.

[20]  M. Wilkins,et al.  Surface accessibility of protein post-translational modifications. , 2007, Journal of proteome research.

[21]  S. Diamond Methods for mapping protease specificity. , 2007, Current opinion in chemical biology.

[22]  Jeffrey W. Smith,et al.  CutDB: a proteolytic event database , 2006, Nucleic Acids Res..

[23]  Tin Wee Tan,et al.  SVM-based prediction of caspase substrate cleavage sites , 2006, BMC Bioinformatics.

[24]  B. Turk Targeting proteases: successes, failures and future prospects , 2006, Nature Reviews Drug Discovery.

[25]  C. Craik,et al.  Specificity Profiling of Seven Human Tissue Kallikreins Reveals Individual Subsite Preferences* , 2006, Journal of Biological Chemistry.

[26]  Tom Fawcett,et al.  An introduction to ROC analysis , 2006, Pattern Recognit. Lett..

[27]  Masoud Nikravesh,et al.  Feature Extraction - Foundations and Applications , 2006, Feature Extraction.

[28]  Christina Backes,et al.  GraBCas: a bioinformatics tool for score-based prediction of Caspase- and Granzyme B-cleavage sites in protein sequences , 2005, Nucleic Acids Res..

[29]  Conrad C. Huang,et al.  UCSF Chimera—A visualization system for exploratory research and analysis , 2004, J. Comput. Chem..

[30]  Bernard F. Buxton,et al.  The DISOPRED server for the prediction of protein disorder , 2004, Bioinform..

[31]  Aleksey Porollo,et al.  PROTEINS: Structure, Function, and Bioinformatics 56:753–767 (2004) Accurate Prediction of Solvent Accessibility Using Neural Networks–Based Regression , 2022 .

[32]  P. Ascenzi,et al.  Proteolytic activity of bovine lactoferrin , 2004, Biometals.

[33]  P. Ascenzi,et al.  Erratum: Proteolytic activity of bovine lactoferrin (Biometals 17 (249-255)) , 2004 .

[34]  Jun Kong,et al.  MEROPS: the peptidase database. , 2004, Nucleic acids research.

[35]  James C. Whisstock,et al.  PoPS: a computational tool for modeling and predicting protease specificity , 2004, Proceedings. 2004 IEEE Computational Systems Bioinformatics Conference, 2004. CSB 2004..

[36]  Haruki Nakamura,et al.  Announcing the worldwide Protein Data Bank , 2003, Nature Structural Biology.

[37]  L. Juliano,et al.  S3 to S3' subsite specificity of recombinant human cathepsin K and development of selective internally quenched fluorescent substrates. , 2003, The Biochemical journal.

[38]  X. Puente,et al.  Human and mouse proteases: a comparative genomic approach , 2003, Nature Reviews Genetics.

[39]  Foster J. Provost,et al.  Learning When Training Data are Costly: The Effect of Class Distribution on Tree Induction , 2003, J. Artif. Intell. Res..

[40]  Oliviero Carugo,et al.  DPX: for the analysis of the protein core , 2003, Bioinform..

[41]  C. Overall Molecular determinants of metalloproteinase substrate specificity , 2002, Molecular biotechnology.

[42]  Jotun Hein,et al.  Statistical Methods in Bioinformatics: An Introduction , 2002 .

[43]  C. López-Otín,et al.  Protease degradomics: A new challenge for proteomics , 2002, Nature Reviews Molecular Cell Biology.

[44]  Oliviero Carugo,et al.  CX, an algorithm that identifies protruding atoms in proteins , 2002, Bioinform..

[45]  K. Fang,et al.  Mast Cell α-Chymase Reduces IgE Recognition of Birch Pollen Profilin by Cleaving Antibody-Binding Epitopes1 , 2002, The Journal of Immunology.

[46]  L. Cantley,et al.  Determination of protease cleavage site motifs using mixture-based oriented peptide libraries , 2001, Nature Biotechnology.

[47]  Paul Schliekelman,et al.  Statistical Methods in Bioinformatics: An Introduction , 2001 .

[48]  Gregory R. Grant,et al.  Statistical Methods in Bioinformatics , 2001 .

[49]  Liam J. McGuffin,et al.  The PSIPRED protein structure prediction server , 2000, Bioinform..

[50]  J M Thornton,et al.  Assessment of conformational parameters as predictors of limited proteolytic sites in native protein structures. , 1998, Protein engineering.

[51]  S. Parthasarathy,et al.  Analysis of temperature factor distribution in high‐resolution protein structures , 1997, Protein science : a publication of the Protein Society.

[52]  D. Bredesen,et al.  Cleavage of actin by interleukin 1 beta-converting enzyme to reverse DNase I inhibition. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[53]  M. Sanner,et al.  Reduced surface: an efficient way to compute molecular surfaces. , 1996, Biopolymers.

[54]  K Fidelis,et al.  A large‐scale experiment to assess protein structure prediction methods , 1995, Proteins.

[55]  T. Wilkins,et al.  The enterotoxin of Bacteroides fragilis is a metalloprotease , 1995, Infection and immunity.

[56]  J M Thornton,et al.  Modeling studies of the change in conformation required for cleavage of limited proteolytic sites , 1994, Protein science : a publication of the Protein Society.

[57]  J M Thornton,et al.  Molecular recognition. Conformational analysis of limited proteolytic sites and serine proteinase protein inhibitors. , 1991, Journal of molecular biology.

[58]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[59]  R. Bruccoleri,et al.  Correlation among sites of limited proteolysis, enzyme accessibility and segmental mobility , 1987, FEBS letters.

[60]  K. Nishikawa,et al.  Radial locations of amino acid residues in a globular protein: correlation with the sequence. , 1986, Journal of biochemistry.

[61]  M. Zamai,et al.  Correlation between sites of limited proteolysis and segmental mobility in thermolysin. , 1986, Biochemistry.

[62]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[63]  H. Jörnvall,et al.  The site in human antithrombin for functional proteolytic cleavage by human thrombin , 1981, FEBS letters.

[64]  Sirpa Mäki,et al.  The computer program , 1980 .

[65]  A. Berger,et al.  On the active site of proteases. 3. Mapping the active site of papain; specific peptide inhibitors of papain. , 1968, Biochemical and biophysical research communications.

[66]  E. Davie,et al.  Identification of a peptide released during autocatalytic activation of trypsinogen. , 1955, The Journal of biological chemistry.