Basis for substrate recognition and distinction by matrix metalloproteinases

Significance Specificity-determining positions (SDPs) account for distinctions in function across a protein family. Many theories on the evolution of functional specificity have led to approaches for predicting SDPs in silico, but large experimental datasets allowing a statistical assignment are lacking. Here, the SDPs of matrix metalloproteinases are elucidated by querying the proteolytic efficiency of eight matrix metalloproteinases, representing three phylogenetic branches, in an extended and diverse substrate space. More than 10,000 measures of cleavage efficiency reveal a near-perfect correlation between similarity in proteolytic function and sequence identity at 50–57 positions on the front face of the catalytic domain. These positions are assigned as SDPs. Transmutation of proteolytic function is possible by swapping SDPs nearest to bound substrate. Genomic sequencing and structural genomics produced a vast amount of sequence and structural data, creating an opportunity for structure–function analysis in silico [Radivojac P, et al. (2013) Nat Methods 10(3):221–227]. Unfortunately, only a few large experimental datasets exist to serve as benchmarks for function-related predictions. Furthermore, currently there are no reliable means to predict the extent of functional similarity among proteins. Here, we quantify structure–function relationships among three phylogenetic branches of the matrix metalloproteinase (MMP) family by comparing their cleavage efficiencies toward an extended set of phage peptide substrates that were selected from ∼64 million peptide sequences (i.e., a large unbiased representation of substrate space). The observed second-order rate constants [k(obs)] across the substrate space provide a distance measure of functional similarity among the MMPs. These functional distances directly correlate with MMP phylogenetic distance. There is also a remarkable and near-perfect correlation between the MMP substrate preference and sequence identity of 50–57 discontinuous residues surrounding the catalytic groove. We conclude that these residues represent the specificity-determining positions (SDPs) that allowed for the expansion of MMP proteolytic function during evolution. A transmutation of only a few selected SDPs proximal to the bound substrate peptide, and contributing the most to selectivity among the MMPs, is sufficient to enact a global change in the substrate preference of one MMP to that of another, indicating the potential for the rational and focused redesign of cleavage specificity in MMPs.

[1]  Dong Xu,et al.  FFAS-3D: improving fold recognition by including optimized structural features and template re-ranking , 2014, Bioinform..

[2]  S. Kumar,et al.  Antibody-directed coupling of endoglin and MMP-14 is a key mechanism for endoglin shedding and deregulation of TGF-β signaling , 2013, Oncogene.

[3]  B. Jeppsson,et al.  Platelet shedding of CD40L is regulated by matrix metalloproteinase‐9 in abdominal sepsis , 2013, Journal of thrombosis and haemostasis : JTH.

[4]  Matrix metalloproteinase-2 (MMP-2) generates soluble HLA-G1 by cell surface proteolytic shedding , 2013, Molecular and Cellular Biochemistry.

[5]  Felice C. Lightstone,et al.  Catalytic site identification—a web server to identify catalytic site structural matches throughout PDB , 2013, Nucleic Acids Res..

[6]  Peter Marek,et al.  Commercial proteases: Present and future , 2013, FEBS letters.

[7]  George Georgiou,et al.  Engineering of TEV protease variants by yeast ER sequestration screening (YESS) of combinatorial libraries , 2013, Proceedings of the National Academy of Sciences.

[8]  C. Overall,et al.  Protein TAILS: when termini tell tales of proteolysis and function. , 2013, Current opinion in chemical biology.

[9]  András Fiser,et al.  Protein structure based prediction of catalytic residues , 2013, BMC Bioinformatics.

[10]  Angela D. Wilkins,et al.  Disentangling evolutionary signals: conservation, specificity determining positions and coevolution. Implication for catalytic residue prediction , 2012, BMC Bioinformatics.

[11]  K. Neuman,et al.  Single-Molecule Tracking of Collagenase on Native Type I Collagen Fibrils Reveals Degradation Mechanism , 2012, Current Biology.

[12]  Samuel Achilefu,et al.  Detection of MMP-2 and MMP-9 activity in vivo with a triple-helical peptide optical probe. , 2012, Bioconjugate chemistry.

[13]  A. Burlingame,et al.  Global kinetic analysis of proteolysis via quantitative targeted proteomics , 2012, Proceedings of the National Academy of Sciences.

[14]  Stijn van Dongen,et al.  Using MCL to extract clusters from networks. , 2012, Methods in molecular biology.

[15]  G. Fields,et al.  The Interface between Catalytic and Hemopexin Domains in Matrix Metalloproteinase-1 Conceals a Collagen Binding Exosite* , 2011, The Journal of Biological Chemistry.

[16]  Christopher M Overall,et al.  Identifying and quantifying proteolytic events and the natural N terminome by terminal amine isotopic labeling of substrates , 2011, Nature Protocols.

[17]  Ying Zhang,et al.  Structural determinants of limited proteolysis. , 2011, Journal of proteome research.

[18]  A. Godzik,et al.  Internal organization of large protein families: Relationship between the sequence, structure, and function‐based clustering , 2011, Proteins.

[19]  C. Craik,et al.  Proteases as therapeutics. , 2011, The Biochemical journal.

[20]  Jennifer McDowall,et al.  InterPro protein classification. , 2011, Methods in molecular biology.

[21]  Shili Duan,et al.  Recognition and Specificity Determinants of the Human Cbx Chromodomains* , 2010, The Journal of Biological Chemistry.

[22]  C. Overall,et al.  Multiplex N-terminome Analysis of MMP-2 and MMP-9 Substrate Degradomes by iTRAQ-TAILS Quantitative Proteomics* , 2010, Molecular & Cellular Proteomics.

[23]  Akira R. Kinjo,et al.  SeSAW: balancing sequence and structural information in protein functional mapping , 2010, Bioinform..

[24]  Oliver Kohlbacher,et al.  Combining Structure and Sequence Information Allows Automated Prediction of Substrate Specificities within Enzyme Families , 2010, PLoS Comput. Biol..

[25]  I. Sagi,et al.  Structural and functional bases for allosteric control of MMP activities: can it pave the path for selective inhibition? , 2010, Biochimica et biophysica acta.

[26]  Jaap Heringa,et al.  Protein secondary structure prediction. , 2010, Methods in molecular biology.

[27]  Robert B. Russell,et al.  An automated stochastic approach to the identification of the protein specificity determinants and functional subfamilies , 2010, Algorithms for Molecular Biology.

[28]  M. Pogson,et al.  Engineering next generation proteases. , 2009, Current opinion in biotechnology.

[29]  Scott A. Busby,et al.  Identification of Specific Hemopexin-like Domain Residues That Facilitate Matrix Metalloproteinase Collagenolytic Activity* , 2009, The Journal of Biological Chemistry.

[30]  Anna R Panchenko,et al.  Coevolution in defining the functional specificity , 2009, Proteins.

[31]  High throughput substrate phage display for protease profiling. , 2009, Methods in molecular biology.

[32]  L. Mirny,et al.  Using evolutionary information to find specificity-determining and co-evolving residues. , 2009, Methods in molecular biology.

[33]  Kai Wang,et al.  Protein Meta-Functional Signatures from Combining Sequence, Structure, Evolution, and Amino Acid Property Information , 2008, PLoS Comput. Biol..

[34]  G. Fields,et al.  MMP-12 Catalytic Domain Recognizes Triple Helical Peptide Models of Collagen V with Exosites and High Activity* , 2008, Journal of Biological Chemistry.

[35]  N. Grishin,et al.  Sequences and topology: from genome structure to protein structure. , 2008, Current opinion in structural biology.

[36]  George Georgiou,et al.  Highly active and selective endopeptidases with programmed substrate specificities. , 2008, Nature chemical biology.

[37]  Kai Ye,et al.  Multi-RELIEF: a method to recognize specificity determining residues from multiple sequence alignments using a Machine-Learning approach for feature weighting , 2008, Bioinform..

[38]  K. Brew,et al.  Differentiation of secreted and membrane-type matrix metalloproteinase activities based on substitutions and interruptions of triple-helical sequences. , 2007, Biochemistry.

[39]  K. Brew,et al.  The Roles of Substrate Thermal Stability and P2 and P1′ Subsite Identity on Matrix Metalloproteinase Triple-helical Peptidase Activity and Collagen Specificity* , 2006, Journal of Biological Chemistry.

[40]  J. Heringa,et al.  Sequence comparison by sequence harmony identifies subtype-specific functional sites , 2006, Nucleic acids research.

[41]  P. Dempsey,et al.  Control of ErbB signaling through metalloprotease mediated ectodomain shedding of EGF-like factors , 2006, Growth factors.

[42]  Vladimir A. Ivanisenko,et al.  WebProAnalyst: an interactive tool for analysis of quantitative structure–activity relationships in protein families , 2005, Nucleic Acids Res..

[43]  C. Overall,et al.  Pivotal Molecular Determinants of Peptidic and Collagen Triple Helicase Activities Reside in the S3′ Subsite of Matrix Metalloproteinase 8 (MMP-8) , 2005, Journal of Biological Chemistry.

[44]  D. Dinakarpandian,et al.  Collagenase unwinds triple‐helical collagen prior to peptide bond hydrolysis , 2004, The EMBO journal.

[45]  A. Godzik,et al.  The interplay of fold recognition and experimental structure determination in structural genomics. , 2004, Current opinion in structural biology.

[46]  G. Crooks,et al.  WebLogo: a sequence logo generator. , 2004, Genome research.

[47]  Robert C. Edgar,et al.  MUSCLE: multiple sequence alignment with high accuracy and high throughput. , 2004, Nucleic acids research.

[48]  Jeffrey W. Smith,et al.  A Residue in the S2 Subsite Controls Substrate Selectivity of Matrix Metalloproteinase-2 and Matrix Metalloproteinase-9* , 2003, Journal of Biological Chemistry.

[49]  Eugene I Shakhnovich,et al.  Amino acids determining enzyme-substrate specificity in prokaryotic and eukaryotic protein kinases , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[50]  J. Wallace,et al.  Matrix metalloproteinase processing of monocyte chemoattractant proteins generates CC chemokine receptor antagonists with anti-inflammatory properties in vivo. , 2002, Blood.

[51]  L. Mirny,et al.  Using orthologous and paralogous proteins to identify specificity determining residues , 2002, Genome Biology.

[52]  A. Valencia,et al.  Similarity of phylogenetic trees as indicator of protein-protein interaction. , 2001, Protein engineering.

[53]  J W Smith,et al.  Substrate Hydrolysis by Matrix Metalloproteinase-9* , 2001, The Journal of Biological Chemistry.

[54]  D. Dinakarpandian,et al.  Identification of the 183RWTNNFREY191Region as a Critical Segment of Matrix Metalloproteinase 1 for the Expression of Collagenolytic Activity* , 2000, The Journal of Biological Chemistry.

[55]  C. Overall,et al.  Inflammation dampened by gelatinase A cleavage of monocyte chemoattractant protein-3. , 2000, Science.

[56]  Jonathan Bingham,et al.  Visualizing large hierarchical clusters in hyperbolic space , 2000, Bioinform..

[57]  J. M. Bradshaw,et al.  Mutational investigation of the specificity determining region of the Src SH2 domain. , 2000, Journal of molecular biology.

[58]  F. Cohen,et al.  Co-evolution of proteins with their interaction partners. , 2000, Journal of molecular biology.

[59]  D. T. Jones Protein structure prediction in the postgenomic era. , 2000, Current opinion in structural biology.

[60]  Charles D. McDermott,et al.  Broad Antitumor and Antiangiogenic Activities of AG3340, a Potent and Selective MMP Inhibitor Undergoing Advanced Oncology Clinical Trials , 1999, Annals of the New York Academy of Sciences.

[61]  L. Kotra,et al.  Matrix metalloproteinases: structures, evolution, and diversification , 1998, FASEB journal : official publication of the Federation of American Societies for Experimental Biology.

[62]  Jeffrey W. Smith,et al.  Identification of a Region in the Integrin β3 Subunit That Confers Ligand Binding Specificity* , 1997, The Journal of Biological Chemistry.

[63]  K. J. Fryxell,et al.  The coevolution of gene family trees. , 1996, Trends in genetics : TIG.

[64]  T. Voelker,et al.  Modification of the substrate specificity of an acyl-acyl carrier protein thioesterase by protein engineering. , 1995, Proceedings of the National Academy of Sciences of the United States of America.

[65]  M. Schaefer,et al.  Low Molecular Weight Inhibitors in Corneal Ulceration a , 1994, Annals of the New York Academy of Sciences.

[66]  J. Wells,et al.  Substrate phage: selection of protease substrates by monovalent phage display. , 1993, Science.

[67]  A. Fersht,et al.  Modification of the amino acid specificity of tyrosyl‐tRNA synthetase by protein engineering , 1993, FEBS letters.

[68]  J. Wells,et al.  High-resolution epitope mapping of hGH-receptor interactions by alanine-scanning mutagenesis. , 1989, Science.

[69]  Pivotal Molecular Determinants of Peptidic and Collagen Triple Helicase Activities Reside in the S 3 (cid:1) Subsite of Matrix Metalloproteinase 8 (MMP-8) THE ROLE OF HYDROGEN BONDING POTENTIAL OF ASN 188 AND TYR 189 AND THE CONNECTING CIS BOND* , 2022 .