SLiM on Diet: finding short linear motifs on domain interaction interfaces in Protein Data Bank

MOTIVATION An important class of protein interactions involves the binding of a protein's domain to a short linear motif (SLiM) on its interacting partner. Extracting such motifs, either experimentally or computationally, is challenging because of their weak binding and high degree of degeneracy. Recent rapid increase of available protein structures provides an excellent opportunity to study SLiMs directly from their 3D structures. RESULTS Using domain interface extraction (Diet), we characterized 452 distinct SLiMs from the Protein Data Bank (PDB), of which 155 are validated in varying degrees-40 have literature validation, 54 are supported by at least one domain-peptide structural instance, and another 61 have overrepresentation in high-throughput PPI data. We further observed that the lacklustre coverage of existing computational SLiM detection methods could be due to the common assumption that most SLiMs occur outside globular domain regions. 198 of 452 SLiM that we reported are actually found on domain-domain interface; some of them are implicated in autoimmune and neurodegenerative diseases. We suggest that these SLiMs would be useful for designing inhibitors against the pathogenic protein complexes underlying these diseases. Our findings show that 3D structure-based SLiM detection algorithms can provide a more complete coverage of SLiM-mediated protein interactions than current sequence-based approaches.

[1]  Ian Sillitoe,et al.  The CATH classification revisited—architectures reviewed and new ways to characterize structural divergence in superfamilies , 2008, Nucleic Acids Res..

[2]  Leszek Rychlewski,et al.  ELM server: a new resource for investigating short functional sites in modular eukaryotic proteins , 2003, Nucleic Acids Res..

[3]  Kara Dolinski,et al.  The BioGRID Interaction Database: 2008 update , 2008, Nucleic Acids Res..

[4]  E. Katchalski‐Katzir,et al.  Three-dimensional solution structure of the complex of alpha-bungarotoxin with a library-derived peptide. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[5]  W. Tatton,et al.  Neuroprotection by deprenyl and other propargylamines: glyceraldehyde-3-phosphate dehydrogenase rather than monoamine oxidase B , 2003, Journal of Neural Transmission.

[6]  Carl F. Ware,et al.  Autoimmunity: Decoy receptors thwart B cells , 2000, Nature.

[7]  A. Minton,et al.  Effect of macromolecular crowding upon the structure and function of an enzyme: glyceraldehyde-3-phosphate dehydrogenase. , 1981, Biochemistry.

[8]  Jinyan Li,et al.  Bioinformatics Original Paper Discovering Motif Pairs at Interaction Sites from Protein Sequences on a Proteome-wide Scale , 2022 .

[9]  R. Kelley,et al.  BAFF/BLyS receptor 3 comprises a minimal TNF receptor-like module that encodes a highly focused ligand-binding site. , 2003, Biochemistry.

[10]  Robert D. Finn,et al.  The Pfam protein families database , 2004, Nucleic Acids Res..

[11]  J. H. Shinn,et al.  Minimotif Miner: a tool for investigating protein function , 2006, Nature Methods.

[12]  Ziv Bar-Joseph,et al.  Identifying cycling genes by combining sequence homology and expression data , 2006, ISMB.

[13]  D Eisenberg,et al.  The crystal structure of a 3D domain-swapped dimer of RNase A at a 2.1-A resolution. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[14]  D Fischer,et al.  Analysis of topological and nontopological structural similarities in the PDB: New examples with old structures , 1996, Proteins.

[15]  Elspeth F Garman,et al.  Crystal structures of fibronectin-binding sites from Staphylococcus aureus FnBPA in complex with fibronectin domains , 2008, Proceedings of the National Academy of Sciences.

[16]  Sean R. Eddy,et al.  Profile hidden Markov models , 1998, Bioinform..

[17]  P. Sparén,et al.  Efficiency of organised and opportunistic cytological screening for cancer in situ of the cervix , 1995, British Journal of Cancer.

[18]  David Eisenberg,et al.  A domain-swapped RNase A dimer with implications for amyloid formation , 2001, Nature Structural Biology.

[19]  Allegra Via,et al.  A structure filter for the Eukaryotic Linear Motif Resource , 2009, BMC Bioinformatics.

[20]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[21]  Andrew C. R. Martin Databases and ontologies Mapping PDB chains to UniProtKB entries , 2005 .

[22]  S. Henikoff,et al.  Amino acid substitution matrices from protein blocks. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[23]  Kian-Lee Tan,et al.  Matalign: Precise Protein Structure Comparison by Matrix Alignment , 2006, J. Bioinform. Comput. Biol..

[24]  福原 葉子 GAPDH knockdown rescues mesencephalic dopaminergic neurons from MPP[+] -induced apoptosis , 2002 .

[25]  L. Holm,et al.  The Pfam protein families database , 2005, Nucleic Acids Res..

[26]  R. Russell,et al.  Linear motifs: Evolutionary interaction switches , 2005, FEBS letters.

[27]  Robert D. Finn,et al.  InterPro: the integrative protein signature database , 2008, Nucleic Acids Res..

[28]  R. Carrell,et al.  Conformational changes and disease--serpins, prions and Alzheimer's. , 1998, Current opinion in structural biology.

[29]  Christopher W. V. Hogue,et al.  Structure-Templated Predictions of Novel Protein Interactions from Sequence Information , 2007, PLoS Comput. Biol..

[30]  Don Foster,et al.  TACI and BCMA are receptors for a TNF homologue implicated in B-cell autoimmune disease , 2000, Nature.

[31]  D. Lacey,et al.  Severe B cell hyperplasia and autoimmune disease in TALL-1 transgenic mice. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[32]  R. Russell,et al.  Structural systems biology: modelling protein interactions , 2006, Nature Reviews Molecular Cell Biology.

[33]  T. Gibson,et al.  Systematic Discovery of New Recognition Peptides Mediating Protein Interaction Networks , 2005, PLoS biology.

[34]  Arne Elofsson,et al.  A comparison of sequence and structure protein domain families as a basis for structural genomics , 1999, Bioinform..

[35]  Gail J. Bartlett,et al.  Using a library of structural templates to recognise catalytic sites and explore their evolution in homologous families. , 2005, Journal of molecular biology.

[36]  See-Kiong Ng,et al.  A correlated motif approach for finding short linear motifs from protein interaction networks , 2006, BMC Bioinformatics.

[37]  B. Snel,et al.  Comparative assessment of large-scale data sets of protein–protein interactions , 2002, Nature.

[38]  Victor Neduva,et al.  Peptides mediating interaction networks: new leads at last. , 2006, Current opinion in biotechnology.

[39]  Tim J. P. Hubbard,et al.  Data growth and its impact on the SCOP database: new developments , 2007, Nucleic Acids Res..

[40]  M. Teresa Pisabarro,et al.  SCOWLP classification: Structural comparison and analysis of protein binding regions , 2008, BMC Bioinformatics.

[41]  Richard J. Edwards,et al.  SLiMFinder: A Probabilistic Method for Identifying Over-Represented, Convergently Evolved, Short Linear Motifs in Proteins , 2007, PloS one.

[42]  A. Boulton,et al.  Glyceraldehyde‐3‐phosphate dehydrogenase and apoptosis , 2000, Journal of neuroscience research.

[43]  T. Pawson,et al.  Assembly of Cell Regulatory Systems Through Protein Interaction Domains , 2003, Science.

[44]  Dan M. Bolser,et al.  Using convex hulls to extract interaction interfaces from known structures , 2004, Bioinform..

[45]  D Cowburn,et al.  Modular peptide recognition domains in eukaryotic signaling. , 1997, Annual review of biophysics and biomolecular structure.