Short Linear Motifs recognized by SH2, SH3 and Ser/Thr Kinase domains are conserved in disordered protein regions

BackgroundProtein interactions are essential for most cellular functions. Interactions mediated by domains that appear in a large number of proteins are of particular interest since they are expected to have an impact on diversities of cellular processes such as signal transduction and immune response. Many well represented domains recognize and bind to primary sequences less than 10 amino acids in length called Short Linear Motifs (SLiMs).ResultsIn this study, we systematically studied the evolutionary conservation of SLiMs recognized by SH2, SH3 and Ser/Thr Kinase domains in both ordered and disordered protein regions. Disordered protein regions are protein sequences that lack a fixed three-dimensional structure under putatively native conditions. We find that, in all these domains examined, SLiMs are more conserved in disordered regions. This trend is more evident in those protein functional groups that are frequently reported to interact with specific domains.ConclusionThe correlation between SLiM conservation with disorder prediction demonstrates that functional SLiMs recognized by each domain occur more often in disordered as compared to structured regions of proteins.

[1]  Dipanwita Roy Chowdhury,et al.  Human protein reference database as a discovery resource for proteomics , 2004, Nucleic Acids Res..

[2]  Zoran Obradovic,et al.  The Protein Non-Folding Problem: Amino Acid Determinants of Intrinsic Order and Disorder , 2000, Pacific Symposium on Biocomputing.

[3]  M. Yaffe,et al.  Biochemical Interactions Integrating Itk with the T Cell Receptor-initiated Signaling Cascade* , 2000, The Journal of Biological Chemistry.

[4]  D. Baltimore,et al.  Modular binding domains in signal transduction proteins , 1995, Cell.

[5]  Yu Xue,et al.  PPSP: prediction of PK-specific phosphorylation site with Bayesian decision theory , 2006, BMC Bioinformatics.

[6]  T Pawson,et al.  Specific motifs recognized by the SH2 domains of Csk, 3BP2, fps/fes, GRB-2, HCP, SHC, Syk, and Vav , 1994, Molecular and cellular biology.

[7]  Michael B. Yaffe,et al.  Scansite 2.0: proteome-wide prediction of cell signaling interactions using short sequence motifs , 2003, Nucleic Acids Res..

[8]  M. Yaffe,et al.  A peptide library approach identifies a specific inhibitor for the ZAP-70 protein tyrosine kinase. , 2000, Molecular cell.

[9]  N. Blom,et al.  Sequence and structure-based prediction of eukaryotic protein phosphorylation sites. , 1999, Journal of molecular biology.

[10]  T. Pawson,et al.  A Potential SH3 Domain-binding Site in the Crk SH2 Domain* , 1996, The Journal of Biological Chemistry.

[11]  Tatiana Tatusova,et al.  NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins , 2004, Nucleic Acids Res..

[12]  Peter E Wright,et al.  Solution Structure of the KIX Domain of CBP Bound to the Transactivation Domain of CREB: A Model for Activator:Coactivator Interactions , 1997, Cell.

[13]  Michael B Yaffe,et al.  MAPKAP kinase-2 is a cell cycle checkpoint kinase that regulates the G2/M transition and S phase progression in response to UV irradiation. , 2005, Molecular cell.

[14]  A Keith Dunker,et al.  Signal transduction via unstructured protein conduits. , 2008, Nature chemical biology.

[15]  Tony Pawson,et al.  Protein modules and signalling networks , 1995, Nature.

[16]  Xuegong Zhang,et al.  Prediction of kinase‐specific phosphorylation sites with sequence features by a log‐odds ratio approach , 2007, Proteins.

[17]  V. Uversky Natively unfolded proteins: A point where biology waits for physics , 2002, Protein science : a publication of the Protein Society.

[18]  A. Sparks,et al.  Distinct ligand preferences of Src homology 3 domains from Src, Yes, Abl, Cortactin, p53bp2, PLCgamma, Crk, and Grb2. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[19]  Zoran Obradovic,et al.  DisProt: a database of protein disorder , 2005, Bioinform..

[20]  L. Cantley,et al.  SRPK2: A Differentially Expressed SR Protein-specific Kinase Involved in Mediating the Interaction and Localization of Pre-mRNA Splicing Factors in Mammalian Cells , 1998, The Journal of cell biology.

[21]  Yu Xue,et al.  GPS: a novel group-based phosphorylation predicting and scoring method. , 2004, Biochemical and biophysical research communications.

[22]  H. Dyson,et al.  Intrinsically unstructured proteins: re-assessing the protein structure-function paradigm. , 1999, Journal of molecular biology.

[23]  Zoran Obradovic,et al.  Predicting intrinsic disorder from amino acid sequence , 2003, Proteins.

[24]  T. Pawson,et al.  The human and mouse complement of SH2 domain proteins-establishing the boundaries of phosphotyrosine signaling. , 2006, Molecular cell.

[25]  D. Baltimore,et al.  Crystal structure of the phosphotyrosine recognition domain SH2 of v-src complexed with tyrosine-phosphorylated peptides , 1993, Nature.

[26]  Sonia Longhi,et al.  The C-terminal Domain of the Measles Virus Nucleoprotein Is Intrinsically Disordered and Folds upon Binding to the C-terminal Moiety of the Phosphoprotein* , 2003, The Journal of Biological Chemistry.

[27]  Nikolaj Blom,et al.  Phospho.ELM: A database of experimentally verified phosphorylation sites in eukaryotic proteins , 2004, BMC Bioinformatics.

[28]  A Keith Dunker,et al.  TOP-IDP-scale: a new amino acid scale measuring propensity for intrinsic disorder. , 2008, Protein and peptide letters.

[29]  Christopher J. Oldfield,et al.  Showing your ID: intrinsic disorder as an ID for recognition, regulation and cell signaling , 2005, Journal of molecular recognition : JMR.

[30]  Erik L. L. Sonnhammer,et al.  Inparanoid: a comprehensive database of eukaryotic orthologs , 2004, Nucleic Acids Res..

[31]  M. Yaffe,et al.  A motif-based profile scanning approach for genome-wide prediction of signaling pathways , 2001, Nature Biotechnology.

[32]  T. Hunter,et al.  The protein kinases of Caenorhabditis elegans: a model for signal transduction in multicellular organisms. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[33]  T. Pawson,et al.  Molecular interactions of the Src homology 2 domain protein Shb with phosphotyrosine residues, tyrosine kinase receptors and Src homology 3 domain proteins. , 1995, Oncogene.

[34]  L. Cantley,et al.  The use of peptide library for the determination of kinase peptide substrates. , 1998, Methods in molecular biology.

[35]  Zoran Obradovic,et al.  Optimizing Long Intrinsic Disorder Predictors with Protein Evolutionary Information , 2005, J. Bioinform. Comput. Biol..

[36]  A Keith Dunker,et al.  Mining alpha-helix-forming molecular recognition features with cross species sequence alignments. , 2007, Biochemistry.

[37]  Obradovic,et al.  Predicting Binding Regions within Disordered Proteins. , 1999, Genome informatics. Workshop on Genome Informatics.

[38]  J. Schlessinger SH2/SH3 signaling proteins. , 1994, Current opinion in genetics & development.

[39]  Marc S. Cortese,et al.  Analysis of molecular recognition features (MoRFs). , 2006, Journal of molecular biology.

[40]  T Pawson,et al.  SH2 domains, interaction modules and cellular wiring. , 2001, Trends in cell biology.

[41]  A. Keith Dunker,et al.  Mining α-Helix-Forming Molecular Recognition Features with Cross Species Sequence Alignments† , 2007 .

[42]  Ricardo M Biondi,et al.  Signalling specificity of Ser/Thr protein kinases through docking-site-mediated interactions. , 2003, The Biochemical journal.

[43]  Bermseok Oh,et al.  Prediction of phosphorylation sites using SVMs , 2004, Bioinform..

[44]  Christopher J. Oldfield,et al.  Flexible nets: disorder and induced fit in the associations of p53 and 14-3-3 with their partners , 2008, BMC Genomics.

[45]  Leszek Rychlewski,et al.  ELM server: a new resource for investigating short functional sites in modular eukaryotic proteins , 2003, Nucleic Acids Res..

[46]  E. Krebs,et al.  Substrate specificity of the cyclic AMP-dependent protein kinase. , 1975, Proceedings of the National Academy of Sciences of the United States of America.

[47]  Marc S. Cortese,et al.  Coupled folding and binding with α-helix-forming molecular recognition elements , 2005 .

[48]  T. Pawson,et al.  Protein-protein interactions define specificity in signal transduction. , 2000, Genes & development.

[49]  Tatiana A. Tatusova,et al.  NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins , 2004, Nucleic Acids Res..

[50]  Marc S. Cortese,et al.  Flexible nets , 2005, The FEBS journal.

[51]  M. Kashiwada,et al.  Immunoreceptor Tyrosine-Based Inhibitory Motif of the IL-4 Receptor Associates with SH2-Containing Phosphatases and Regulates IL-4-Induced Proliferation1 , 2001, The Journal of Immunology.

[52]  T. Hunter,et al.  The Protein Kinase Complement of the Human Genome , 2002, Science.

[53]  T. Pawson,et al.  SH2 domains recognize specific phosphopeptide sequences , 1993, Cell.

[54]  T. Hunter,et al.  The protein kinases of budding yeast: six score and more. , 1997, Trends in biochemical sciences.

[55]  D. Morrison,et al.  Protein Kinases and Phosphatases in the Drosophila Genome , 2000, The Journal of cell biology.

[56]  N. Blom,et al.  Prediction of post‐translational glycosylation and phosphorylation of proteins from the amino acid sequence , 2004, Proteomics.

[57]  István Simon,et al.  BIOINFORMATICS ORIGINAL PAPER doi:10.1093/bioinformatics/btm035 Structural bioinformatics Local structural disorder imparts plasticity on linear motifs , 2022 .

[58]  Darren A. Natale,et al.  The COG database: an updated version includes eukaryotes , 2003, BMC Bioinformatics.

[59]  E. Nikolakaki,et al.  Phosphorylation by LAMMER protein kinases: determination of a consensus site, identification of in vitro substrates, and implications for substrate preferences. , 2002, Biochemistry.

[60]  Marc S. Cortese,et al.  Coupled folding and binding with alpha-helix-forming molecular recognition elements. , 2005, Biochemistry.

[61]  A Keith Dunker,et al.  Characterization of molecular recognition features, MoRFs, and their binding partners. , 2007, Journal of proteome research.

[62]  John Moult,et al.  Evaluation of disorder predictions in CASP5 , 2003, Proteins.

[63]  E. Humble,et al.  The minimum substrate of cyclic AMP-stimulated protein kinase, as studied by synthetic peptides representing the phosphorylatable site of pyruvate kinase (type L) of rat liver. , 1976, Biochemical and biophysical research communications.

[64]  Carol V Robinson,et al.  Studies of the RNA degradosome-organizing domain of the Escherichia coli ribonuclease RNase E. , 2004, Journal of molecular biology.

[65]  Christopher J. Oldfield,et al.  Evolutionary Rate Heterogeneity in Proteins with Long Disordered Regions , 2002, Journal of Molecular Evolution.

[66]  T. Soderling,et al.  A structural basis for substrate specificities of protein Ser/Thr kinases: primary sequence preference of casein kinases I and II, NIMA, phosphorylase kinase, calmodulin-dependent kinase II, CDK5, and Erk1 , 1996, Molecular and cellular biology.

[67]  F E Cohen,et al.  Exploiting the basis of proline recognition by SH3 and WW domains: design of N-substituted inhibitors. , 1998, Science.

[68]  Zoran Obradovic,et al.  The protein trinity—linking function and disorder , 2001, Nature Biotechnology.

[69]  L. Pinna,et al.  How do protein kinases recognize their substrates? , 1996, Biochimica et biophysica acta.