Identification of Novel Phosphorylation Motifs Through an Integrative Computational and Experimental Analysis of the Human Phosphoproteome.

Protein phosphorylation occurs in certain sequence/structural contexts that are still incompletely understood. The amino acids surrounding the phosphorylated residues are important in determining the binding of the kinase to the protein sequence. Upon phosphorylation these sequences also determine the binding of certain domains that specifically bind to phosphorylated sequences. Thus far, such 'motifs' have been identified through alignment of a limited number of well identified kinase substrates. RESULTS: Experimentally determined phosphorylation sites from Human Protein Reference Database were used to identify 1,167 novel serine/threonine or tyrosine phosphorylation motifs using a computational approach. We were able to statistically validate a number of these novel motifs based on their enrichment in known phosphopeptides datasets over phosphoserine/threonine/tyrosine peptides in the human proteome. There were 299 novel serine/threonine or tyrosine phosphorylation motifs that were found to be statistically significant. Several of the novel motifs that we identified computationally have subsequently appeared in large datasets of experimentally determined phosphorylation sites since we initiated our analysis. Using a peptide microarray platform, we have experimentally evaluated the ability of casein kinase I to phosphorylate a subset of the novel motifs discovered in this study. Our results demonstrate that it is feasible to identify novel phosphorylation motifs through large phosphorylation datasets. Our study also establishes peptide microarrays as a novel platform for high throughput kinase assays and for the validation of consensus motifs. Finally, this extended catalog of phosphorylation motifs should assist in a systematic study of phosphorylation networks in signal transduction pathways.

[1]  R. Kolesnick,et al.  Substrate recognition by ceramide-activated protein kinase. Evidence that kinase activity is proline-directed. , 1993, The Journal of biological chemistry.

[2]  J. Thomson,et al.  Human embryonic stem cell phosphoproteome revealed by electron transfer dissociation tandem mass spectrometry , 2009, Proceedings of the National Academy of Sciences.

[3]  Eugene I Shakhnovich,et al.  Amino acids determining enzyme-substrate specificity in prokaryotic and eukaryotic protein kinases , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[4]  M. Yaffe,et al.  PhosphoSerine/threonine binding domains: you can't pSERious? , 2001, Structure.

[5]  Jun Zhu,et al.  14-3-3 proteins; bringing new definitions to scaffolding , 2001, Oncogene.

[6]  Steven P Gygi,et al.  A probability-based approach for high-throughput protein phosphorylation analysis and site localization , 2006, Nature Biotechnology.

[7]  Ramars Amanchy,et al.  Phosphoproteome analysis of HeLa cells using stable isotope labeling with amino acids in cell culture (SILAC). , 2005, Journal of proteome research.

[8]  W. Cleveland,et al.  Smoothing by Local Regression: Principles and Methods , 1996 .

[9]  Montserrat Carrascal,et al.  Phosphorylation analysis of primary human T lymphocytes using sequential IMAC and titanium oxide enrichment. , 2008, Journal of proteome research.

[10]  Charis Eng,et al.  Catalytic specificity of protein-tyrosine kinases is critical for selective signalling , 1995, Nature.

[11]  S. Gygi,et al.  An iterative statistical approach to the identification of protein phosphorylation motifs from large-scale data sets , 2005, Nature Biotechnology.

[12]  Jonathan A. Cooper,et al.  Analysis of the sequence of amino acids surrounding sites of tyrosine phosphorylation. , 1982, Proceedings of the National Academy of Sciences of the United States of America.

[13]  S. Mathivanan,et al.  A curated compendium of phosphorylation motifs , 2007, Nature Biotechnology.

[14]  T Pawson,et al.  Specific motifs recognized by the SH2 domains of Csk, 3BP2, fps/fes, GRB-2, HCP, SHC, Syk, and Vav , 1994, Molecular and cellular biology.

[15]  P. Schultz,et al.  Profiling of tyrosine phosphorylation pathways in human cells using mass spectrometry , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[16]  B. Zetter,et al.  Isolation of a mouse cDNA encoding mSTI1, a stress-inducible protein containing the TPR motif. , 1997, Gene.

[17]  M. Yaffe,et al.  A motif-based profile scanning approach for genome-wide prediction of signaling pathways , 2001, Nature Biotechnology.

[18]  T. Hunter,et al.  The Protein Kinase Complement of the Human Genome , 2002, Science.

[19]  T. Pawson,et al.  SH2 domains recognize specific phosphopeptide sequences , 1993, Cell.

[20]  Sandhya Rani,et al.  Human Protein Reference Database—2009 update , 2008, Nucleic Acids Res..

[21]  D. Lauffenburger,et al.  Time-resolved Mass Spectrometry of Tyrosine Phosphorylation Sites in the Epidermal Growth Factor Receptor Signaling Network Reveals Dynamic Modules*S , 2005, Molecular & Cellular Proteomics.

[22]  George M. Church,et al.  Predicting Protein Post-translational Modifications Using Meta-analysis of Proteome Scale Data Sets*S , 2009, Molecular & Cellular Proteomics.

[23]  N. Blom,et al.  Sequence and structure-based prediction of eukaryotic protein phosphorylation sites. , 1999, Journal of molecular biology.

[24]  Erich A Nigg,et al.  Phosphoproteome analysis of the human mitotic spindle. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[25]  M. Yaffe Phosphotyrosine-binding domains in signal transduction , 2002, Nature Reviews Molecular Cell Biology.

[26]  Steven P Gygi,et al.  Large-scale characterization of HeLa cell nuclear phosphoproteins. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[27]  R. Pearson,et al.  Protein kinase phosphorylation site sequences and consensus specificity motifs: tabulations. , 1991, Methods in enzymology.

[28]  Tony Hunter,et al.  MNK1, a new MAP kinase‐activated protein kinase, isolated by a novel expression screening method for identifying protein kinase substrates , 1997, The EMBO journal.

[29]  Zhou Songyang,et al.  Use of an oriented peptide library to determine the optimal substrates of protein kinases , 1994, Current Biology.

[30]  L. Cesaro,et al.  A noncanonical sequence phosphorylated by casein kinase 1 in β-catenin may play a role in casein kinase 1 targeting of important signaling proteins , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[31]  Gregory Shakhnarovich,et al.  Discovery of phosphorylation motif mixtures in phosphoproteomics data , 2008, Bioinform..

[32]  L. Pinna,et al.  A synthetic β‐casein phosphopeptide and analogues as model substrates for casein kinase‐1, a ubiquitous, phosphate directed protein kinase , 1991, FEBS letters.

[33]  J. Endicott,et al.  Cyclin-dependent kinases: inhibition and substrate recognition. , 1999, Current opinion in structural biology.

[34]  M. Andretta,et al.  Phosphorylation of synthetic fragments of inhibitor-2 of protein phosphatase-1 by casein kinase-1 and -2. Evidence that phosphorylated residues are not strictly required for efficient targeting by casein kinase-1. , 1994, European journal of biochemistry.

[35]  Suresh Mathivanan,et al.  Global proteomic profiling of phosphopeptides using electron transfer dissociation tandem mass spectrometry , 2007, Proceedings of the National Academy of Sciences.

[36]  Akhilesh Pandey,et al.  Identification of c-Src Tyrosine Kinase Substrates Using Mass Spectrometry and Peptide Microarrays , 2008, Journal of proteome research.

[37]  Jinzi J. Wu,et al.  Protein tyrosine kinases: Structure, substrate specificity, and drug discovery , 1998, Biopolymers.

[38]  Nikolaj Blom,et al.  Motif Decomposition of the Phosphotyrosine Proteome Reveals a New N-terminal Binding Motif for SHIP2*S , 2008, Molecular & Cellular Proteomics.

[39]  M. Yaffe,et al.  Phosphoserine/Threonine Binding Domains , 2008 .

[40]  L. Cantley,et al.  Recognition and specificity in protein tyrosine kinase-mediated signalling. , 1995, Trends in biochemical sciences.

[41]  M. Yaffe,et al.  Phosphoserine/threonine-binding domains. , 2001, Current opinion in cell biology.

[42]  P. Graves,et al.  Phosphate groups as substrate determinants for casein kinase I action. , 1990, The Journal of biological chemistry.

[43]  Jos Joore,et al.  Kinome Profiling for Studying Lipopolysaccharide Signal Transduction in Human Peripheral Blood Mononuclear Cells* , 2004, Journal of Biological Chemistry.

[44]  T. Soderling,et al.  A structural basis for substrate specificities of protein Ser/Thr kinases: primary sequence preference of casein kinases I and II, NIMA, phosphorylase kinase, calmodulin-dependent kinase II, CDK5, and Erk1 , 1996, Molecular and cellular biology.

[45]  Forest M White,et al.  Global phosphoproteome of HT-29 human colon adenocarcinoma cells. , 2005, Journal of proteome research.

[46]  Josephine C. Adams,et al.  Characterization of a Drosophila melanogaster orthologue of muskelin. , 2002, Gene.

[47]  P. Greengard,et al.  A solid-phase assay for the phosphorylation of proteins blotted on nitrocellulose membrane filters. , 1986, Analytical biochemistry.