The structural and functional signatures of proteins that undergo multiple events of post‐translational modification

The structural, functional, and mechanistic characterization of several types of post‐translational modifications (PTMs) is well‐documented. PTMs, however, may interact or interfere with one another when regulating protein function. Yet, characterization of the structural and functional signatures of their crosstalk has been hindered by the scarcity of data. To this end, we developed a unified sequence‐based predictor of 23 types of PTM sites that, we believe, is a useful tool in guiding biological experiments and data interpretation. We then used experimentally determined and predicted PTM sites to investigate two particular cases of potential PTM crosstalk in eukaryotes. First, we identified proteins statistically enriched in multiple types of PTM sites and found that they show preferences toward intrinsically disordered regions as well as functional roles in transcriptional, posttranscriptional, and developmental processes. Second, we observed that target sites modified by more than one type of PTM, referred to as shared PTM sites, show even stronger preferences toward disordered regions than their single‐PTM counterparts; we explain this by the need for these regions to accommodate multiple partners. Finally, we investigated the influence of single and shared PTMs on differential regulation of protein–protein interactions. We provide evidence that molecular recognition features (MoRFs) show significant preferences for PTM sites, particularly shared PTM sites, implicating PTMs in the modulation of this specific type of macromolecular recognition. We conclude that intrinsic disorder is a strong structural prerequisite for complex PTM‐based regulation, particularly in context‐dependent protein–protein interactions related to transcriptional and developmental processes. Availability: www.modpred.org

[1]  R. Fisher On the Interpretation of χ2 from Contingency Tables, and the Calculation of P , 2010 .

[2]  Arne G. Schmeisky,et al.  Cross-talk between phosphorylation and lysine acetylation in a genome-reduced bacterium , 2012, Molecular systems biology.

[3]  M. Mann,et al.  Lysine Acetylation Targets Protein Complexes and Co-Regulates Major Cellular Functions , 2009, Science.

[4]  Christopher J. Oldfield,et al.  Exploring the binding diversity of intrinsically disordered proteins involved in one‐to‐many binding , 2013, Protein science : a publication of the Protein Society.

[5]  Predrag Radivojac,et al.  Intrinsic Disorder and Prote in Modifications: Building an SVM Predictor for Methylation , 2005, 2005 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology.

[6]  Sebastian A. Wagner,et al.  A Proteome-wide, Quantitative Survey of In Vivo Ubiquitylation Sites Reveals Widespread Regulatory Roles* , 2011, Molecular & Cellular Proteomics.

[7]  C. Heldin,et al.  Mechanism of action and in vivo role of platelet-derived growth factor. , 1999, Physiological reviews.

[8]  Ruth Nussinov,et al.  Regulating highly dynamic unstructured proteins and their coding mRNAs , 2009, Genome Biology.

[9]  M. Vihinen,et al.  Accuracy of protein flexibility predictions , 1994, Proteins.

[10]  Ying Xu,et al.  A general user interface for prediction servers of proteins' post-translational modification sites , 2006, Nature Protocols.

[11]  Birgit Eisenhaber,et al.  Prediction of posttranslational modification of proteins from their amino acid sequence. , 2010, Methods in molecular biology.

[12]  Stefan Westermann,et al.  Post-translational modifications regulate microtubule function , 2003, Nature Reviews Molecular Cell Biology.

[13]  Alan Bridge,et al.  New and continuing developments at PROSITE , 2012, Nucleic Acids Res..

[14]  Christopher T. Walsh,et al.  Posttranslational Modification of Proteins: Expanding Nature's Inventory , 2005 .

[15]  V. Uversky,et al.  Why are “natively unfolded” proteins unstructured under physiologic conditions? , 2000, Proteins.

[16]  Man Kit Cheung,et al.  Human proteins with target sites of multiple post-translational modification types are more prone to be involved in disease. , 2014, Journal of proteome research.

[17]  A Keith Dunker,et al.  Characterization of molecular recognition features, MoRFs, and their binding partners. , 2007, Journal of proteome research.

[18]  A Keith Dunker,et al.  Alternative splicing in concert with protein intrinsic disorder enables increased functional diversity in multicellular organisms. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[19]  Christopher J. Oldfield,et al.  Intrinsically disordered protein. , 2001, Journal of molecular graphics & modelling.

[20]  R. Youle,et al.  Role of the Ubiquitin Conjugation System in the Maintenance of Mitochondrial Homeostasis , 2008, Annals of the New York Academy of Sciences.

[21]  Ming-Ming Zhou,et al.  Structural basis of site-specific histone recognition by the bromodomains of human coactivators PCAF and CBP/p300. , 2008, Structure.

[22]  S. Fields,et al.  Global analysis of phosphorylation and ubiquitylation cross-talk in protein degradation , 2013, Nature Methods.

[23]  N. Grishin,et al.  Substrate and functional diversity of lysine acetylation revealed by a proteomics survey. , 2006, Molecular cell.

[24]  C. Maki,et al.  Regulation of p53 Nuclear Export through Sequential Changes in Conformation and Ubiquitination* , 2007, Journal of Biological Chemistry.

[25]  Peer Bork,et al.  Deciphering a global network of functionally associated post-translational modifications , 2012, Molecular systems biology.

[26]  N. Kelleher,et al.  Measurement of acetylation turnover at distinct lysines in human histones identifies long-lived acetylation sites , 2013, Nature Communications.

[27]  Ulrich Stelzl,et al.  Dual Coordination of Post Translational Modifications in Human Protein Networks , 2013, PLoS Comput. Biol..

[28]  Dong Xu,et al.  Correlation Between Posttranslational Modification and Intrinsic Disorder in Protein , 2011, Pacific Symposium on Biocomputing.

[29]  G. Hart,et al.  Cross-talk between GlcNAcylation and phosphorylation: Site-specific phosphorylation dynamics in response to globally elevated O-GlcNAc , 2008, Proceedings of the National Academy of Sciences.

[30]  C. Anderson,et al.  Posttranslational modification of p53: cooperative integrators of function. , 2009, Cold Spring Harbor perspectives in biology.

[31]  Albert J R Heck,et al.  Identification of enriched PTM crosstalk motifs from large-scale experimental data sets. , 2014, Journal of proteome research.

[32]  David H Perlman,et al.  Extensive Post-translational Modification of Active and Inactivated Forms of Endogenous p53* , 2013, Molecular & Cellular Proteomics.

[33]  María Martín,et al.  Activities at the Universal Protein Resource (UniProt) , 2013, Nucleic Acids Res..

[34]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[35]  L A Kelley,et al.  OLDERADO: On‐line database of ensemble representatives and domains , 1997, Protein science : a publication of the Protein Society.

[36]  J C Costello,et al.  Seeking the Wisdom of Crowds Through Challenge‐Based Competitions in Biomedical Research , 2013, Clinical pharmacology and therapeutics.

[37]  E. Appella,et al.  Phosphorylation Site Interdependence of Human p53 Post-translational Modifications in Response to Stress* , 2003, Journal of Biological Chemistry.

[38]  D. Speicher,et al.  The CBP Bromodomain and Nucleosome Targeting Are Required for Zta-Directed Nucleosome Acetylation and Transcription Activation , 2003, Molecular and Cellular Biology.

[39]  L. Iakoucheva,et al.  The importance of intrinsic disorder for protein phosphorylation. , 2004, Nucleic acids research.

[40]  V. Uversky Intrinsically Disordered Proteins , 2014 .

[41]  J. R. Hutchins,et al.  Many Fingers on the Mitotic Trigger: Post-Translational Regulation of the Cdc25C Phosphatase , 2004, Cell cycle.

[42]  Christopher J. Oldfield,et al.  Intrinsic disorder in transcription factors. , 2006, Biochemistry.

[43]  A. Panchenko,et al.  Phosphorylation in protein-protein binding: effect on stability and function. , 2011, Structure.

[44]  A. Dunker,et al.  Understanding protein non-folding. , 2010, Biochimica et biophysica acta.

[45]  T. Pawson,et al.  Post-translational modifications in signal integration , 2010, Nature Structural &Molecular Biology.

[46]  B. Benayoun,et al.  A post-translational modification code for transcription factors: sorting through a sea of signals. , 2009, Trends in cell biology.

[47]  Alan J Tackett,et al.  Long-distance combinatorial linkage between methylation and acetylation on histone H3 N termini , 2007, Proceedings of the National Academy of Sciences.

[48]  Xiang-Jiao Yang,et al.  Multisite protein modification and intramolecular signaling , 2005, Oncogene.

[49]  Niall J. Haslam,et al.  Understanding eukaryotic linear motifs and their role in cell signaling and regulation. , 2008, Frontiers in bioscience : a journal and virtual library.

[50]  T. Höfer,et al.  Circadian conformational change of the Neurospora clock protein FREQUENCY triggered by clustered hyperphosphorylation of a basic domain. , 2011, Molecular cell.

[51]  D. Eisenberg,et al.  The hydrophobic moment detects periodicity in protein hydrophobicity. , 1984, Proceedings of the National Academy of Sciences of the United States of America.

[52]  Zoran Obradovic,et al.  Length-dependent prediction of protein intrinsic disorder , 2006, BMC Bioinformatics.

[53]  Bin Zhang,et al.  PhosphoSitePlus: a comprehensive resource for investigating the structure and function of experimentally determined post-translational modifications in man and mouse , 2011, Nucleic Acids Res..

[54]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[55]  Yu Xue,et al.  Computational Prediction of Post-Translational Modification Sites in Proteins , 2011 .

[56]  Predrag Radivojac,et al.  Post-translational modifications induce significant yet not extreme changes to protein structure , 2012, Bioinform..

[57]  M. Wilkins,et al.  Surface accessibility of protein post-translational modifications. , 2007, Journal of proteome research.

[58]  S. Vucetic,et al.  Flavors of protein disorder , 2003, Proteins.

[59]  Siddhartha Roy,et al.  Binding of the histone chaperone ASF1 to the CBP bromodomain promotes histone acetylation , 2014, Proceedings of the National Academy of Sciences.

[60]  Christopher J. Oldfield,et al.  Functional anthology of intrinsic disorder. 1. Biological processes and functions of proteins with long disordered regions. , 2007, Journal of proteome research.

[61]  Vladimir Vacic,et al.  Two Sample Logo: a graphical representation of the differences between two sets of sequence alignments , 2006, Bioinform..

[62]  L. Freedman,et al.  A coactivator code for transcription. , 2002, Trends in biochemical sciences.

[63]  M. Mann,et al.  Status of Large-scale Analysis of Post-translational Modifications by Mass Spectrometry* , 2013, Molecular & Cellular Proteomics.

[64]  Danny Reinberg,et al.  Is there a code embedded in proteins that is based on post-translational modifications? , 2008, Nature Reviews Molecular Cell Biology.

[65]  R. Fisher On the Interpretation of χ2 from Contingency Tables, and the Calculation of P , 2018, Journal of the Royal Statistical Society Series A (Statistics in Society).

[66]  Marc S. Cortese,et al.  Analysis of molecular recognition features (MoRFs). , 2006, Journal of molecular biology.

[67]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[68]  Xiao Zhen Zhou,et al.  Pin1 modulates the structure and function of human RNA polymerase II. , 2003, Genes & development.

[69]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[70]  V. Vacic,et al.  Identification, analysis, and prediction of protein ubiquitination sites , 2010, Proteins.

[71]  L. Johnson,et al.  Structural basis for control by phosphorylation. , 1997, Chemical reviews.

[72]  Sandhya Rani,et al.  Human Protein Reference Database—2009 update , 2008, Nucleic Acids Res..

[73]  Hanno Steen,et al.  Post‐translational modification: nature's escape from genetic imprisonment and the basis for dynamic information encoding , 2012, Wiley interdisciplinary reviews. Systems biology and medicine.

[74]  Janusz M. Bujnicki,et al.  Intrinsic Disorder in the Human Spliceosomal Proteome , 2012, PLoS Comput. Biol..

[75]  L. Jensen,et al.  Mass Spectrometric Analysis of Lysine Ubiquitylation Reveals Promiscuity at Site Level* , 2010, Molecular & Cellular Proteomics.

[76]  Florian Gnad,et al.  PHOSIDA 2011: the posttranslational modification database , 2010, Nucleic Acids Res..

[77]  Allegra Via,et al.  Phospho.ELM: a database of phosphorylation sites—update 2008 , 2007, Nucleic Acids Res..

[78]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[79]  Dong Xu,et al.  Musite, a Tool for Global Prediction of General and Kinase-specific Phosphorylation Sites* , 2010, Molecular & Cellular Proteomics.

[80]  E. Verdin,et al.  Acetylation of mitochondrial proteins. , 2009, Methods in enzymology.

[81]  A. Brunet,et al.  The FoxO code , 2008, Oncogene.

[82]  Zoltán Daróczy,et al.  Generalized Information Functions , 1970, Inf. Control..

[83]  P. Radivojac,et al.  Protein flexibility and intrinsic disorder , 2004, Protein science : a publication of the Protein Society.

[84]  Matthew P Torres,et al.  Deciphering post‐translational modification codes , 2013, FEBS letters.

[85]  S. Teichmann,et al.  Tight Regulation of Unstructured Proteins: From Transcript Synthesis to Protein Degradation , 2008, Science.

[86]  Christopher J. Oldfield,et al.  Functional anthology of intrinsic disorder. 3. Ligands, post-translational modifications, and diseases associated with intrinsically disordered proteins. , 2007, Journal of proteome research.

[87]  S. Boulton,et al.  Playing the end game: DNA double-strand break repair pathway choice. , 2012, Molecular cell.

[88]  R. Nussinov,et al.  Allosteric post-translational modification codes. , 2012, Trends in biochemical sciences.

[89]  The UniProt Consortium,et al.  Update on activities at the Universal Protein Resource (UniProt) in 2013 , 2012, Nucleic Acids Res..

[90]  Raymond A Dwek,et al.  Statistical analysis of the protein environment of N-glycosylation sites: implications for occupancy, structure, and folding. , 2003, Glycobiology.

[91]  C. Allis,et al.  The language of covalent histone modifications , 2000, Nature.

[92]  Zoran Obradovic,et al.  DisProt: the Database of Disordered Proteins , 2006, Nucleic Acids Res..

[93]  Adam Godzik,et al.  Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences , 2006, Bioinform..

[94]  Ning-Sun Yang,et al.  Systems and Computational Biology - Molecular and Cellular Experimental Systems , 2011 .

[95]  Christopher J. Oldfield,et al.  Intrinsic disorder and functional proteomics. , 2007, Biophysical journal.

[96]  Richard J. Edwards,et al.  Computational identification and analysis of protein short linear motifs. , 2010, Frontiers in bioscience.

[97]  W. Lim,et al.  Systematic Functional Prioritization of Protein Posttranslational Modifications , 2012, Cell.