Evolution of protein phosphorylation across 18 fungal species

Phosphorylation and fungal evolution Phosphorylation after transcription modifies the activity of proteins. To understand how phosphorylation sites have evolved, Studer et al. studied a range of fungal species (see the Perspective by Matalon et al.). Only a few sites were apparently present in the common ancestor of all 18 species investigated. Evolutionary age appeared to predict the potential functional importance of specific conserved phosphosites. Science, this issue p. 229; see also p. 176 Phosphorylation in fungal proteins offers an understanding of evolutionary constraints acting on posttranscriptional modification. Living organisms have evolved protein phosphorylation, a rapid and versatile mechanism that drives signaling and regulates protein function. We report the phosphoproteomes of 18 fungal species and a phylogenetic-based approach to study phosphosite evolution. We observe rapid divergence, with only a small fraction of phosphosites conserved over hundreds of millions of years. Relative to recently acquired phosphosites, ancient sites are enriched at protein interfaces and are more likely to be functionally important, as we show for sites on H2A1 and eIF4E. We also observe a change in phosphorylation motif frequencies and kinase activities that coincides with the whole-genome duplication event. Our results provide an evolutionary history for phosphosites and suggest that rapid evolution of phosphorylation can contribute strongly to phenotypic diversity.

[1]  S. Bohaty,et al.  The late Eocene clinopyroxene-bearing spherule layer: New sites, nature of the strewn field, Ir data, and discovery of coesite and shocked quartz , 2009 .

[2]  Yoshinori Watanabe,et al.  Phosphorylation of H2A by Bub1 Prevents Chromosomal Instability Through Localizing Shugoshin , 2010, Science.

[3]  D. Odom,et al.  Evolution of transcription factor binding in metazoans — mechanisms and functional implications , 2014, Nature Reviews Genetics.

[4]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[5]  François Stricher,et al.  The FoldX web server: an online force field , 2005, Nucleic Acids Res..

[6]  Wei Qian,et al.  Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis. , 2000, Molecular biology and evolution.

[7]  S. Gygi,et al.  Sensitive multiplexed analysis of kinase activities and activity-based kinase identification , 2009, Nature Biotechnology.

[8]  J. Eng,et al.  Comet: An open‐source MS/MS sequence database search tool , 2013, Proteomics.

[9]  P. Aloy,et al.  Interactome3D: adding structural details to protein networks , 2013, Nature Methods.

[10]  Edith D. Wong,et al.  Saccharomyces Genome Database: the genomics resource of budding yeast , 2011, Nucleic Acids Res..

[11]  B. Glass,et al.  Distal Impact Ejecta Layers: A Record of Large Impacts in Sedimentary Deposits , 2013 .

[12]  Cole Trapnell,et al.  TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions , 2013, Genome Biology.

[13]  J. Coon,et al.  Value of using multiple proteases for large-scale mass spectrometry-based proteomics. , 2010, Journal of proteome research.

[14]  M. King,et al.  Evolution at two levels in humans and chimpanzees. , 1975, Science.

[15]  J. Villén,et al.  A practical recipe to survey phosphoproteomes. , 2014, Methods in molecular biology.

[16]  Hao Li,et al.  The Evolution of Combinatorial Gene Regulation in Fungi , 2008, PLoS biology.

[17]  Leopold Parts,et al.  A High-Definition View of Functional Genetic Variation from Natural Yeast Genomes , 2014, Molecular biology and evolution.

[18]  Geoffrey J. Barton,et al.  Jalview Version 2—a multiple sequence alignment editor and analysis workbench , 2009, Bioinform..

[19]  Henk W. P. van den Toorn,et al.  An Augmented Multiple-Protease-Based Human Phosphopeptide Atlas. , 2015, Cell reports.

[20]  W. Lim,et al.  Evolution of Phosphoregulation: Comparison of Phosphorylation Patterns across Yeast Species , 2009, PLoS biology.

[21]  C. Gaillardin,et al.  The intronome of budding yeasts. , 2011, Comptes rendus biologies.

[22]  Steven P Gygi,et al.  Target-decoy search strategy for increased confidence in large-scale protein identifications by mass spectrometry , 2007, Nature Methods.

[23]  Songnian Hu,et al.  EvolView, an online tool for visualizing, annotating and managing phylogenetic trees , 2012, Nucleic Acids Res..

[24]  Evgeny M. Zdobnov,et al.  The Newick utilities: high-throughput phylogenetic tree processing in the Unix shell , 2010, Bioinform..

[25]  S. Gygi,et al.  Global Analysis of Cdk1 Substrate Phosphorylation Sites Provides Insights into Evolution , 2009, Science.

[26]  W. Lim,et al.  Systematic Functional Prioritization of Protein Posttranslational Modifications , 2012, Cell.

[27]  Sudhir Kumar,et al.  Tree of Life Reveals Clock-Like Speciation and Diversification , 2014, Molecular biology and evolution.

[28]  Peter J. Park,et al.  Unphosphorylated SR-Like Protein Npl3 Stimulates RNA Polymerase II Elongation , 2008, PloS one.

[29]  M. Mann,et al.  Protocol for micro-purification, enrichment, pre-fractionation and storage of peptides for proteomics using StageTips , 2007, Nature Protocols.

[30]  L. Jensen,et al.  KinomeXplorer: an integrated platform for kinome biology studies , 2014, Nature Methods.

[31]  K. Katoh,et al.  MAFFT Multiple Sequence Alignment Software Version 7: Improvements in Performance and Usability , 2013, Molecular biology and evolution.

[32]  James E. Ferrell,et al.  A Mechanism for the Evolution of Phosphorylation Sites , 2011, Cell.

[33]  W. S. Valdar,et al.  Scoring residue conservation , 2002, Proteins.

[34]  M. Mann,et al.  Stable Isotope Labeling by Amino Acids in Cell Culture, SILAC, as a Simple and Accurate Approach to Expression Proteomics* , 2002, Molecular & Cellular Proteomics.

[35]  Jonathan J. Ellis,et al.  Predicting Protein Kinase Specificity: Predikin Update and Performance in the DREAM4 Challenge , 2011, PloS one.

[36]  Steven P Gygi,et al.  The SCX/IMAC enrichment approach for global phosphorylation analysis by mass spectrometry , 2008, Nature Protocols.

[37]  C. Landry,et al.  Weak functional constraints on phosphoproteomes. , 2009, Trends in genetics : TIG.

[38]  J. Yates,et al.  An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database , 1994, Journal of the American Society for Mass Spectrometry.

[39]  K. H. Wolfe,et al.  Molecular evidence for an ancient duplication of the entire yeast genome , 1997, Nature.

[40]  Korbinian Strimmer,et al.  APE: Analyses of Phylogenetics and Evolution in R language , 2004, Bioinform..

[41]  Wei Shi,et al.  featureCounts: an efficient general purpose program for assigning sequence reads to genomic features , 2013, Bioinform..

[42]  Gaston H. Gonnet,et al.  The OMA orthology database in 2015: function predictions, better plant support, synteny view and other improvements , 2014, Nucleic Acids Res..

[43]  Yu Xue,et al.  GPS 2.0, a Tool to Predict Kinase-specific Phosphorylation Sites in Hierarchy *S , 2008, Molecular & Cellular Proteomics.

[44]  G. Barton,et al.  Classification and functional annotation of eukaryotic protein kinases , 2007, Proteins.

[45]  Richard Durbin,et al.  Regulatory evolution in proteins by turnover and lineage-specific changes of cyclin-dependent kinase consensus sites , 2007, Proceedings of the National Academy of Sciences.

[46]  S. Gygi,et al.  A site-specific, multiplexed kinase activity assay using stable-isotope dilution and high-resolution mass spectrometry , 2009, Proceedings of the National Academy of Sciences.

[47]  Kara Dolinski,et al.  The PhosphoGRID Saccharomyces cerevisiae protein phosphorylation site database: version 2.0 update , 2013, Database J. Biol. Databases Curation.

[48]  Bartek Wilczynski,et al.  Biopython: freely available Python tools for computational molecular biology and bioinformatics , 2009, Bioinform..

[49]  Steven P Gygi,et al.  A probability-based approach for high-throughput protein phosphorylation analysis and site localization , 2006, Nature Biotechnology.

[50]  Mark Gerstein,et al.  Biochemical and genetic analysis of the yeast proteome with a movable ORF collection. , 2005, Genes & development.

[51]  Manolis Kellis,et al.  Evolutionary principles of modular gene regulation in yeasts , 2013, eLife.

[52]  Edith D. Wong,et al.  Saccharomyces genome database provides new regulation data , 2013, Nucleic Acids Res..

[53]  Luca Freschi,et al.  Functional Divergence and Evolutionary Turnover in Mammalian Phosphoproteomes , 2014, PLoS genetics.

[54]  A. Shilatifard,et al.  A comprehensive library of histone mutants identifies nucleosomal residues required for H3K4 methylation , 2008, Nature Structural &Molecular Biology.

[55]  W. Huber,et al.  Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2 , 2014, Genome Biology.

[56]  Fabian Rudolf,et al.  Transient Activation of the HOG MAPK Pathway Regulates Bimodal Gene Expression , 2011, Science.

[57]  S. Moreno,et al.  Characterization of yeast pyruvate kinase 1 as a protein kinase A substrate, and specificity of the phosphorylation site sequence in the whole protein. , 2006, The Biochemical journal.