15 years of PhosphoSitePlus®: integrating post-translationally modified sites, disease variants and isoforms

Abstract For 15 years the mission of PhosphoSitePlus® (PSP, https://www.phosphosite.org) has been to provide comprehensive information and tools for the study of mammalian post-translational modifications (PTMs). PSP now contains more than 450 000 unique PTMs drawn from over 22 000 articles and thousands of MS datasets. The most important areas of growth in PSP are disease and isoform informatics. Germline mutations associated with inherited diseases and somatic cancer mutations have been added to the database and can now be viewed alongside PTMs and associated quantitative information on novel ‘lollipop’ plots. These plots enable researchers to interactively visualize the overlap between disease variants and PTMs, and to identify mutations that may alter phenotypes by rewiring signaling networks. We are expanding the sequence space to include over 30 000 human and mouse isoforms so that researchers can explore the important but understudied biology of isoforms. This expansion is necessary to accommodate the growing precision and depth of coverage enabled by ongoing advances in mass spectrometry. Isoforms are aligned using a new algorithm. We hope that exploring PTMs and disease mutations across the entire isoform space will lead to new biomarkers, therapeutic targets, and insights into isoform biology.
