Characterization of proteoform post-translational modifications by top-down and bottom-up mass spectrometry in conjunction with UniProt annotations

Many proteoforms can be produced from a gene due to genetic mutations, alternative splicing, post-translational modifications (PTMs), and other variations. PTMs in proteoforms play critical roles in cell signaling, protein degradation, and other biological processes. Mass spectrometry (MS) is the primary technique for investigating PTMs in proteoforms, and two alternative MS approaches, top-down and bottom-up, have complementary strengths. The combination of the two approaches has the potential to increase the sensitivity and accuracy in PTM identification and characterization. In addition, protein and PTM knowledgebases, such as UniProt, provide valuable information for PTM characterization and validation. Here, we present a software pipeline called PTM-TBA (PTM characterization by Top-down, Bottom-up MS and Annotations) for identifying and localizing PTMs in proteoforms by integrating top-down and bottom-up MS as well as UniProt annotations. We identified 1,662 mass shifts from a top-down MS data set of SW480 cells, 545 (33%) of which were matched to 12 common PTMs, and 351 of which were localized. PTM-TBA validated 346 of the 1,662 mass shifts using UniProt annotations or a bottom-up MS data set of SW480 cells.

[1]  A. Hummon,et al.  Deep top-down proteomics revealed significant proteoform-level differences between metastatic and nonmetastatic colorectal cancer cells , 2022, Science advances.

[2]  Abdul Rehman Basharat,et al.  TopFD: A Proteoform Feature Detection Tool for Top–Down Proteomics , 2022, bioRxiv.

[3]  Kaixian Chen,et al.  Dynamics of Post-Translational Modification Inspires Drug Design in the Kinase Family. , 2021, Journal of medicinal chemistry.

[4]  Leah V Schaffer,et al.  Improving Proteoform Identifications in Complex Systems Through Integration of Bottom-Up and Top-Down Data. , 2020, Journal of proteome research.

[5]  Wenrong Chen,et al.  Proteoform Identification by Combining RNA-Seq and Top-down Mass Spectrometry , 2020, bioRxiv.

[6]  Eric W. Deutsch,et al.  PTMProphet: Fast and Accurate Mass Modification Localization for the Trans-Proteomic Pipeline , 2019, bioRxiv.

[7]  B. Li,et al.  Advances of Proteomics in Novel PTM Discovery: Applications in Cancer Therapy , 2019, Small Methods.

[8]  Geoffrey I. Webb,et al.  PRISMOID: a comprehensive 3D structure database for post-translational modifications and mutations with functional impact , 2019, bioRxiv.

[9]  Costel C. Darie,et al.  Identification of Posttranslational Modifications (PTMs) of Proteins by Mass Spectrometry. , 2019, Advances in experimental medicine and biology.

[10]  A. Hummon,et al.  Comparison of In-Solution, FASP, and S-Trap Based Digestion Methods for Bottom-Up Proteomic Studies. , 2018, Journal of proteome research.

[11]  Ying Ge,et al.  Top-Down Proteomics: Ready for Prime Time? , 2018, Analytical chemistry.

[12]  Jungkap Park,et al.  Informed-Proteomics: Open Source Software Package for Top-down Proteomics , 2017, Nature Methods.

[13]  Xiaowen Liu,et al.  A mass graph‐based approach for the identification of modified proteoforms using top‐down tandem mass spectra , 2016, Bioinform..

[14]  Jüergen Cox,et al.  The MaxQuant computational platform for mass spectrometry-based shotgun proteomics , 2016, Nature Protocols.

[15]  Qiang Kou,et al.  TopPIC: a software tool for top-down mass spectrometry-based proteoform identification and characterization , 2016, Bioinform..

[16]  Nikola Tolić,et al.  Characterization of Proteoforms with Unknown Post-translational Modifications Using the MIScore. , 2016, Journal of proteome research.

[17]  Hsien-Da Huang,et al.  dbPTM 2016: 10-year anniversary of a resource for post-translational modification of proteins , 2015, Nucleic Acids Res..

[18]  Edward L. Huttlin,et al.  A mass-tolerant database search identifies a large proportion of unassigned spectra in shotgun proteomics as modified peptides , 2015, Nature Biotechnology.

[19]  Pavel A. Pevzner,et al.  Universal database search tool for proteomics , 2014, Nature Communications.

[20]  Neil L. Kelleher,et al.  The C-Score: A Bayesian Framework to Sharply Improve Proteoform Scoring in High-Throughput Top Down Proteomics , 2014, Journal of proteome research.

[21]  Jared B Shaw,et al.  The first pilot project of the consortium for top‐down proteomics: A status report , 2014, Proteomics.

[22]  Wei Li,et al.  SysPTM 2.0: an updated systematic resource for post-translational modification , 2014, Database J. Biol. Databases Curation.

[23]  N. Kelleher,et al.  Top Down proteomics: facts and perspectives. , 2014, Biochemical and biophysical research communications.

[24]  M. Mann,et al.  Status of Large-scale Analysis of Post-translational Modifications by Mass Spectrometry* , 2013, Molecular & Cellular Proteomics.

[25]  Bronwen L. Aken,et al.  GENCODE: The reference human genome annotation for The ENCODE Project , 2012, Genome research.

[26]  H. Schlüter,et al.  Amino acids: chemistry, functionality and selected non-enzymatic post-translational modifications. , 2012, Journal of proteomics.

[27]  T. Köcher,et al.  Universal and confident phosphorylation site localization using phosphoRS. , 2011, Journal of proteome research.

[28]  M. Mann,et al.  Andromeda: a peptide search engine integrated into the MaxQuant environment. , 2011, Journal of proteome research.

[29]  Markus Brosch,et al.  Accurate and sensitive peptide identification with Mascot Percolator. , 2009, Journal of proteome research.

[30]  Martin Zeller,et al.  SLoMo: automated site localization of modifications from ETD/ECD mass spectra. , 2009, Journal of proteome research.

[31]  Richard D. Smith,et al.  An integrated top-down and bottom-up strategy for broadly characterizing protein isoforms and modifications. , 2009, Journal of proteome research.

[32]  Luisa Montecchi-Palazzi,et al.  The PSI-MOD community standard for representation of protein modification data , 2008, Nature Biotechnology.

[33]  Robert Burke,et al.  ProteoWizard: open source software for rapid proteomics tools development , 2008, Bioinform..

[34]  Laura A. Sullivan,et al.  Global Survey of Phosphotyrosine Signaling Identifies Oncogenic Kinases in Lung Cancer , 2007, Cell.

[35]  Yong-Bin Kim,et al.  ProSight PTM 2.0: improved protein identification and characterization for top down mass spectrometry , 2007, Nucleic Acids Res..

[36]  Steven P Gygi,et al.  A probability-based approach for high-throughput protein phosphorylation analysis and site localization , 2006, Nature Biotechnology.

[37]  J. Marth,et al.  Glycosylation in Cellular Mechanisms of Health and Disease , 2006, Cell.

[38]  Dekel Tsur,et al.  Identification of post-translational modifications via blind search of mass-spectra , 2005, 2005 IEEE Computational Systems Bioinformatics Conference (CSB'05).

[39]  P. Roepstorff,et al.  Highly Selective Enrichment of Phosphorylated Peptides from Peptide Mixtures Using Titanium Dioxide Microcolumns* , 2005, Molecular & Cellular Proteomics.

[40]  J. Rush,et al.  Immunoaffinity profiling of tyrosine phosphorylation in cancer cells , 2005, Nature Biotechnology.

[41]  D. Creasy,et al.  Unimod: Protein modifications for mass spectrometry , 2004, Proteomics.

[42]  Cathy H. Wu,et al.  UniProt: the Universal Protein knowledgebase , 2004, Nucleic Acids Res..

[43]  D. Eisenberg,et al.  Protein function in the post-genomic era , 2000, Nature.