pGlyco 2.0 enables precision N-glycoproteomics with comprehensive quality control and one-step mass spectrometry for intact glycopeptide identification

The precise and large-scale identification of intact glycopeptides is a critical step in glycoproteomics. Owing to the complexity of glycosylation, the current overall throughput, data quality and accessibility of intact glycopeptide identification lack behind those in routine proteomic analyses. Here, we propose a workflow for the precise high-throughput identification of intact N-glycopeptides at the proteome scale using stepped-energy fragmentation and a dedicated search engine. pGlyco 2.0 conducts comprehensive quality control including false discovery rate evaluation at all three levels of matches to glycans, peptides and glycopeptides, improving the current level of accuracy of intact glycopeptide identification. The N-glycoproteome of samples metabolically labeled with 15N/13C were analyzed quantitatively and utilized to validate the glycopeptide identification, which could be used as a novel benchmark pipeline to compare different search engines. Finally, we report a large-scale glycoproteome dataset consisting of 10,009 distinct site-specific N-glycans on 1988 glycosylation sites from 955 glycoproteins in five mouse tissues.Protein glycosylation is a heterogeneous post-translational modification that generates greater proteomic diversity that is difficult to analyze. Here the authors describe pGlyco 2.0, a workflow for the precise one step identification of intact N-glycopeptides at the proteome scale.

[1]  Robert J Chalkley,et al.  Characterizing sialic acid variants at the glycopeptide level. , 2015, Analytical chemistry.

[2]  Liang Li,et al.  Two-dimensional mass spectra generated from the analysis of 15N-labeled and unlabeled peptides for efficient protein identification and de novo peptide sequencing. , 2004, Journal of proteome research.

[3]  Suh-Yuen Liang,et al.  Sweet-Heart - an integrated suite of enabling computational tools for automated MS2/MS3 sequencing and identification of glycopeptides. , 2013, Journal of proteomics.

[4]  William F. Martin,et al.  Automated glycopeptide analysis - review of current state and future directions , 2013, Briefings Bioinform..

[5]  Jin Young Kim,et al.  Integrated GlycoProteome Analyzer (I-GPA) for Automated Identification and Quantitation of Site-Specific N-Glycosylation , 2016, Scientific Reports.

[6]  J. Paulson,et al.  Glycomics: an integrated systems approach to structure-function relationships of glycans , 2005, Nature Methods.

[7]  C. Lebrilla,et al.  Annotation of a serum N-glycan library for rapid identification of structures. , 2012, Journal of proteome research.

[8]  Johannes Griss,et al.  The Proteomics Identifications (PRIDE) database and associated tools: status in 2013 , 2012, Nucleic Acids Res..

[9]  Joseph Zaia,et al.  Algorithms and design strategies towards automated glycoproteomics analysis. , 2017, Mass spectrometry reviews.

[10]  B. Ma,et al.  GlycoMaster DB: software to assist the automated identification of N-linked glycopeptides by tandem mass spectrometry. , 2014, Journal of proteome research.

[11]  Serenus Hua,et al.  Automated assignments of N- and O-site specific glycosylation with extensive glycan heterogeneity of glycoprotein mixtures. , 2013, Analytical chemistry.

[12]  Hao Chi,et al.  pQuant improves quantitation by keeping out interfering signals and evaluating the accuracy of calculated ratios. , 2014, Analytical chemistry.

[13]  Edward L Huttlin,et al.  Implications of 15N‐metabolic labeling for automated peptide identification in Arabidopsis thaliana , 2007, Proteomics.

[14]  Florian Gnad,et al.  Precision Mapping of an In Vivo N-Glycoproteome Reveals Rigid Topological and Sequence Constraints , 2010, Cell.

[15]  Xingde Li,et al.  GPQuest: A Spectral Library Matching Algorithm for Site-Specific Assignment of Tandem Mass Spectra to Intact N-glycopeptides. , 2015, Analytical chemistry.

[16]  Chen-Chun Chen,et al.  MAGIC: an automated N-linked glycoprotein identification tool using a Y1-ion pattern matching algorithm and in silico MS² approach. , 2015, Analytical chemistry.

[17]  Phillip C Wright,et al.  Novel approach for peptide quantitation and sequencing based on 15N and 13C metabolic labeling. , 2005, Journal of proteome research.

[18]  Haixu Tang,et al.  Computational framework for identification of intact glycopeptides in complex samples. , 2014, Analytical chemistry.

[19]  Zhikai Zhu,et al.  GlycoPep Detector: a tool for assigning mass spectrometry data of N-linked glycopeptides on the basis of their electron transfer dissociation spectra. , 2013, Analytical chemistry.

[20]  J. Marth,et al.  Glycosylation in Cellular Mechanisms of Health and Disease , 2006, Cell.

[21]  Nichollas E. Scott,et al.  Site-specific glycan-peptide analysis for determination of N-glycoproteome heterogeneity. , 2013, Journal of proteome research.

[22]  Wen Gao,et al.  pFind: a novel database-searching software system for automated peptide and protein identification via tandem mass spectrometry , 2005, Bioinform..

[23]  F. He,et al.  Efficient and accurate glycopeptide identification pipeline for high-throughput site-specific N-glycosylation analysis. , 2014, Journal of proteome research.

[24]  Daniel Kolarich,et al.  The Art of Destruction: Optimizing Collision Energies in Quadrupole-Time of Flight (Q-TOF) Instruments for Glycopeptide-Based Glycoproteomics , 2016, Journal of The American Society for Mass Spectrometry.

[25]  Wen Gao,et al.  pFind 2.0: a software package for peptide and protein identification via tandem mass spectrometry. , 2007, Rapid communications in mass spectrometry : RCM.

[26]  Gerald W. Hart,et al.  Glycomics Hits the Big Time , 2010, Cell.

[27]  M. Gerstein,et al.  An integrated systems approach to structure-function relationships of glycans , 2005 .

[28]  Derek J. Bailey,et al.  One-hour proteome analysis in yeast , 2015, Nature Protocols.

[29]  Angela M Zivkovic,et al.  Simultaneous and extensive site-specific N- and O-glycosylation analysis in protein mixtures. , 2011, Journal of proteome research.

[30]  Robert J. Chalkley,et al.  Tissue-Specific Glycosylation at the Glycopeptide Level* , 2015, Molecular & Cellular Proteomics.

[31]  Eric D. Dodds,et al.  Gas-phase dissociation of glycosylated peptide ions. , 2012, Mass spectrometry reviews.

[32]  Hui Zhang,et al.  Integrated Proteomic and Glycoproteomic Analyses of Prostate Cancer Cells Reveal Glycoprotein Alteration in Protein Abundance and Glycosylation* , 2015, Molecular & Cellular Proteomics.

[33]  Tsung-Hsien Pu,et al.  Novel LC-MS² product dependent parallel data acquisition function and data analysis workflow for sequencing and identification of intact glycopeptides. , 2014, Analytical chemistry.

[34]  Jonas Nilsson,et al.  Human Urinary Glycoproteomics; Attachment Site Specific Analysis of N- and O-Linked Glycosylations by CID and ECD* , 2011, Molecular & Cellular Proteomics.

[35]  Paul Aiyetan,et al.  Comprehensive analysis of protein glycosylation by solid-phase extraction of N-linked glycans and glycosite-containing peptides , 2016, Nature Biotechnology.

[36]  Milos V. Novotny,et al.  High-sensitivity analytical approaches for the structural characterization of glycoproteins. , 2013, Chemical reviews.

[37]  W. Ying,et al.  Strategy integrating stepped fragmentation and glycan diagnostic ion-based spectrum refinement for the identification of core fucosylated glycoproteome using mass spectrometry. , 2014, Analytical chemistry.

[38]  A. Varki,et al.  Why Is N-Glycolylneuraminic Acid Rare in the Vertebrate Brain? , 2013, Topics in current chemistry.

[39]  Daniel Figeys,et al.  Large-scale characterization of intact N-glycopeptides using an automated glycoproteomic method. , 2014, Journal of proteomics.

[40]  Chao Liu,et al.  pParse: A method for accurate determination of monoisotopic peaks in high‐resolution mass spectra , 2012, Proteomics.

[41]  C. Bertozzi,et al.  Isotope-targeted glycoproteomics (IsoTaG): a mass-independent platform for intact N- and O-glycopeptide discovery and analysis , 2015, Nature Methods.

[42]  H. Desaire Glycopeptide Analysis, Recent Developments and Applications* , 2013, Molecular & Cellular Proteomics.

[43]  Hao Chi,et al.  pGlyco: a pipeline for the identification of intact N-glycopeptides by using HCD- and CID-MS/MS and MS3 , 2016, Scientific Reports.

[44]  Radoslav Goldman,et al.  Exploring site-specific N-glycosylation microheterogeneity of haptoglobin using glycopeptide CID tandem mass spectra and glycan database search. , 2013, Journal of proteome research.

[45]  L. R. Ruhaak,et al.  A Method for Comprehensive Glycosite-Mapping and Direct Quantitation of Serum Glycoproteins. , 2015, Journal of proteome research.