Large-scale intact glycopeptide identification by Mascot database search

Workflows capable of determining glycopeptides in large-scale are missing in the field of glycoproteomics. We present an approach for automated annotation of intact glycopeptide mass spectra. The steps in adopting the Mascot search engine for intact glycopeptide analysis included: (i) assigning one letter codes for monosaccharides, (ii) linearizing glycan sequences and (iii) preparing custom glycoprotein databases. Automated annotation of both N- and O-linked glycopeptides was proven using standard glycoproteins. In a large-scale study, a total of 257 glycoproteins containing 970 unique glycosylation sites and 3447 non-redundant N-linked glycopeptide variants were identified in 24 serum samples. Thus, a single tool was developed that collectively allows the (i) elucidation of N- and O-linked glycopeptide spectra, (ii) matching glycopeptides to known protein sequences, and (iii) high-throughput, batch-wise analysis of large-scale glycoproteomics data sets.

[1]  J. Marth,et al.  Glycosylation in Cellular Mechanisms of Health and Disease , 2006, Cell.

[2]  Radoslav Goldman,et al.  Exploring site-specific N-glycosylation microheterogeneity of haptoglobin using glycopeptide CID tandem mass spectra and glycan database search. , 2013, Journal of proteome research.

[3]  Nicolle H Packer,et al.  Toward Automated N-Glycopeptide Identification in Glycoproteomics. , 2016, Journal of proteome research.

[4]  Jonathan C Trinidad,et al.  N- and O-Glycosylation in the Murine Synaptosome* , 2013, Molecular & Cellular Proteomics.

[5]  Jian Min Ren,et al.  N-Glycan structure annotation of glycopeptides using a linearized glycan structure database (GlyDB). , 2007, Journal of proteome research.

[6]  Andrew R. Jones,et al.  ProteomeXchange provides globally co-ordinated proteomics data submission and dissemination , 2014, Nature Biotechnology.

[7]  David E James,et al.  Terminal Galactosylation and Sialylation Switching on Membrane Glycoproteins upon TNF-Alpha-Induced Insulin Resistance in Adipocytes* , 2015, Molecular & Cellular Proteomics.

[8]  Nuno Bandeira,et al.  SweetNET: A Bioinformatics Workflow for Glycopeptide MS/MS Spectral Analysis. , 2016, Journal of proteome research.

[9]  Jan Haug Anonsen,et al.  An extended spectrum of target proteins and modification sites in the general O-linked protein glycosylation system in Neisseria gonorrhoeae. , 2012, Journal of proteome research.

[10]  Xiaomeng Su,et al.  New Glycoproteomics Software, GlycoPep Evaluator, Generates Decoy Glycopeptides de Novo and Enables Accurate False Discovery Rate Analysis for Small Data Sets , 2014, Analytical chemistry.

[11]  Albert J R Heck,et al.  Toward full peptide sequence coverage by dual fragmentation combining electron-transfer and higher-energy collision dissociation tandem mass spectrometry. , 2012, Analytical chemistry.

[12]  Haixu Tang,et al.  Computational framework for identification of intact glycopeptides in complex samples. , 2014, Analytical chemistry.

[13]  Bieke Dejaegher,et al.  Glycan characterization of biopharmaceuticals: Updates and perspectives. , 2016, Analytica chimica acta.

[14]  Chengchao Xu,et al.  Glycosylation-directed quality control of protein folding , 2015, Nature Reviews Molecular Cell Biology.

[15]  Martin R Larsen,et al.  Selective enrichment of sialic acid–containing glycopeptides using titanium dioxide chromatography with analysis by HILIC and mass spectrometry , 2010, Nature Protocols.

[16]  Daniel Kolarich,et al.  The Art of Destruction: Optimizing Collision Energies in Quadrupole-Time of Flight (Q-TOF) Instruments for Glycopeptide-Based Glycoproteomics , 2016, Journal of The American Society for Mass Spectrometry.

[17]  Ningombam Sanjib Meitei,et al.  Bioinformatics in glycomics: glycan characterization with mass spectrometric data using SimGlycan. , 2010, Methods in molecular biology.

[18]  Jan Haug Anonsen,et al.  Insights into type IV pilus biogenesis and dynamics from genetic analysis of a C‐terminally tagged pilin: a role for O‐linked glycosylation , 2012, Molecular microbiology.

[19]  Wen-Lian Hsu,et al.  MAGIC-web: a platform for untargeted and targeted N-linked glycoprotein identification , 2016, Nucleic Acids Res..

[20]  Hui Zhang,et al.  Integrated Proteomic and Glycoproteomic Analyses of Prostate Cancer Cells Reveal Glycoprotein Alteration in Protein Abundance and Glycosylation* , 2015, Molecular & Cellular Proteomics.

[21]  Yehia Mechref,et al.  Use of CID/ETD Mass Spectrometry to Analyze Glycopeptides , 2012, Current protocols in protein science.

[22]  Rosa Viner,et al.  Increasing the Productivity of Glycopeptides Analysis by Using Higher-Energy Collision Dissociation-Accurate Mass-Product-Dependent Electron Transfer Dissociation , 2012, International journal of proteomics.

[23]  S. Pinho,et al.  Glycosylation in cancer: mechanisms and clinical implications , 2015, Nature Reviews Cancer.

[24]  Feng Li,et al.  Glycobioinformatics: Current strategies and tools for data mining in MS‐based glycoproteomics , 2013, Proteomics.

[25]  Ian G. Mills,et al.  The role of glycans in the development and progression of prostate cancer , 2016, Nature Reviews Urology.

[26]  Hao Chi,et al.  pGlyco: a pipeline for the identification of intact N-glycopeptides by using HCD- and CID-MS/MS and MS3 , 2016, Scientific Reports.

[27]  Chen-Chun Chen,et al.  MAGIC: an automated N-linked glycoprotein identification tool using a Y1-ion pattern matching algorithm and in silico MS² approach. , 2015, Analytical chemistry.

[28]  Chen-Zhong Li,et al.  Lectin approaches for glycoproteomics in FDA-approved cancer biomarkers , 2014, Expert review of proteomics.

[29]  Nichollas E. Scott,et al.  Site-specific glycan-peptide analysis for determination of N-glycoproteome heterogeneity. , 2013, Journal of proteome research.

[30]  Robert J Chalkley,et al.  Mass Spectrometric Analysis, Automated Identification and Complete Annotation of O-Linked Glycopeptides , 2010, European journal of mass spectrometry.

[31]  Albert J R Heck,et al.  Expanding the detectable HLA peptide repertoire using electron-transfer/higher-energy collision dissociation (EThcD) , 2014, Proceedings of the National Academy of Sciences.

[32]  Alexandre M J J Bonvin,et al.  Extended O-GlcNAc on HLA Class-I-Bound Peptides. , 2015, Journal of the American Chemical Society.

[33]  R Apweiler,et al.  On the frequency of protein glycosylation, as deduced from analysis of the SWISS-PROT database. , 1999, Biochimica et biophysica acta.

[34]  Nicolle H Packer,et al.  Advances in LC-MS/MS-based glycoproteomics: getting closer to system-wide site-specific mapping of the N- and O-glycoproteome. , 2014, Biochimica et biophysica acta.

[35]  Marshall Bern,et al.  Glycoproteomic Analysis of the Secretome of Human Endothelial Cells , 2013, Molecular & Cellular Proteomics.

[36]  Y. Kooyk,et al.  Protein-glycan interactions in the control of innate and adaptive immune responses , 2008, Nature Immunology.

[37]  Joseph Zaia,et al.  Algorithms and design strategies towards automated glycoproteomics analysis. , 2017, Mass spectrometry reviews.

[38]  Yong J. Kil,et al.  Byonic: Advanced Peptide and Protein Identification Software , 2012, Current protocols in bioinformatics.

[39]  Zhikai Zhu,et al.  GlycoPep Detector: a tool for assigning mass spectrometry data of N-linked glycopeptides on the basis of their electron transfer dissociation spectra. , 2013, Analytical chemistry.

[40]  A. Helenius,et al.  Intracellular functions of N-linked glycans. , 2001, Science.

[41]  Suh-Yuen Liang,et al.  Sweet-Heart - an integrated suite of enabling computational tools for automated MS2/MS3 sequencing and identification of glycopeptides. , 2013, Journal of proteomics.

[42]  Quan Quan,et al.  Online two-dimensional porous graphitic carbon/reversed phase liquid chromatography platform applied to shotgun proteomics and glycoproteomics. , 2014, Analytical chemistry.

[43]  Peter R Baker,et al.  Modification Site Localization Scoring Integrated into a Search Engine* , 2011, Molecular & Cellular Proteomics.

[44]  Yehia Mechref,et al.  Recent advances in mass spectrometric analysis of glycoproteins , 2017, Electrophoresis.

[45]  M. L. Silva,et al.  Cancer serum biomarkers based on aberrant post-translational modifications of glycoproteins: Clinical value and discovery strategies. , 2015, Biochimica et biophysica acta.

[46]  Milos V Novotny,et al.  Recent Advances in the Analysis of Complex Glycoproteins. , 2017, Analytical chemistry.

[47]  B. Ma,et al.  GlycoMaster DB: software to assist the automated identification of N-linked glycopeptides by tandem mass spectrometry. , 2014, Journal of proteome research.

[48]  Anders Engeland,et al.  Cohort Profile Update: The Janus Serum Bank Cohort in Norway. , 2017, International journal of epidemiology.

[49]  Friedrich Altmann,et al.  Site-specific analysis of the O-glycosylation of bovine fetuin by electron-transfer dissociation mass spectrometry. , 2014, Journal of proteomics.

[50]  Matthew P. Campbell,et al.  Quantitative profiling of glycans and glycopeptides: an informatics' perspective. , 2016, Current opinion in structural biology.

[51]  Heather Desaire,et al.  Software for automated interpretation of mass spectrometry data from glycans and glycopeptides. , 2013, The Analyst.

[52]  Joseph Zaia,et al.  A review of methods for interpretation of glycopeptide tandem mass spectral data , 2015, Glycoconjugate Journal.

[53]  Haixu Tang,et al.  Mapping site-specific protein N-glycosylations through liquid chromatography/mass spectrometry and targeted tandem mass spectrometry. , 2010, Rapid communications in mass spectrometry : RCM.

[54]  Jonas Nilsson,et al.  Liquid chromatography-tandem mass spectrometry-based fragmentation analysis of glycopeptides , 2016, Glycoconjugate Journal.

[55]  Jan Haug Anonsen,et al.  Structural and genetic analyses of glycan O-acetylation in a bacterial protein glycosylation system: evidence for differential effects on glycan chain length , 2017, Glycobiology.

[56]  Helen J Cooper,et al.  Higher energy collision dissociation (HCD) product ion-triggered electron transfer dissociation (ETD) mass spectrometry for the analysis of N-linked glycoproteins. , 2012, Journal of proteome research.

[57]  Veit Schwämmle,et al.  A Novel Method for the Simultaneous Enrichment, Identification, and Quantification of Phosphopeptides and Sialylated Glycopeptides Applied to a Temporal Profile of Mouse Brain Development* , 2012, Molecular & Cellular Proteomics.

[58]  Pauline M Rudd,et al.  A systematic approach to protein glycosylation analysis: a path through the maze. , 2010, Nature chemical biology.

[59]  Jin Young Kim,et al.  Integrated GlycoProteome Analyzer (I-GPA) for Automated Identification and Quantitation of Site-Specific N-Glycosylation , 2016, Scientific Reports.

[60]  Tsung-Hsien Pu,et al.  Novel LC-MS² product dependent parallel data acquisition function and data analysis workflow for sequencing and identification of intact glycopeptides. , 2014, Analytical chemistry.

[61]  David Hua,et al.  GlycoPep grader: a web-based utility for assigning the composition of N-linked glycopeptides. , 2012, Analytical chemistry.

[62]  J. Diedrich,et al.  Energy Dependence of HCD on Peptide Fragmentation: Stepped Collisional Energy Finds the Sweet Spot , 2013, Journal of The American Society for Mass Spectrometry.

[63]  William F. Martin,et al.  Automated glycopeptide analysis - review of current state and future directions , 2013, Briefings Bioinform..

[64]  Joakim Dillner,et al.  Cohort Profile: The Janus Serum Bank Cohort in Norway. , 2016, International journal of epidemiology.