GPQuest: A Spectral Library Matching Algorithm for Site-Specific Assignment of Tandem Mass Spectra to Intact N-glycopeptides.

Glycoprotein changes occur in not only protein abundance but also the occupancy of each glycosylation site by different glycoforms during biological or pathological processes. Recent advances in mass spectrometry instrumentation and techniques have facilitated analysis of intact glycopeptides in complex biological samples by allowing the users to generate spectra of intact glycopeptides with glycans attached to each specific glycosylation site. However, assigning these spectra, leading to identification of the glycopeptides, is challenging. Here, we report an algorithm, named GPQuest, for site-specific identification of intact glycopeptides using higher-energy collisional dissociation (HCD) fragmentation of complex samples. In this algorithm, a spectral library of glycosite-containing peptides in the sample was built by analyzing the isolated glycosite-containing peptides using HCD LC-MS/MS. Spectra of intact glycopeptides were selected by using glycan oxonium ions as signature ions for glycopeptide spectra. These oxonium-ion-containing spectra were then compared with the spectral library generated from glycosite-containing peptides, resulting in assignment of each intact glycopeptide MS/MS spectrum to a specific glycosite-containing peptide. The glycan occupying each glycosite was determined by matching the mass difference between the precursor ion of intact glycopeptide and the glycosite-containing peptide to a glycan database. Using GPQuest, we analyzed LC-MS/MS spectra of protein extracts from prostate tumor LNCaP cells. Without enrichment of glycopeptides from global tryptic peptides and at a false discovery rate of 1%, 1008 glycan-containing MS/MS spectra were assigned to 769 unique intact N-linked glycopeptides, representing 344 N-linked glycosites with 57 different N-glycans. Spectral library matching using GPQuest assigns the HCD LC-MS/MS generated spectra of intact glycopeptides in an automated and high-throughput manner. Additionally, spectral library matching gives the user the possibility of identifying novel or modified glycans on specific glycosites that might be missing from the predetermined glycan databases.

[1]  Serenus Hua,et al.  Automated assignments of N- and O-site specific glycosylation with extensive glycan heterogeneity of glycoprotein mixtures. , 2013, Analytical chemistry.

[2]  Yuan Tian,et al.  Solid-phase extraction of N-linked glycopeptides , 2007, Nature Protocols.

[3]  Naoyuki Taniguchi,et al.  Functional roles of N‐glycans in cell signaling and cell adhesion in cancer , 2008, Cancer science.

[4]  Kiyoko F. Aoki-Kinoshita,et al.  UniCarbKB: building a knowledge platform for glycoproteomics , 2013, Nucleic Acids Res..

[5]  G. Turner N-glycosylation of serum proteins in disease and its investigation using lectins. , 1992, Clinica chimica acta; international journal of clinical chemistry.

[6]  Haixu Tang,et al.  Improving confidence in detection and characterization of protein N-glycosylation sites and microheterogeneity. , 2011, Rapid communications in mass spectrometry : RCM.

[7]  Y. Mechref,et al.  Characterizing protein glycosylation sites through higher-energy C-trap dissociation. , 2010, Rapid communications in mass spectrometry : RCM.

[8]  Hui Zhang,et al.  Mass spectrometric analysis of sialylated glycans with use of solid-phase labeling of sialic acids. , 2013, Analytical chemistry.

[9]  Helen J Cooper,et al.  Higher energy collision dissociation (HCD) product ion-triggered electron transfer dissociation (ETD) mass spectrometry for the analysis of N-linked glycoproteins. , 2012, Journal of proteome research.

[10]  H. Freeze,et al.  Glycosylation diseases: quo vadis? , 2009, Biochimica et biophysica acta.

[11]  Alessio Ceroni,et al.  GlycoWorkbench: a tool for the computer-assisted annotation of mass spectra of glycans. , 2008, Journal of proteome research.

[12]  E. Go,et al.  Glycosylation site-specific analysis of clade C HIV-1 envelope proteins. , 2009, Journal of proteome research.

[13]  Radoslav Goldman,et al.  Exploring site-specific N-glycosylation microheterogeneity of haptoglobin using glycopeptide CID tandem mass spectra and glycan database search. , 2013, Journal of proteome research.

[14]  Gerald W. Hart,et al.  Glycomics Hits the Big Time , 2010, Cell.

[15]  M. Raftery,et al.  Detection and Characterization of Low Abundance Glycopeptides Via Higher-Energy C-Trap Dissociation and Orbitrap Mass Analysis , 2011, Journal of The American Society for Mass Spectrometry.

[16]  Yehia Mechref,et al.  Use of CID/ETD Mass Spectrometry to Analyze Glycopeptides , 2012, Current protocols in protein science.

[17]  J. Ravetch,et al.  Anti-Inflammatory Activity of Immunoglobulin G Resulting from Fc Sialylation , 2006, Science.

[18]  Catherine A. Cooper,et al.  GlycoMod – A software tool for determining glycosylation compositions from mass spectrometric data , 2001, Proteomics.

[19]  V. Reinhold,et al.  The N-glycome of human plasma. , 2010, Journal of proteome research.

[20]  Angela M Zivkovic,et al.  Simultaneous and extensive site-specific N- and O-glycosylation analysis in protein mixtures. , 2011, Journal of proteome research.

[21]  J. Marth,et al.  Glycosylation in Cellular Mechanisms of Health and Disease , 2006, Cell.

[22]  Nichollas E. Scott,et al.  Site-specific glycan-peptide analysis for determination of N-glycoproteome heterogeneity. , 2013, Journal of proteome research.

[23]  Joseph Zaia,et al.  Mass spectrometry and the emerging field of glycomics. , 2008, Chemistry & biology.

[24]  John S. Strum,et al.  Site-specific protein glycosylation analysis with glycan isomer differentiation , 2012, Analytical and Bioanalytical Chemistry.

[25]  R. Clouse,et al.  HETEROGENEITY OF THE CARBOHYDRATE MOIETY OF CRYSTALLINE OVALBUMIN. , 1963, Biochimica et biophysica acta.

[26]  D. Harvey,et al.  Derivatization of sialic acids for stabilization in matrix-assisted laser desorption/ionization mass spectrometry and concomitant differentiation of alpha(2 --> 3)- and alpha(2 --> 6)-isomers. , 2009, Rapid communications in mass spectrometry : RCM.

[27]  David J Harvey,et al.  Proteomic analysis of glycosylation: structural determination of N- and O-linked glycans by mass spectrometry , 2005, Expert review of proteomics.

[28]  Hui Zhang,et al.  Glycoform Analysis of Recombinant and Human Immunodeficiency Virus Envelope Protein gp120 via Higher Energy Collisional Dissociation and Spectral-Aligning Strategy , 2014, Analytical chemistry.

[29]  C. Lebrilla,et al.  Annotation of a serum N-glycan library for rapid identification of structures. , 2012, Journal of proteome research.

[30]  B. Ma,et al.  GlycoMaster DB: software to assist the automated identification of N-linked glycopeptides by tandem mass spectrometry. , 2014, Journal of proteome research.

[31]  A. Varki Sialic acids in human health and disease. , 2008, Trends in molecular medicine.

[32]  N Seta,et al.  Protein glycosylation and diseases: blood and urinary oligosaccharides as markers for diagnosis and therapeutic monitoring. , 2000, Clinical chemistry.

[33]  Yan Li,et al.  Glycomic analysis using glycoprotein immobilization for glycan extraction. , 2013, Analytical chemistry.

[34]  André M Deelder,et al.  Glycoproteomics based on tandem mass spectrometry of glycopeptides. , 2007, Journal of chromatography. B, Analytical technologies in the biomedical and life sciences.

[35]  Guillaume Vogt,et al.  Gains of glycosylation comprise an unexpectedly large group of pathogenic mutations , 2005, Nature Genetics.

[36]  D. Harvey,et al.  Matrix-assisted laser desorption/ionization mass spectrometry of carbohydrates. , 1999, Mass spectrometry reviews.