MAGIC: an automated N-linked glycoprotein identification tool using a Y1-ion pattern matching algorithm and in silico MS² approach.

Glycosylation is a highly complex modification that influences the functions and activities of proteins. Interpreting intact glycopeptide spectra is crucial but challenging. In this paper, we present a mass spectrometry-based automated glycopeptide identification platform (MAGIC) that identifies peptide sequences and glycan compositions directly from collision-induced dissociation spectra of intact N-linked glycopeptides. Identifying the Y1 ion (the peptide ion Y0 plus one GlcNAc) is critical for the correct analysis of unknown glycoproteins, especially without prior knowledge of the proteins and glycans present in the sample. To ensure accurate Y1-ion assignment, we propose a novel algorithm called Trident that detects a triplet pattern corresponding to [Y0, Y1, Y2] or [Y0-NH3, Y0, Y1], which arises from fragmentation of the common trimannosyl core of N-linked glycopeptides. To facilitate subsequent peptide sequence identification by common database search engines, MAGIC generates in silico spectra by overwriting the original precursor m/z with the naked peptide m/z and removing all glycan-related ions. Finally, MAGIC computes the glycan compositions and ranks them. For the model glycoprotein horseradish peroxidase (HRP) and a 5-glycoprotein mixture, the relative intensities of the peptide fragments increased 2- to 31-fold, which led to the identification of 7 tryptic glycopeptides from HRP and 16 glycopeptides from the mixture via Mascot. In the HeLa cell proteome data set, MAGIC processed over a thousand MS² spectra in 3 min on a PC and reported 36 glycopeptides from 26 glycoproteins. Finally, a false discovery rate of 0 was achieved on the N-glycosylation-free Escherichia coli data set. MAGIC is available at http://ms.iis.sinica.edu.tw/COmics/Software_MAGIC.html .
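The triplet search and the precursor overwrite can be illustrated with a minimal sketch. This is not the Trident implementation. The function names, the fixed m/z tolerance, the single assumed fragment charge, and the summed-intensity score are all hypothetical choices made for illustration. The only physical inputs are the monoisotopic masses of a GlcNAc residue, NH3, and a proton.

```python
from bisect import bisect_left

HEXNAC = 203.07937   # monoisotopic residue mass of GlcNAc (HexNAc), Da
NH3 = 17.02655       # monoisotopic mass of ammonia, Da
PROTON = 1.007276    # mass of a proton, Da


def find_peak(mzs, target, tol):
    """Return the index of the peak closest to target within tol, else None.

    mzs must be sorted in ascending order.
    """
    i = bisect_left(mzs, target)
    best = None
    for j in (i - 1, i):
        if 0 <= j < len(mzs) and abs(mzs[j] - target) <= tol:
            if best is None or abs(mzs[j] - target) < abs(mzs[best] - target):
                best = j
    return best


def find_y1_candidates(peaks, charge=1, tol=0.02):
    """Scan (m/z, intensity) peaks for [Y0, Y1, Y2] or [Y0-NH3, Y0, Y1] triplets.

    Returns (score, pattern, y0_mz, y1_mz) tuples, best score first.
    The score is the summed intensity of the triplet (an assumption made
    here, not the scoring used by MAGIC).
    """
    peaks = sorted(peaks)
    mzs = [mz for mz, _ in peaks]
    d_hexnac = HEXNAC / charge
    d_nh3 = NH3 / charge
    candidates = []
    for y0_mz, y0_int in peaks:
        # Both patterns require a Y1 peak one GlcNAc above the Y0 candidate.
        y1 = find_peak(mzs, y0_mz + d_hexnac, tol)
        if y1 is None:
            continue
        y2 = find_peak(mzs, y0_mz + 2 * d_hexnac, tol)
        y0_nh3 = find_peak(mzs, y0_mz - d_nh3, tol)
        for pattern, third in (("Y0,Y1,Y2", y2), ("Y0-NH3,Y0,Y1", y0_nh3)):
            if third is not None:
                score = y0_int + peaks[y1][1] + peaks[third][1]
                candidates.append((score, pattern, y0_mz, mzs[y1]))
    candidates.sort(reverse=True)
    return candidates


def naked_peptide_mz(y1_mz, y1_charge, target_charge):
    """Convert a Y1 m/z into the m/z of the peptide without any glycan.

    target_charge is the charge to write into the in silico precursor;
    which charge to use is an assumption left to the caller.
    """
    neutral = (y1_mz - PROTON) * y1_charge - HEXNAC
    return neutral / target_charge + PROTON
```

In this sketch, the m/z returned by `naked_peptide_mz` would replace the precursor m/z in the spectrum handed to the database search engine. Removing glycan-related ions is a separate filtering step that is not shown.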


[31]  Rong Zeng,et al.  Fast and accurate identification of semi-tryptic peptides in shotgun proteomics , 2008, Bioinform..

[32]  Yet-Ran Chen,et al.  A Multiplexed Quantitative Strategy for Membrane Proteomics , 2008, Molecular & Cellular Proteomics.

[33]  László Drahos,et al.  GlycoMiner: a new software tool to elucidate glycopeptide composition. , 2008, Rapid communications in mass spectrometry : RCM.

[34]  P. Højrup,et al.  Site-specific glycoprofiling of N-linked glycopeptides using MALDI-TOF MS: strong correlation between signal strength and glycoform quantities. , 2009, Analytical chemistry.

[35]  Susan J Fisher,et al.  Sweetening the pot: adding glycosylation to the biomarker discovery equation. , 2010, Clinical chemistry.

[36]  Aneeka M Hancock,et al.  Glycoproteomics in neurodegenerative diseases. , 2010, Mass spectrometry reviews.

[37]  Haixu Tang,et al.  Mapping site-specific protein N-glycosylations through liquid chromatography/mass spectrometry and targeted tandem mass spectrometry. , 2010, Rapid communications in mass spectrometry : RCM.

[38]  Daniel Kolarich,et al.  GlycoSpectrumScan: fishing glycopeptides from MS spectra of protease digests of human colostrum sIgA. , 2010, Journal of proteome research.

[39]  O. Mayboroda,et al.  Mass Spectrometric Identification of Aberrantly Glycosylated Human Apolipoprotein C-III Peptides in Urine from Schistosoma mansoni-infected Individuals* , 2010, Molecular & Cellular Proteomics.

[40]  Y. Mechref,et al.  Characterizing protein glycosylation sites through higher-energy C-trap dissociation. , 2010, Rapid communications in mass spectrometry : RCM.

[41]  Birgit Schilling,et al.  ScanRanker: Quality assessment of tandem mass spectra via sequence tagging. , 2011, Journal of proteome research.

[42]  I. Tsai,et al.  Terminal disialylated multiantennary complex-type N-glycans carried on acutobin define the glycosylation characteristics of the Deinagkistrodon acutus venom. , 2011, Glycobiology.

[43]  Lance Wells,et al.  Combining high-energy C-trap dissociation and electron transfer dissociation for protein O-GlcNAc modification site assignment. , 2011, Journal of proteome research.

[44]  D. Chan,et al.  Aberrant glycosylation associated with enzymes as cancer biomarkers , 2011, Clinical Proteomics.

[45]  Angela M Zivkovic,et al.  Simultaneous and extensive site-specific N- and O-glycosylation analysis in protein mixtures. , 2011, Journal of proteome research.

[46]  I. Lazar,et al.  Recent advances in the MS analysis of glycoproteins: Theoretical considerations , 2011, Electrophoresis.

[47]  L. Tang,et al.  Comprehensive characterization of the N-glycosylation status of CD44s by use of multiple mass spectrometry-based techniques , 2012, Analytical and Bioanalytical Chemistry.

[48]  Radoslav Goldman,et al.  Semi-automated identification of N-Glycopeptides by hydrophilic interaction chromatography, nano-reverse-phase LC-MS/MS, and glycan database search. , 2012, Journal of proteome research.

[49]  T. Hennet Diseases of glycosylation beyond classical congenital disorders of glycosylation. , 2012, Biochimica et biophysica acta.

[50]  H. Freeze,et al.  Neurology of inherited glycosylation disorders , 2012, The Lancet Neurology.

[51]  J. Rose,et al.  The Secreted Plant N-Glycoproteome and Associated Secretory Pathways , 2012, Front. Plant Sci..

[52]  David Hua,et al.  GlycoPep grader: a web-based utility for assigning the composition of N-linked glycopeptides. , 2012, Analytical chemistry.

[53]  William F. Martin,et al.  Automated glycopeptide analysis - review of current state and future directions , 2013, Briefings Bioinform..

[54]  Michael J. Sweredoski,et al.  Comprehensive profiling of N-linked glycosylation sites in HeLa cells using hydrazide enrichment. , 2013, Journal of proteome research.

[55]  Suh-Yuen Liang,et al.  Sweet-Heart - an integrated suite of enabling computational tools for automated MS2/MS3 sequencing and identification of glycopeptides. , 2013, Journal of proteomics.

[56]  Serenus Hua,et al.  Automated assignments of N- and O-site specific glycosylation with extensive glycan heterogeneity of glycoprotein mixtures. , 2013, Analytical chemistry.

[57]  Milos V. Novotny,et al.  High-sensitivity analytical approaches for the structural characterization of glycoproteins. , 2013, Chemical reviews.

[58]  Heather Desaire,et al.  Software for automated interpretation of mass spectrometry data from glycans and glycopeptides. , 2013, The Analyst.

[59]  Jonathan C Trinidad,et al.  N- and O-Glycosylation in the Murine Synaptosome* , 2013, Molecular & Cellular Proteomics.

[60]  Feng Li,et al.  Glycobioinformatics: Current strategies and tools for data mining in MS‐based glycoproteomics , 2013, Proteomics.

[61]  Kiyoko F. Aoki-Kinoshita,et al.  UniCarbKB: building a knowledge platform for glycoproteomics , 2013, Nucleic Acids Res..

[62]  María Martín,et al.  Activities at the Universal Protein Resource (UniProt) , 2013, Nucleic Acids Res..

[63]  B. Ma,et al.  GlycoMaster DB: software to assist the automated identification of N-linked glycopeptides by tandem mass spectrometry. , 2014, Journal of proteome research.