Large-scale identification and visualization of N-glycans with primary structures using GlySeeker.

RATIONALE Most of the current popular tandem mass spectrometers have the capability of resolving the primary structures (monosaccharide composition, sequence and linkage) of N-glycans; however, compositions or putative structures have mostly been reported so far. Identification and visualization tools of N-glycans are needed. METHODS The isotopic mass-to-charge ratio and envelope fingerprinting algorithm, which has been successfully used for intact protein database search and identification, was adapted for N-glycan database search, and a stand-alone N-glycan database search engine, GlySeeker, for automated N-glycan identification and visualization was developed and successfully benchmarked. Both pseudo 2D graph and one-line text formats with one-letter symbols for monosaccharides were proposed for representing N-glycans. N-glycans were identified with comprehensive interpretation of product ions and false discovery rate (FDR) control. RESULTS In a database search of reversed-phase liquid chromatography/tandem mass spectrometry (RPLC/MS/MS) datasets of the N-glycome enriched from OVCAR-3 ovarian cancer cells, with FDR ≤1% and number of best hits (NoBHs) = 1-30, 1525 N-glycans with comprehensive primary structural information (composition, sequence and linkage) were identified and visualized; among these 1525 N-glycans, 559 had NoBHs = 1, i.e. their structures were uniquely identified. This represents a large-scale identification and visualization of N-glycans with primary structures from tandem mass spectra. CONCLUSIONS A stand-alone N-glycan database search engine called GlySeeker has been developed for large-scale identification and visualization of N-glycans with comprehensive interpretation of tandem mass spectra and FDR control.

[1]  Scott R. Kronewitter,et al.  The glycolyzer: Automated glycan annotation software for high performance mass spectrometry and its application to ovarian cancer glycan biomarker discovery , 2012, Proteomics.

[2]  Hui Zhang,et al.  Simultaneous analyses of N-linked and O-linked glycans of ovarian cancer cells using solid-phase chemoenzymatic method , 2017, Clinical Proteomics.

[3]  J. Leary,et al.  STAT: a saccharide topology analysis tool used in combination with tandem mass spectrometry. , 2000, Analytical chemistry.

[4]  E. Go,et al.  GlycoPep DB: a tool for glycopeptide analysis using a "Smart Search". , 2007, Analytical chemistry.

[5]  J. Marth,et al.  Glycosylation in Cellular Mechanisms of Health and Disease , 2006, Cell.

[6]  Niclas G Karlsson,et al.  Development of a mass fingerprinting tool for automated interpretation of oligosaccharide fragmentation data , 2004, Proteomics.

[7]  Evan Bolton,et al.  Symbol Nomenclature for Graphical Representations of Glycans. , 2015, Glycobiology.

[8]  Kaijie Xiao,et al.  Intact Protein Quantitation Using Pseudoisobaric Dimethyl Labeling. , 2016, Analytical chemistry.

[9]  Cui Hao,et al.  Serum glycoprotein-derived N- and O-linked glycans as cancer biomarkers. , 2016, American journal of cancer research.

[10]  Hao Chi,et al.  pGlyco: a pipeline for the identification of intact N-glycopeptides by using HCD- and CID-MS/MS and MS3 , 2016, Scientific Reports.

[11]  Louise Royle,et al.  Proposal for a standard system for drawing structural diagrams of N‐ and O‐linked carbohydrates and related compounds , 2009, Proteomics.

[12]  Radoslav Goldman,et al.  Semi-automated identification of N-Glycopeptides by hydrophilic interaction chromatography, nano-reverse-phase LC-MS/MS, and glycan database search. , 2012, Journal of proteome research.

[13]  Haixu Tang,et al.  Mapping site-specific protein N-glycosylations through liquid chromatography/mass spectrometry and targeted tandem mass spectrometry. , 2010, Rapid communications in mass spectrometry : RCM.

[14]  Richard D Cummings,et al.  Symbol nomenclature for glycan representation , 2009, Proteomics.

[15]  László Drahos,et al.  GlycoMiner: a new software tool to elucidate glycopeptide composition. , 2008, Rapid communications in mass spectrometry : RCM.

[16]  I. Wilson,et al.  Composition of N-linked carbohydrates from ovalbumin and co-purified glycoproteins , 2000, Journal of the American Society for Mass Spectrometry.

[17]  C. Lebrilla,et al.  Glycans and glycoproteins as specific biomarkers for cancer , 2016, Analytical and Bioanalytical Chemistry.

[18]  James Paulson,et al.  Automatic annotation of matrix‐assisted laser desorption/ionization N‐glycan spectra , 2005, Proteomics.

[19]  René Ranzinger,et al.  “Glyco‐peakfinder” – de novo composition analysis of glycoconjugates , 2007, Proteomics.

[20]  Gary Benson,et al.  GlycReSoft: A Software Package for Automated Recognition of Glycans from LC/MS Data , 2012, PloS one.

[21]  Z. Tian,et al.  Interpreting raw biological mass spectra using isotopic mass-to-charge ratio and envelope fingerprinting. , 2013, Rapid communications in mass spectrometry : RCM.

[22]  S. Kornfeld,et al.  Assembly of asparagine-linked oligosaccharides. , 1985, Annual review of biochemistry.

[23]  P. Delannoy,et al.  Role of Cytokine-Induced Glycosylation Changes in Regulating Cell Interactions and Cell Signaling in Inflammatory Diseases and Cancer , 2016, Cells.

[24]  Claus-Wilhelm von der Lieth,et al.  GlycoFragment and GlycoSearchMS: web tools to support the interpretation of mass spectra of complex carbohydrates , 2004, Nucleic Acids Res..

[25]  K. Lilley,et al.  Differentiation of isomeric N-glycan structures by normal-phase liquid chromatography-MALDI-TOF/TOF tandem mass spectrometry. , 2006, Analytical chemistry.

[26]  P. Conroy,et al.  Aberrant PSA glycosylation—a sweet predictor of prostate cancer , 2013, Nature Reviews Urology.

[27]  D. Ashline,et al.  Congruent strategies for carbohydrate sequencing. 3. OSCAR: an algorithm for assigning oligosaccharide topology from MSn data. , 2005, Analytical chemistry.

[28]  Haixu Tang,et al.  Automated interpretation of MS/MS spectra of oligosaccharides , 2005, ISMB.

[29]  Paul Aiyetan,et al.  Comprehensive analysis of protein glycosylation by solid-phase extraction of N-linked glycans and glycosite-containing peptides , 2016, Nature Biotechnology.

[30]  B. Domon,et al.  A systematic nomenclature for carbohydrate fragmentations in FAB-MS/MS spectra of glycoconjugates , 1988, Glycoconjugate Journal.

[31]  Alessio Ceroni,et al.  The GlycanBuilder: a fast, intuitive and flexible software tool for building and displaying glycan structures , 2007, Source Code for Biology and Medicine.

[32]  J. Marth,et al.  A genetic approach to Mammalian glycan function. , 2003, Annual review of biochemistry.

[33]  B. Ma,et al.  GlycoMaster DB: software to assist the automated identification of N-linked glycopeptides by tandem mass spectrometry. , 2014, Journal of proteome research.

[34]  P. Seeberger,et al.  Unlocking Cancer Glycomes from Histopathological Formalin-fixed and Paraffin-embedded (FFPE) Tissue Microdissections * , 2017, Molecular & Cellular Proteomics.

[35]  Tatsuya Akutsu,et al.  KCaM (KEGG Carbohydrate Matcher): a software tool for analyzing the structures of carbohydrate sugar chains , 2004, Nucleic Acids Res..

[36]  S. Pinho,et al.  Glycosylation in cancer: mechanisms and clinical implications , 2015, Nature Reviews Cancer.

[37]  Fan Yu,et al.  Top-down protein identification using isotopic envelope fingerprinting. , 2017, Journal of proteomics.

[38]  David Hua,et al.  GlycoPep grader: a web-based utility for assigning the composition of N-linked glycopeptides. , 2012, Analytical chemistry.

[39]  Catherine A. Cooper,et al.  GlycoMod – A software tool for determining glycosylation compositions from mass spectrometric data , 2001, Proteomics.

[40]  C. Freire-de-Lima,et al.  The Sweet Side of Immune Evasion: Role of Glycans in the Mechanisms of Cancer Progression , 2016, Front. Oncol..

[41]  Hélène Perreault,et al.  Automated structural assignment of derivatized complex N-linked oligosaccharides from tandem mass spectra. , 2002, Rapid communications in mass spectrometry : RCM.

[42]  J. Peter-Katalinic,et al.  Software platform for high-throughput glycomics. , 2009, Analytical chemistry.

[43]  Scott R. Kronewitter,et al.  The development of retrosynthetic glycan libraries to profile and classify the human serum N‐linked glycome , 2009, Proteomics.

[44]  Daniel Kolarich,et al.  GlycoSpectrumScan: fishing glycopeptides from MS spectra of protease digests of human colostrum sIgA. , 2010, Journal of proteome research.

[45]  Fan Yu,et al.  Accurate and Efficient Resolution of Overlapping Isotopic Envelopes in Protein Tandem Mass Spectra , 2015, Scientific Reports.