GlycoSLASH: Concurrent Glycopeptide Identification from Multiple Related LC-MS/MS Data Sets by Using Spectral Clustering and Library Searching.

Liquid chromatography coupled with tandem mass spectrometry (LC-MS/MS) is commonly adopted in large-scale glycoproteomic studies involving hundreds of disease and control samples. Existing software for glycopeptide identification in such data (e.g., the commercial software Byonic) analyzes each data set individually and does not exploit the redundant glycopeptide spectra present across related data sets. Herein, we present a novel concurrent approach for glycopeptide identification in multiple related glycoproteomic data sets by using spectral clustering and spectral library searching. An evaluation on two large-scale glycoproteomic data sets showed that the concurrent approach identifies 105%-224% more spectra as glycopeptides than Byonic alone applied to the individual data sets. The improved glycopeptide identification also enabled the discovery of several potential protein glycosylation biomarkers in hepatocellular carcinoma patients.
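
The idea of reusing redundant spectra across related data sets can be illustrated with a minimal sketch. This is not the GlycoSLASH implementation: greedy cosine clustering on binned peaks stands in for the spectral clustering step, and the function names (`binned`, `cluster`, `propagate`), the thresholds, the spectra, and the glycopeptide label are all hypothetical toy values chosen for illustration.

```python
import math
from collections import defaultdict


def binned(peaks, bin_width=1.0):
    """Turn (m/z, intensity) peaks into a unit-length sparse vector keyed by m/z bin."""
    vec = defaultdict(float)
    for mz, intensity in peaks:
        vec[int(mz / bin_width)] += intensity
    norm = math.sqrt(sum(v * v for v in vec.values()))
    return {k: v / norm for k, v in vec.items()} if norm else {}


def cosine(a, b):
    """Cosine similarity of two unit-length sparse vectors."""
    return sum(v * b.get(k, 0.0) for k, v in a.items())


def cluster(spectra, threshold=0.8, mz_tol=0.05):
    """Greedy clustering: a spectrum joins the first cluster whose representative
    has a close precursor m/z and a cosine similarity of at least `threshold`."""
    clusters = []
    for s in spectra:
        vec = binned(s["peaks"])
        for c in clusters:
            close = abs(c["precursor_mz"] - s["precursor_mz"]) <= mz_tol
            if close and cosine(vec, c["rep"]) >= threshold:
                c["members"].append(s["id"])
                break
        else:
            clusters.append(
                {"rep": vec, "precursor_mz": s["precursor_mz"], "members": [s["id"]]}
            )
    return clusters


def propagate(clusters, identified):
    """Copy an identification to unidentified cluster members, but only when
    all identified members of that cluster agree on a single label."""
    result = dict(identified)
    for c in clusters:
        labels = {identified[m] for m in c["members"] if m in identified}
        if len(labels) == 1:
            label = next(iter(labels))
            for m in c["members"]:
                result.setdefault(m, label)
    return result


# Toy spectra from two hypothetical runs; only runA:1 was identified by the search engine.
spectra = [
    {"id": "runA:1", "precursor_mz": 1000.50,
     "peaks": [(204.09, 100.0), (366.14, 80.0), (528.19, 30.0)]},
    {"id": "runB:7", "precursor_mz": 1000.51,
     "peaks": [(204.08, 90.0), (366.15, 85.0), (528.20, 25.0)]},
    {"id": "runB:9", "precursor_mz": 850.40,
     "peaks": [(175.12, 60.0), (300.16, 100.0)]},
]
identified = {"runA:1": "toy-glycopeptide-1"}  # hypothetical label

clusters = cluster(spectra)
result = propagate(clusters, identified)
print(result)
```

In this toy input, the unidentified spectrum `runB:7` falls into the same cluster as `runA:1` and inherits its label, while `runB:9` stays unidentified. A realistic pipeline would also need a scalable clustering method, a library search step, and false discovery rate control, none of which this sketch attempts.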
