An N‐glycopeptide MS/MS data analysis workflow leveraging two complementary glycoproteomic software tools for more confident identification and assignments

Complete coverage of all N-glycosylation sites on the SARS-CoV2 spike protein would require the use of multiple proteases in addition to trypsin. Subsequent identification of the resulting glycopeptides by searching against database often introduces assignment errors due to similar mass differences between different permutations of amino acids and glycosyl residues. By manually interpreting the individual MS2 spectra, we report here the common sources of errors in assignment, especially those introduced by the use of chymotrypsin. We show that by applying a stringent threshold of acceptance, erroneous assignment by the commonly used Byonic software can be controlled within 15%, which can be reduced further if only those also confidently identified by a different search engine, pGlyco3, were considered. A representative site-specific N-glycosylation pattern could be constructed based on quantifying only the overlapping subset of N-glycopeptides identified at higher confidence. Applying the two complimentary glycoproteomic software in a concerted data analysis workflow, we found and confirmed that glycosylation at several sites of an unstable Omicron spike protein differed significantly from those of the stable trimeric product of the parental D614G variant.

[1]  M. Thaysen‐Andersen,et al.  Critical considerations in N-glycoproteomics. , 2023, Current opinion in chemical biology.

[2]  Daniel A. Polasky,et al.  Recent advances in computational algorithms and software for large-scale glycoproteomics. , 2022, Current opinion in chemical biology.

[3]  E. Fadda,et al.  Variations within the Glycan Shield of SARS-CoV-2 Impact Viral Spike Dynamics , 2022, bioRxiv.

[4]  M. Sanda,et al.  Site‐specific glycosylation of SARS‐CoV‐2: Big challenges in mass spectrometry analysis , 2022, Proteomics.

[5]  E. Fadda,et al.  Principles of SARS-CoV-2 glycosylation , 2022, Current Opinion in Structural Biology.

[6]  S. Subramaniam,et al.  SARS-CoV-2 Omicron variant: Antibody evasion and cryo-EM structure of spike protein–ACE2 complex , 2022, Science.

[7]  Si-Min He,et al.  Precise, fast and comprehensive analysis of intact glycopeptides and modified glycans with pGlyco3 , 2021, Nature Methods.

[8]  Nichollas E. Scott,et al.  Community evaluation of glycoproteomics informatics solutions reveals high-performance search strategies for serum glycopeptide analysis , 2021, Nature Methods.

[9]  S. Hsu,et al.  D614G mutation in the SARS-CoV-2 spike protein enhances viral fitness by desensitizing it to temperature-dependent denaturation , 2021, Journal of Biological Chemistry.

[10]  S. Hsu,et al.  Effect of SARS-CoV-2 B.1.1.7 mutations on spike protein structure and function , 2021, Nature Structural & Molecular Biology.

[11]  Chu-Wei Kuo,et al.  Distinct shifts in site-specific glycosylation pattern of SARS-CoV-2 spike proteins associated with arising mutations in the D614G and Alpha variants , 2021, bioRxiv.

[12]  B. Haynes,et al.  Effect of natural mutations of SARS-CoV-2 on spike structure, conformation, and antigenicity , 2021, Science.

[13]  R. Dwek,et al.  Assessing Antigen Structural Integrity through Glycosylation Analysis of the SARS-CoV-2 Viral Spike , 2021, ACS central science.

[14]  Jeremy L. Praissman,et al.  Virus-Receptor Interactions of Glycosylated SARS-CoV-2 Spike and Human ACE2 Receptor , 2020, Cell Host and Microbe.

[15]  G. Ippolito,et al.  Structure-based design of prefusion-stabilized SARS-CoV-2 spikes , 2020, Science.

[16]  Lindsay K. Pino,et al.  The Skyline ecosystem: Informatics for quantitative mass spectrometry proteomics. , 2020, Mass spectrometry reviews.

[17]  Tsung-Hsien Pu,et al.  Novel LC-MS² product dependent parallel data acquisition function and data analysis workflow for sequencing and identification of intact glycopeptides. , 2014, Analytical chemistry.