Complexities and Algorithms for Glycan Structure Sequencing using Tandem Mass Spectrometry

Determining glycan structures is vital to understanding cell-matrix, cell-cell, and even intracellular biological events. Glycan structure sequencing, the determination of a glycan's primary structure from tandem mass spectrometry (MS/MS) data, remains one of the most important tasks in proteomics. Analogous to peptide de novo sequencing, glycan de novo sequencing determines the structure without the aid of a known glycan database. We show in this paper that glycan de novo sequencing is NP-hard. We then provide a heuristic algorithm and develop a software program to solve the problem in practical cases. Experiments on real MS/MS data of glycopeptides demonstrate that the heuristic algorithm gives satisfactory results.
