A global representation of the carbohydrate structures: a tool for the analysis of glycan.

Glycan resources have been developed of late, such as carbohydrate databases, analysis tools, and algorithms for analysis of carbohydrate features. With this background, bioinformatics approaches to carbohydrate research have recently begun using a large amount of protein and carbohydrate data. This paper introduces one of these projects that elucidates the range of carbohydrate structures. In this study, the variety of carbohydrate structures have been enumerated in a global tree structure called variation trees, using the KEGG GLYCAN database, which is a public-domain glycan resource for bioinformatics analysis. Additionally, a glycosyltransferase mapping list of glycosyltransferases and their catalyzing glycosidic linkages was constructed. From this, we present the composite structure map (CSM), which is a structural variation map integrating its variation trees and glycosyltransferase map list. CSM is able to display, for example, expression data of glycosyltransferases in a compact manner, illustrating its versatility as a new bioinformatics resource and tool capable of analyzing carbohydrate structures on a global scale. These resources are available at http://www.genome.jp/kegg/glycan/.

[1]  Hisashi Narimatsu,et al.  Construction of a human glycogene library and comprehensive functional analysis , 2004, Glycoconjugate Journal.

[2]  R. Dwek,et al.  Glycosylation and the immune system. , 2001, Science.

[3]  Xinhua Lin,et al.  Functions of heparan sulfate proteoglycans in cell signaling during development , 2004, Development.

[4]  Susumu Goto,et al.  The KEGG resource for deciphering the genome , 2004, Nucleic Acids Res..

[5]  Martin Frank,et al.  Bioinformatics for glycomics: Status, methods, requirements and perspectives , 2004, Briefings Bioinform..

[6]  A Helenius,et al.  How N-linked oligosaccharides affect glycoprotein folding in the endoplasmic reticulum. , 1994, Molecular biology of the cell.

[7]  Raymond A Dwek,et al.  Statistical analysis of the protein environment of N-glycosylation sites: implications for occupancy, structure, and folding. , 2003, Glycobiology.

[8]  A. Helenius,et al.  Intracellular functions of N-linked glycans. , 2001, Science.

[9]  M. Domowicz,et al.  Proteoglycans in brain development , 2004, Glycoconjugate Journal.

[10]  Tatsuya Akutsu,et al.  Efficient tree-matching methods for accurate carbohydrate database queries. , 2003, Genome informatics. International Conference on Genome Informatics.

[11]  Tatsuya Akutsu,et al.  KCaM (KEGG Carbohydrate Matcher): a software tool for analyzing the structures of carbohydrate sugar chains , 2004, Nucleic Acids Res..

[12]  Eitan Rubin,et al.  Biases and complex patterns in the residues flanking protein N-glycosylation sites. , 2003, Glycobiology.

[13]  Jesús Jiménez-Barbero,et al.  New structural insights into carbohydrate-protein interactions from NMR spectroscopy. , 2003, Current opinion in structural biology.

[14]  S. Miyakis,et al.  Beta-2 glycoprotein I and its role in antiphospholipid syndrome-lessons from knockout mice. , 2004, Clinical immunology.

[15]  S. Brunak,et al.  Prediction, conservation analysis, and structural characterization of mammalian mucin-type O-glycosylation sites. , 2005, Glycobiology.

[16]  A. Dell,et al.  Glycoprotein Structure Determination by Mass Spectrometry , 2001, Science.