KEGG as a glycome informatics resource.

Bioinformatics approaches to carbohydrate research have recently begun using large amounts of protein and carbohydrate data. In this field called glycome informatics, the foremost necessity is a comprehensive resource for genome-scale bioinformatics analysis of glycan data. Although the accumulation of experimental data may be useful as a reference of biological and biochemical information on carbohydrates, this is insufficient for bioinformatics analysis. Thus, we have developed a glycome informatics resource (http://www.genome.jp/kegg/glycan/) in KEGG (Kyoto Encyclopedia of Genes and Genomes), an integrated knowledge base of protein networks, genomic information, and chemical information. This review describes three noteworthy features: (1) GLYCAN, a database of carbohydrate structures; (2) glycan-related pathways; and (3) Composite Structure Map (CSM), a map illustrating all possible variations of carbohydrate structures within organisms. GLYCAN includes two useful tools: an intuitive drawing tool called KegDraw, and an efficient glycan search and alignment tool called KEGG Carbohydrate Matcher (KCaM). KEGG's glycan biosynthesis and metabolism pathways, integrating carbohydrate structures, proteins, and reactions, are also a pivotal resource. CSM is constructed as a bridge between carbohydrate functions and structures. CSM is able to display, for example, expression data of glycosyltransferases in a compact manner. In all the KEGG resources, various objects including KEGG pathways, chemical compounds, as well as carbohydrate structures are commonly represented as graphs, which are widely studied and utilized in the computer science field.

[1]  Susumu Goto,et al.  The KEGG resource for deciphering the genome , 2004, Nucleic Acids Res..

[2]  Yoshihiro Yamanishi,et al.  Extraction of leukemia specific glycan motifs in humans by computational glycomics. , 2005, Carbohydrate research.

[3]  Abbreviations , 1971 .

[4]  R. Dwek,et al.  Glycobiology , 2018, Biochimie.

[5]  C. W. von der Lieth,et al.  LINUCS: linear notation for unique description of carbohydrate sequences. , 2001, Carbohydrate research.

[6]  Andreas Bohne,et al.  SWEET-DB: an attempt to create annotated data collections for carbohydrates , 2002, Nucleic Acids Res..

[7]  Tatsuya Akutsu,et al.  KCaM (KEGG Carbohydrate Matcher): a software tool for analyzing the structures of carbohydrate sugar chains , 2004, Nucleic Acids Res..

[8]  P. Albersheim,et al.  Letter to the Glyco-Forum CarbBank , 1992 .

[9]  M. Kanehisa,et al.  Development of a chemical structure comparison method for integrated analysis of chemical and genomic information in the metabolic pathways. , 2003, Journal of the American Chemical Society.

[10]  S. Brunak,et al.  Prediction, conservation analysis, and structural characterization of mammalian mucin-type O-glycosylation sites. , 2005, Glycobiology.

[11]  Kiyoko F. Aoki-Kinoshita,et al.  A global representation of the carbohydrate structures: a tool for the analysis of glycan. , 2005, Genome informatics. International Conference on Genome Informatics.

[12]  S. Miyakis,et al.  Beta-2 glycoprotein I and its role in antiphospholipid syndrome-lessons from knockout mice. , 2004, Clinical immunology.

[13]  김삼묘,et al.  “Bioinformatics” 특집을 내면서 , 2000 .

[14]  P. Thibault,et al.  Identification of glycan structure and glycosylation sites in cellobiohydrolase II and endoglucanases I and II from Trichoderma reesei. , 2002, Glycobiology.

[15]  Martin Frank,et al.  Bioinformatics for glycomics: Status, methods, requirements and perspectives , 2004, Briefings Bioinform..

[16]  Jesús Jiménez-Barbero,et al.  New structural insights into carbohydrate-protein interactions from NMR spectroscopy. , 2003, Current opinion in structural biology.

[17]  A. Dell,et al.  Glycoprotein Structure Determination by Mass Spectrometry , 2001, Science.

[18]  I Marchal,et al.  Bioinformatics in glycobiology. , 2003, Biochimie.

[19]  Tatsuya Akutsu,et al.  Application of a new probabilistic model for recognizing complex patterns in glycans , 2004, ISMB/ECCB.

[20]  Tatsuya Akutsu,et al.  Efficient tree-matching methods for accurate carbohydrate database queries. , 2003, Genome informatics. International Conference on Genome Informatics.

[21]  K Bock,et al.  The Complex Carbohydrate Structure Database. , 1989, Trends in biochemical sciences.

[22]  Anil K. Kesarwani,et al.  Genome Informatics , 2019, Encyclopedia of Bioinformatics and Computational Biology.

[23]  Hiren J. Joshi,et al.  GlycoSuiteDB: a curated relational database of glycoprotein glycan structures and their biological sources. 2003, update , 2003, Nucleic Acids Res..

[24]  Yoshihiro Yamanishi,et al.  Extraction of species-specific glycan substructures. , 2004, Genome informatics. International Conference on Genome Informatics.

[25]  Eitan Rubin,et al.  Biases and complex patterns in the residues flanking protein N-glycosylation sites. , 2003, Glycobiology.

[26]  Raymond A Dwek,et al.  Statistical analysis of the protein environment of N-glycosylation sites: implications for occupancy, structure, and folding. , 2003, Glycobiology.

[27]  Tatsuya Akutsu,et al.  A score matrix to reveal the hidden links in glycans , 2005, Bioinform..