Multiple Tree Alignment with Weights Applied to Carbohydrates to Extract Binding Recognition Patterns

The purpose of our research is the elucidation of glycan recognition patterns. Glycans are composed of monosaccharides and have complex structures with branches due to the fact that monosaccharides have multiple potential binding positions compared to amino acids. Each monosaccharide can potentially be bound by up to five other monosaccharides, compared to two for any amino acid. Glycans are often bound to proteins and lipids on the cell surface and play important roles in biological processes. Lectins in particular are proteins that recognize and bind to glycans. In general, lectins bind to the terminal monosaccharides of glycans on glycoconjugates. However, it is suggested that some lectins recognize not only terminal monosaccharides, but also internal monosaccharides, possibly influencing the binding affinity. Such analyses are difficult without novel bioinformatics techniques. Thus, in order to better understand the glycan recognition mechanism of such biomolecules, we have implemented a novel algorithm for aligning glycan tree structures, which we provide as a web tool called MCAW (Multiple Carbohydrate Alignment with Weights). From our web tool, we have analyzed several different lectins, and our results could confirm the existence of well-known glycan motifs. Our work can now be used in several other analyses of glycan structures, such as in the development of glycan score matrices as well as in state model determination of probabilistic tree models. Therefore, this work is a fundamental step in glycan pattern analysis to progress glycobiology research.

[1]  J. Marth,et al.  Glycosylation in Cellular Mechanisms of Health and Disease , 2006, Cell.

[2]  Tatsuya Akutsu,et al.  KCaM (KEGG Carbohydrate Matcher): a software tool for analyzing the structures of carbohydrate sugar chains , 2004, Nucleic Acids Res..

[3]  W. Fitch,et al.  Construction of phylogenetic trees. , 1967, Science.

[4]  Kiyoko F. Aoki-Kinoshita,et al.  ProfilePSTMM: capturing tree-structure motifs in carbohydrate sugar chains , 2006, ISMB.

[5]  Tatsuya Akutsu,et al.  A score matrix to reveal the hidden links in glycans , 2005, Bioinform..

[6]  Ruth Nussinov,et al.  A method for simultaneous alignment of multiple protein structures , 2004, Proteins.

[7]  Sean R. Eddy,et al.  Profile hidden Markov models , 1998, Bioinform..

[8]  Wei Lang,et al.  Advancing glycomics: implementation strategies at the consortium for functional glycomics. , 2006, Glycobiology.

[9]  Philip Bille,et al.  A survey on tree edit distance and related problems , 2005, Theor. Comput. Sci..

[10]  J. Thompson,et al.  CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. , 1994, Nucleic acids research.

[11]  Kiyoko F Aoki-Kinoshita,et al.  The RINGS resource for glycome informatics analysis and data mining on the Web. , 2010, Omics : a journal of integrative biology.

[12]  Ten Feizi,et al.  Oligosaccharide microarrays for high-throughput detection and specificity assignments of carbohydrate-protein interactions , 2002, Nature Biotechnology.

[13]  O. Blixt,et al.  Identification of ligand specificities for glycan-binding proteins using glycan arrays. , 2006, Methods in enzymology.

[14]  Tatsuya Akutsu,et al.  A probabilistic model for mining labeled ordered trees: capturing patterns in carbohydrate sugar chains , 2005, IEEE Transactions on Knowledge and Data Engineering.

[15]  Kiyoko F. Aoki-Kinoshita Glycome Informatics: Methods and Applications , 2009 .