The Carbohydrate-Active EnZymes database (CAZy): an expert resource for Glycogenomics

The Carbohydrate-Active Enzyme (CAZy) database is a knowledge-based resource specialized in the enzymes that build and break down complex carbohydrates and glycoconjugates. As of September 2008, the database describes the present knowledge on 113 glycoside hydrolase, 91 glycosyltransferase, 19 polysaccharide lyase, 15 carbohydrate esterase and 52 carbohydrate-binding module families. These families are created based on experimentally characterized proteins and are populated by sequences from public databases that show significant similarity to them. Protein biochemical information is continuously curated based on the available literature and structural information. Over 6400 proteins have assigned EC numbers and 700 proteins have a PDB structure. The classification (i) reflects the structural features of these enzymes better than their substrate specificity alone, (ii) helps to reveal the evolutionary relationships between these enzymes and (iii) provides a convenient framework for understanding their mechanistic properties. This resource has been available to the scientific community for over 10 years, contributing to information dissemination and providing a transversal nomenclature to glycobiologists. More recently, it has been used to improve the quality of functional predictions in a number of genome projects by providing expert annotation. The CAZy resource is available at http://www.cazy.org/.
