UniCarbKB: building a knowledge platform for glycoproteomics

The UniCarb KnowledgeBase (UniCarbKB; http://unicarbkb.org) offers public access to a growing, curated database of information on the glycan structures of glycoproteins. UniCarbKB is an international effort that aims to further our understanding of the structures, pathways and networks involved in glycosylation and glyco-mediated processes by integrating structural, experimental and functional glycoscience information. The initiative builds on the success of the glycan structure database GlycoSuiteDB, together with the informatics standards introduced by EUROCarbDB, to provide a high-quality, up-to-date resource to support glycomics and glycoproteomics research. UniCarbKB provides comprehensive information on glycan structures and on published glycoproteins, including global and site-specific attachment information. For the first release, over 890 references, 3740 glycan structure entries and 400 glycoproteins have been curated. In addition, 598 protein glycosylation sites have been annotated with experimentally confirmed glycan structures from the literature. Among these are 35 glycoproteins, 502 structures and 60 publications not previously included in GlycoSuiteDB. This article provides an update on the transformation of GlycoSuiteDB (featured in previous NAR Database issues and hosted by ExPASy since 2009) into UniCarbKB and its integration with UniProtKB and GlycoMod. Here, we introduce a refactored database, supported by substantial new curated data collections and intuitive user interfaces that improve database searching.
