PROCARB: A Database of Known and Modelled Carbohydrate-Binding Protein Structures with Sequence-Based Prediction Tools

Understanding of the three-dimensional structures of proteins that interact with carbohydrates covalently (glycoproteins) as well as noncovalently (protein-carbohydrate complexes) is essential to many biological processes and plays a significant role in normal and disease-associated functions. It is important to have a central repository of knowledge available about these protein-carbohydrate complexes as well as preprocessed data of predicted structures. This can be significantly enhanced by tools de novo which can predict carbohydrate-binding sites for proteins in the absence of structure of experimentally known binding site. PROCARB is an open-access database comprising three independently working components, namely, (i) Core PROCARB module, consisting of three-dimensional structures of protein-carbohydrate complexes taken from Protein Data Bank (PDB), (ii) Homology Models module, consisting of manually developed three-dimensional models of N-linked and O-linked glycoproteins of unknown three-dimensional structure, and (iii) CBS-Pred prediction module, consisting of web servers to predict carbohydrate-binding sites using single sequence or server-generated PSSM. Several precomputed structural and functional properties of complexes are also included in the database for quick analysis. In particular, information about function, secondary structure, solvent accessibility, hydrogen bonds and literature reference, and so forth, is included. In addition, each protein in the database is mapped to Uniprot, Pfam, PDB, and so forth.

[1]  Mahesh Kulharia,et al.  InCa-SiteFinder: a method for structure-based prediction of inositol and carbohydrate binding sites on proteins. , 2009, Journal of molecular graphics & modelling.

[2]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[3]  J. Musser,et al.  Chapter 31. Carbohydrates as Drug Discovery Leads , 1992 .

[4]  Hugo Kubinyi,et al.  MOLEKULARE AHNLICHKEIT. 2. STRUKTURBASIERTER ENTWURF VON WIRKSTOFFEN , 1998 .

[5]  M. L. Jones,et al.  PDBsum: a Web-based database of summaries and analyses of all PDB structures. , 1997, Trends in biochemical sciences.

[6]  B. Lanne,et al.  Microbial interaction with animal cell surface carbohydrates. , 1992, APMIS. Supplementum.

[7]  M C Peitsch,et al.  Is apolipoprotein D a mammalian bilin-binding protein? , 1992, The New biologist.

[8]  R. Schnaar,et al.  Complex Carbohydrates in Drug Development , 1992, Advances in Pharmacology.

[9]  K Schulten,et al.  VMD: visual molecular dynamics. , 1996, Journal of molecular graphics.

[10]  C. Strader,et al.  Mutational analysis of beta-adrenergic receptor glycosylation. , 1990, The Journal of biological chemistry.

[11]  S. Brunak,et al.  Prediction, conservation analysis, and structural characterization of mammalian mucin-type O-glycosylation sites. , 2005, Glycobiology.

[12]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[13]  J. Naismith,et al.  Structural Basis of Trimannoside Recognition by Concanavalin A (*) , 1996, The Journal of Biological Chemistry.

[14]  Vasant Honavar,et al.  Glycosylation site prediction using ensembles of Support Vector Machine classifiers , 2007, BMC Bioinformatics.

[15]  Shandar Ahmad,et al.  ASAView: Database and tool for solvent accessibility representation in proteins , 2003, BMC Bioinformatics.

[16]  Takashi Yamane,et al.  An empirical approach for structure-based prediction of carbohydrate-binding sites on proteins. , 2003, Protein engineering.

[17]  L. Holm,et al.  The Pfam protein families database , 2005, Nucleic Acids Res..

[18]  H Kubinyi [Molecular similarity. 2. The structural basis of drug design]. , 1998, Pharmazie in unserer Zeit.

[19]  S. Hakomori,et al.  Possible functions of tumor-associated carbohydrate antigens. , 1991, Current opinion in immunology.

[20]  J. Thornton,et al.  Satisfying hydrogen bonding potential in proteins. , 1994, Journal of molecular biology.

[21]  P. Kelly,et al.  Characterization of the structure and glycosylation properties of intracellular and cell surface rat hepatic prolactin receptors. , 1992, Endocrinology.

[22]  Søren Brunak,et al.  O-GLYCBASE version 4.0: a revised database of O-glycosylated proteins , 1999, Nucleic Acids Res..

[23]  J M Thornton,et al.  Analysis and prediction of carbohydrate binding sites. , 2000, Protein engineering.

[24]  T. Springer,et al.  The sensation and regulation of interactions with the extracellular environment: the cell biology of lymphocyte adhesion receptors. , 1990, Annual review of cell biology.

[25]  Jan Adam,et al.  Engineering of PA-IIL lectin from Pseudomonas aeruginosa – Unravelling the role of the specificity loop for sugar preference , 2007, BMC Structural Biology.

[26]  Adeel Malik,et al.  A molecular and in silico characterization of Hev b 4, a glycosylated latex allergen. , 2008, International journal of biological macromolecules.

[27]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[28]  A G Murzin,et al.  SCOP: a structural classification of proteins database for the investigation of sequences and structures. , 1995, Journal of molecular biology.

[29]  K. Sen,et al.  Molecular Similarity II , 1995 .

[30]  G J Davies,et al.  Protein--carbohydrate interactions: learning lessons from nature. , 2001, Trends in biotechnology.

[31]  Firoze B. Jungalwala,et al.  Expression and biological functions of sulfoglucuronyl glycolipids (SGGLs) in the nervous system—A review , 1994, Neurochemical Research.

[32]  Manuel C. Peitsch,et al.  About the use of protein models , 2002, Bioinform..

[33]  Eric G. Bremer,et al.  Chapter 15 - Glycosphingolipids as Effectors of Growth and Differentiation , 1994 .

[34]  Rolf Apweiler,et al.  The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000 , 2000, Nucleic Acids Res..

[35]  Bas Vroling,et al.  Homology modelling and spectroscopy, a never-ending love story , 2009, European Biophysics Journal.

[36]  M J Sternberg,et al.  Enhancement of protein modeling by human intervention in applying the automatic programs 3D‐JIGSAW and 3D‐PSSM , 2001, Proteins.

[37]  M C Peitsch,et al.  The first lipocalin with enzymatic activity. , 1991, Trends in biochemical sciences.

[38]  L. F. Kolakowski,et al.  The Role of N-Glycosylation for Functional Expression of the Human Platelet-activating Factor Receptor , 1995, The Journal of Biological Chemistry.

[39]  Cathy H. Wu,et al.  The Universal Protein Resource (UniProt): an expanding universe of protein information , 2005, Nucleic Acids Res..