Glycan Reader is improved to recognize most sugar types and chemical modifications in the Protein Data Bank

Motivation: Glycans play a central role in many essential biological processes. Glycan Reader was originally developed to simplify the reading of Protein Data Bank (PDB) files containing glycans through the automatic detection and annotation of sugars and glycosidic linkages between sugar units and to proteins, all based on atomic coordinates and connectivity information. Carbohydrates can have various chemical modifications at different positions, making their chemical space much diverse. Unfortunately, current PDB files do not provide exact annotations for most carbohydrate derivatives and more than 50% of PDB glycan chains have at least one carbohydrate derivative that could not be correctly recognized by the original Glycan Reader. Results: Glycan Reader has been improved and now identifies most sugar types and chemical modifications (including various glycolipids) in the PDB, and both PDB and PDBx/mmCIF formats are supported. CHARMM‐GUI Glycan Reader is updated to generate the simulation system and input of various glycoconjugates with most sugar types and chemical modifications. It also offers a new functionality to edit the glycan structures through addition/deletion/modification of glycosylation types, sugar types, chemical modifications, glycosidic linkages, and anomeric states. The simulation system and input files can be used for CHARMM, NAMD, GROMACS, AMBER, GENESIS, LAMMPS, Desmond, OpenMM, and CHARMM/OpenMM. Glycan Fragment Database in GlycanStructure.Org is also updated to provide an intuitive glycan sequence search tool for complex glycan structures with various chemical modifications in the PDB. Availability and implementation: http://www.charmm‐gui.org/input/glycan and http://www.glycanstructure.org. Contact: wonpil@lehigh.edu Supplementary information: Supplementary data are available at Bioinformatics online.

[1]  Andreas Bohne,et al.  SWEET - WWW-based rapid 3D construction of oligo- and polysaccharides , 1999, Bioinform..

[2]  Jing Huang,et al.  CHARMM36 all‐atom additive protein force field: Validation based on comparison to NMR data , 2013, J. Comput. Chem..

[3]  Claus-Wilhelm von der Lieth,et al.  pdb-care (PDB CArbohydrate REsidue check): a program to support annotation of complex carbohydrate structures in PDB files , 2004, BMC Bioinformatics.

[4]  Yasuhiro Matsunaga,et al.  GENESIS: a hybrid-parallel and multi-scale molecular dynamics simulator with enhanced sampling algorithms for biomolecular and cellular simulations , 2015, Wiley interdisciplinary reviews. Computational molecular science.

[5]  Alexander D. MacKerell,et al.  CHARMM-GUI PDB manipulator for advanced modeling and simulations of proteins containing nonstandard residues. , 2014, Advances in protein chemistry and structural biology.

[6]  Abhik Mukhopadhyay,et al.  Small molecule annotation for the Protein Data Bank , 2014, Database J. Biol. Databases Curation.

[7]  Charles L. Brooks,et al.  Parallelization and improvements of the generalized born model with a simple sWitching function for modern graphics processors , 2016, J. Comput. Chem..

[8]  Alexander D. MacKerell,et al.  CHARMM‐GUI 10 years for biomolecular modeling and simulation , 2017, J. Comput. Chem..

[9]  Alexander D. MacKerell,et al.  CHARMM additive all-atom force field for carbohydrate derivatives and its utility in polysaccharide and carbohydrate-protein modeling. , 2011, Journal of chemical theory and computation.

[10]  R Apweiler,et al.  On the frequency of protein glycosylation, as deduced from analysis of the SWISS-PROT database. , 1999, Biochimica et biophysica acta.

[11]  Sunhwan Jo,et al.  PBEQ-Solver for online visualization of electrostatic potential of biomolecules , 2008, Nucleic Acids Res..

[12]  S. Pinho,et al.  Glycosylation in cancer: mechanisms and clinical implications , 2015, Nature Reviews Cancer.

[13]  Claus-Wilhelm von der Lieth,et al.  GlyProt: in silico glycosylation of proteins , 2005, Nucleic Acids Res..

[14]  Sunhwan Jo,et al.  Preferred conformations of N-glycan core pentasaccharide in solution and in glycoproteins. , 2015, Glycobiology.

[15]  P. K. Mehta,et al.  Protective immunity to experimental tuberculosis by mannophosphoinositides of mycobacteria , 2004, Medical Microbiology and Immunology.

[16]  Mario Vento,et al.  A (sub)graph isomorphism algorithm for matching large graphs , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  J. Møller,et al.  Interaction of membrane proteins and lipids with solubilizing detergents. , 2000, Biochimica et biophysica acta.

[18]  Sunhwan Jo,et al.  Glycan fragment database: a database of PDB-based glycan 3D structures , 2012, Nucleic Acids Res..

[19]  Martin Frank,et al.  GLYCOSCIENCES.de: an Internet portal to support glycomics and glycobiology research. , 2006, Glycobiology.

[20]  Jeffery B. Klauda,et al.  CHARMM-GUI Membrane Builder for mixed bilayers and its application to yeast membranes. , 2009, Biophysical journal.

[21]  Jianpeng Ma,et al.  CHARMM: The biomolecular simulation program , 2009, J. Comput. Chem..

[22]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[23]  Diwakar Shukla,et al.  OpenMM 4: A Reusable, Extensible, Hardware Independent Library for High Performance Molecular Simulation. , 2013, Journal of chemical theory and computation.

[24]  D. Ornitz,et al.  FGFs, heparan sulfate and FGFRs: complex interactions essential for development. , 2000, BioEssays : news and reviews in molecular, cellular and developmental biology.

[25]  Sunhwan Jo,et al.  GS-align for glycan structure alignment and similarity measurement , 2015, Bioinform..

[26]  Philip E. Bourne,et al.  [30] Macromolecular crystallographic information file , 1997 .

[27]  Alexander D. MacKerell,et al.  Glycan reader: Automated sugar identification and simulation preparation for carbohydrates and glycoproteins , 2011, J. Comput. Chem..

[28]  Wonpil Im,et al.  Bilayer Properties of Lipid A from Various Gram-Negative Bacteria. , 2016, Biophysical journal.

[29]  Ian A. Wilson,et al.  Structural Characterization of Mycobacterial Phosphatidylinositol Mannoside Binding to Mouse CD1d12 , 2006, The Journal of Immunology.

[30]  C. W. von der Lieth,et al.  LINUCS: linear notation for unique description of carbohydrate sequences. , 2001, Carbohydrate research.

[31]  Holger Gohlke,et al.  The Amber biomolecular simulation programs , 2005, J. Comput. Chem..

[32]  Pauline M Rudd,et al.  Sugar-mediated ligand-receptor interactions in the immune system. , 2004, Trends in biotechnology.

[33]  Martin Frank,et al.  Data mining the protein data bank: automatic detection and assignment of carbohydrate structures. , 2004, Carbohydrate research.

[34]  Keith Paton,et al.  An algorithm for finding a fundamental set of cycles of a graph , 1969, CACM.

[35]  Serge Pérez,et al.  POLYS 2.0: An open source software package for building three‐dimensional structures of polysaccharides , 2014, Biopolymers.

[36]  Evan Bolton,et al.  Symbol Nomenclature for Graphical Representations of Glycans. , 2015, Glycobiology.

[37]  Nigel Chaffey,et al.  Alberts, B., Johnson, A., Lewis, J., Raff, M., Roberts, K. and Walter, P. Molecular biology of the cell. 4th edn. , 2003 .

[38]  Reinhard Schwartz-Albiez,et al.  Functions and Biosynthesis of O-Acetylated Sialic Acids , 2012, Topics in current chemistry.

[39]  Wonpil Im,et al.  Effects of N-glycosylation on protein conformation and dynamics: Protein Data Bank analysis and molecular dynamics simulation study , 2015, Scientific Reports.

[40]  W. Im,et al.  Roles of glycans in interactions between gp120 and HIV broadly neutralizing antibodies. , 2015, Glycobiology.

[41]  Berk Hess,et al.  GROMACS: High performance molecular simulations through multi-level parallelism from laptops to supercomputers , 2015 .

[42]  Yuji Sugita,et al.  Conformational flexibility of N-glycans in solution studied by REMD simulations , 2012, Biophysical Reviews.

[43]  Magnus Lundborg,et al.  Structural analysis of glycans by NMR chemical shift prediction. , 2011, Analytical chemistry.

[44]  Jeffrey C Gildersleeve,et al.  Modifications of glycans: biological significance and therapeutic opportunities. , 2012, ACS chemical biology.

[45]  Xi Chen,et al.  Carbohydrate post-glycosylational modifications. , 2007, Organic & biomolecular chemistry.

[46]  Steve Plimpton,et al.  Fast parallel algorithms for short-range molecular dynamics , 1993 .

[47]  Carolyn R. Bertozzi,et al.  Essentials of Glycobiology , 1999 .

[48]  Serge Pérez,et al.  Glyco3D: a portal for structural glycosciences. , 2015, Methods in molecular biology.

[49]  Alexander D. MacKerell,et al.  CHARMM-GUI Input Generator for NAMD, GROMACS, AMBER, OpenMM, and CHARMM/OpenMM Simulations Using the CHARMM36 Additive Force Field , 2015, Journal of chemical theory and computation.

[50]  Taehoon Kim,et al.  CHARMM‐GUI: A web‐based graphical user interface for CHARMM , 2008, J. Comput. Chem..

[51]  Göran Widmalm,et al.  CarbBuilder: Software for building molecular models of complex oligo‐ and polysaccharide structures , 2016, J. Comput. Chem..

[52]  Sunhwan Jo,et al.  CHARMM‐GUI Membrane Builder toward realistic biological membrane simulations , 2014, J. Comput. Chem..

[53]  Robbie P Joosten,et al.  Carbohydrate 3D structure validation. , 2017, Current opinion in structural biology.

[54]  Laxmikant V. Kalé,et al.  Scalable molecular dynamics with NAMD , 2005, J. Comput. Chem..