glypy - An open source glycoinformatics library.

Glycoinformatics is a critical resource for the study of glycobiology, and glycobiology is a necessary component for understanding the complex interface between intra- and extra-cellular spaces. Despite this, there is limited software available to scientists studying these topics, requiring each to create fundamental data structures and representations anew for each of their applications. This leads to poor uptake of standardization and loss of focus on the real problems. We present glypy, a library written in Python for reading, writing, manipulating, and transforming glycans at several levels of precision. In addition to understanding several standardized formats for textual representation of glycans, the library also provides APIs for major community database, including GlyTouCan and UnicarbKB. The library is freely available under the Apache 2 common license, with source code available at https://github.com/mobiusklein/glypy and documentation at https://glypy.readthedocs.io.

[1]  Luis Carvalho,et al.  Application of network smoothing to glycan LC‐MS profiling , 2018, Bioinform..

[2]  Richard D. Cummings,et al.  The repertoire of glycan determinants in the human glycome. , 2009, Molecular bioSystems.

[3]  Evan Bolton,et al.  Symbol Nomenclature for Graphical Representations of Glycans. , 2015, Glycobiology.

[4]  David L. Wheeler,et al.  GenBank , 2015, Nucleic Acids Res..

[5]  Yaniv Altshuler,et al.  Glycoforum a Novel Linear Code ® Nomenclature for Complex Carbohydrates , 2022 .

[6]  Sriram Neelamegham,et al.  A Computational Framework for the Automated Construction of Glycosylation Reaction Networks , 2014, PloS one.

[7]  Joseph Zaia,et al.  Complete Molecular Weight Profiling of Low-Molecular Weight Heparins Using Size Exclusion Chromatography-Ion Suppressor-High-Resolution Mass Spectrometry. , 2016, Analytical chemistry.

[8]  María Martín,et al.  UniProt: A hub for protein information , 2015 .

[9]  The Uniprot Consortium,et al.  UniProt: a hub for protein information , 2014, Nucleic Acids Res..

[10]  C. Lebrilla,et al.  Glycans in the immune system and The Altered Glycan Theory of Autoimmunity: a critical review. , 2015, Journal of autoimmunity.

[11]  Lokesh Joshi,et al.  The cancer glycome: carbohydrates as mediators of metastasis. , 2015, Blood reviews.

[12]  Tatsuya Akutsu,et al.  Efficient tree-matching methods for accurate carbohydrate database queries. , 2003, Genome informatics. International Conference on Genome Informatics.

[13]  Ajit Varki,et al.  Biological roles of glycans , 2016, Glycobiology.

[14]  Kiyoko F. Aoki-Kinoshita,et al.  GlyTouCan: an accessible glycan structure repository. , 2017, Glycobiology.

[15]  Kiyoko F Aoki-Kinoshita,et al.  Comprehensive analysis of the N-glycan biosynthetic pathway using bioinformatics to generate UniCorn: A theoretical N-glycan structure database. , 2016, Carbohydrate research.

[16]  Kiyoko F. Aoki-Kinoshita,et al.  GlyTouCan 1.0 – The international glycan structure repository , 2015, Nucleic Acids Res..

[17]  A. McNaught Nomenclature of carbohydrates (IUPAC Recommendations 1996) , 1996, Advances in carbohydrate chemistry and biochemistry.

[18]  Joshua A Klein,et al.  Deep Sequencing of Complex Proteoglycans: A Novel Strategy for High Coverage and Site-specific Identification of Glycosaminoglycan-linked Peptides * , 2018, Molecular & Cellular Proteomics.

[19]  Martin Frank,et al.  EUROCarbDB: An open-access platform for glycoinformatics , 2010, Glycobiology.

[20]  Hans H Wandall,et al.  Global aspects of viral glycosylation , 2018, Glycobiology.

[21]  Lev I Levitsky,et al.  Pyteomics 4.0: Five Years of Development of a Python Proteomics Framework. , 2018, Journal of proteome research.

[22]  Robert J Woods,et al.  Integrated Omics and Computational Glycobiology Reveal Structural Basis for Influenza A Virus Glycan Microheterogeneity and Host Interactions , 2016, Molecular & Cellular Proteomics.

[23]  Kiyoko F. Aoki-Kinoshita A Practical Guide to Using Glycomics Databases , 2017 .

[24]  Portland Press Ltd IUPAC-IUB Joint Commission on Biochemical Nomenclature (JCBN). Nomenclature and symbolism for amino acids and peptides. Recommendations 1983 , 1984 .

[25]  Michael Hippler,et al.  pymzML v2.0: introducing a highly compressed and seekable gzip format , 2018, Bioinform..

[26]  Vassilios Ioannidis,et al.  ExPASy: SIB bioinformatics resource portal , 2012, Nucleic Acids Res..

[27]  Richard D Cummings,et al.  Symbol nomenclature for glycan representation , 2009, Proteomics.

[28]  Lars Malmström,et al.  pyOpenMS: A Python‐based interface to the OpenMS mass‐spectrometry algorithm library , 2014, Proteomics.

[29]  Oliver Horlacher,et al.  MzJava: An open source library for mass spectrometry data processing. , 2015, Journal of proteomics.

[30]  Y. Mechref,et al.  Quantitative Glycomics: A Combined Analytical and Bioinformatics Approach. , 2017, Methods in enzymology.

[31]  Hisashi Narimatsu,et al.  Toolboxes for a standardised and systematic study of glycans , 2014, BMC Bioinformatics.

[32]  Oliver Horlacher,et al.  Property Graph vs RDF Triple Store: A Comparison on Glycan Substructure Search , 2015, PloS one.

[33]  Martin Frank,et al.  Bioinformatics and molecular modeling in glycobiology , 2010, Cellular and Molecular Life Sciences.

[34]  Kiyoko F. Aoki-Kinoshita,et al.  WURCS 2.0 Update To Encapsulate Ambiguous Carbohydrate Structures , 2017, J. Chem. Inf. Model..

[35]  Catherine E Costello,et al.  Microfluidic Capillary Electrophoresis-Mass Spectrometry for Analysis of Monosaccharides, Oligosaccharides, and Glycopeptides. , 2017, Analytical chemistry.

[36]  Richard D Cummings,et al.  Protein glycosylation in cancer. , 2015, Annual review of pathology.

[37]  Yuri A. Mirokhin,et al.  Creating a Mass Spectral Reference Library for Oligosaccharides in Human Milk. , 2018, Analytical chemistry.

[38]  Ferenc Horkay,et al.  Microscale mapping of extracellular matrix elasticity of mouse joint cartilage: an approach to extracting bulk elasticity of soft matter with surface roughness. , 2018, Soft matter.

[39]  Sriram Neelamegham,et al.  Integration of systems glycobiology with bioinformatics toolboxes, glycoinformatics resources, and glycoproteomics data , 2015, Wiley interdisciplinary reviews. Systems biology and medicine.

[40]  Philip V. Toukach,et al.  GlycoRDF: an ontology to standardize glycomics data in RDF , 2015, Bioinform..

[41]  Bartek Wilczynski,et al.  Biopython: freely available Python tools for computational molecular biology and bioinformatics , 2009, Bioinform..

[42]  Kiyoko F. Aoki-Kinoshita,et al.  UniCarbKB: building a knowledge platform for glycoproteomics , 2013, Nucleic Acids Res..

[43]  C. Lieth,et al.  GlycoCT-a unifying sequence format for carbohydrates. , 2008, Carbohydrate research.