Enzyme annotation in UniProtKB using Rhea

Motivation: To provide high-quality, computationally tractable enzyme annotation in UniProtKB using Rhea, a comprehensive expert-curated knowledgebase of biochemical reactions which describes reaction participants using the ontology ChEBI (Chemical Entities of Biological Interest).

Results: We replaced existing textual descriptions of biochemical reactions in UniProtKB with their equivalents from Rhea, which is now the standard for annotation of enzymatic reactions in UniProtKB. We developed improved search and query facilities for the UniProt website, REST API, and SPARQL endpoint that leverage the chemical structure data, nomenclature, and classification that Rhea and ChEBI provide.

Availability and Implementation: UniProtKB at https://www.uniprot.org/; UniProt REST API at https://www.uniprot.org/help/api; UniProt SPARQL endpoint at https://sparql.uniprot.org/sparql; Rhea at https://www.rhea-db.org/.

Contact: anne.morgat@sib.swiss
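
As an illustration of the kind of query the SPARQL endpoint makes possible, the following minimal sketch builds a query for UniProtKB entries annotated with a given Rhea reaction and encodes it as a GET request URL. The endpoint address comes from the availability statement above; the helper names, the placeholder Rhea identifier, and the exact predicates (`up:catalyticActivity`, `up:catalyzedReaction`) are assumptions based on the public UniProt RDF vocabulary and should be checked against the current schema before use.

```python
from urllib.parse import urlencode

# Endpoint taken from the availability statement of the abstract.
SPARQL_ENDPOINT = "https://sparql.uniprot.org/sparql"


def build_rhea_query(rhea_id: int, limit: int = 10) -> str:
    """Return a SPARQL query for proteins annotated with one Rhea reaction.

    Hypothetical helper: the predicates below are assumed from the UniProt
    core vocabulary and are not taken from the abstract itself.
    """
    if rhea_id <= 0 or limit <= 0:
        raise ValueError("rhea_id and limit must be positive integers")
    return (
        "PREFIX up: <http://purl.uniprot.org/core/>\n"
        "PREFIX rh: <http://rdf.rhea-db.org/>\n"
        "SELECT ?protein\n"
        "WHERE {\n"
        "  ?protein up:annotation ?annotation .\n"
        "  ?annotation a up:Catalytic_Activity_Annotation ;\n"
        "              up:catalyticActivity ?activity .\n"
        f"  ?activity up:catalyzedReaction rh:{rhea_id} .\n"
        "}\n"
        f"LIMIT {limit}"
    )


def build_request_url(query: str) -> str:
    """Encode the query as a standard SPARQL-protocol GET request URL."""
    return f"{SPARQL_ENDPOINT}?{urlencode({'query': query})}"


# Example: 10000 is used only as a placeholder Rhea identifier.
example_query = build_rhea_query(10000, limit=5)
example_url = build_request_url(example_query)
```

The resulting URL can be fetched with any HTTP client, for example `urllib.request.urlopen`, typically with an `Accept: application/sparql-results+json` header. The sketch deliberately stops short of sending the request so that it runs without network access.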
