EnzyMine: a comprehensive database for enzyme function annotation with enzymatic reaction chemical feature

Addition of chemical structural information in enzymatic reactions has proven to be significant for accurate enzyme function prediction. However, such chemical data lack systematic feature mining and hardly exist in enzyme-related databases. Therefore, global mining of enzymatic reactions will offer a unique landscape for researchers to understand the basic functional mechanisms of natural bioprocesses and facilitate enzyme function annotation. Here, we established a new knowledge base called EnzyMine, through which we propose to elucidate enzymatic reaction features and then link them with sequence and structural annotations. EnzyMine represents an advanced database that extends enzyme knowledge by incorporating reaction chemical feature strategies, strengthening the connectivity between enzyme and metabolic reactions. Therefore, it has the potential to reveal many new metabolic pathways involved with given enzymes, as well as expand enzyme function annotation. Database URL: http://www.rxnfinder.org/enzymine/.

[1]  Yu Tian,et al.  RxnBLAST: molecular scaffold and reactive chemical environment feature extractor for biochemical reactions , 2020, Bioinform..

[2]  David A. Lee,et al.  CATH: an expanded resource to predict protein function through structure and sequence , 2016, Nucleic Acids Res..

[3]  Susumu Goto,et al.  PathPred: an enzyme-catalyzed metabolic pathway prediction server , 2010, Nucleic Acids Res..

[4]  D. Machado,et al.  Fast automated reconstruction of genome-scale metabolic models for microbial species and communities , 2018, bioRxiv.

[5]  Jeffrey Skolnick,et al.  EFICAz2.5: application of a high-precision enzyme function predictor to 396 proteomes , 2012, Bioinform..

[6]  Lihua Li,et al.  DEEPre: sequence-based enzyme EC number prediction by deep learning , 2017, Bioinform..

[7]  S. Placzek,et al.  The BRENDA enzyme information system-From a database to an expert system. , 2017, Journal of biotechnology.

[8]  Yu Tian,et al.  Bio2Rxn: sequence-based enzymatic reaction predictions by a consensus strategy , 2020, Bioinform..

[9]  Anne Morgat,et al.  Enzyme annotation in UniProtKB using Rhea , 2020, Bioinform..

[10]  Naoki Watanabe,et al.  Exploration and Evaluation of Machine Learning-Based Models for Predicting Enzymatic Reactions , 2020, J. Chem. Inf. Model..

[11]  Andrew G. McDonald,et al.  ExplorEnz: the primary source of the IUBMB enzyme list , 2008, Nucleic Acids Res..

[12]  Tao Jiang,et al.  A maximum common substructure-based algorithm for searching and predicting drug-like compounds , 2008, ISMB.

[13]  Pablo Carbonell,et al.  RetroPath2.0: A retrosynthesis workflow for metabolic engineers. , 2018, Metabolic engineering.

[14]  Oliver Fiehn,et al.  MINEs: open access databases of computationally predicted enzyme promiscuity products for untargeted metabolomics , 2015, Journal of Cheminformatics.

[15]  Pablo Carbonell,et al.  RetroRules: a database of reaction rules for engineering biology , 2018, Nucleic Acids Res..

[16]  María Martín,et al.  UniProt: A hub for protein information , 2015 .

[17]  Alexey G. Murzin,et al.  SCOP2 prototype: a new approach to protein structure mining , 2014, Nucleic Acids Res..

[18]  Yoshihiro Yamanishi,et al.  E-zyme: predicting potential EC numbers from the chemical transformation pattern of substrate-product pairs , 2009, Bioinform..

[19]  Sean R. Eddy,et al.  Accelerated Profile HMM Searches , 2011, PLoS Comput. Biol..

[20]  David A. Lee,et al.  CATH FunFHMMer web server: protein functional annotations using functional family assignments , 2015, Nucleic Acids Res..

[21]  Chunhui Li,et al.  Exploring the diversity of complex metabolic networks , 2005, Bioinform..

[22]  Tsuyoshi Kato,et al.  EzCatDB: the enzyme reaction database, 2015 update , 2014, Nucleic Acids Res..

[23]  Alan Bridge,et al.  New and continuing developments at PROSITE , 2012, Nucleic Acids Res..

[24]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[25]  Yang Zhang,et al.  COFACTOR: improved protein function prediction by combining structure, sequence and protein–protein interaction information , 2017, Nucleic Acids Res..

[26]  Károly Héberger,et al.  Why is Tanimoto index an appropriate choice for fingerprint-based similarity calculations? , 2015, Journal of Cheminformatics.

[27]  V. Hatzimanikatis,et al.  ATLAS of Biochemistry: A Repository of All Possible Biochemical Reactions for Synthetic Biology and Metabolic Engineering Studies. , 2016, ACS synthetic biology.

[28]  Shaozhen Ding,et al.  BCSExplorer: a customized biosynthetic chemical space explorer with multifunctional objective function analysis , 2020, Bioinform..

[29]  Janet M. Thornton,et al.  Mechanism and Catalytic Site Atlas (M-CSA): a database of enzyme reaction mechanisms and active sites , 2017, Nucleic Acids Res..

[30]  Gemma L. Holliday,et al.  EC-BLAST: A Tool to Automatically Search and Compare Enzyme Reactions , 2014, Nature Methods.

[31]  Alexander S. Rose,et al.  NGL Viewer: a web application for molecular visualization , 2015, Nucleic Acids Res..