Bio2Rxn: sequence-based enzymatic reaction predictions by a consensus strategy

SUMMARY The development of sequencing technologies has generated large amounts of protein sequence data. The automated prediction of the enzymatic reactions of uncharacterized proteins is a major challenge in the field of bioinformatics. Here, we present Bio2Rxn as a web-based tool to provide putative enzymatic reaction predictions for uncharacterized protein sequences. Bio2Rxn adopts a consensus strategy by incorporating six types of enzyme prediction tools. It allows for the efficient integration of these computational resources to maximize the accuracy and comprehensiveness of enzymatic reaction predictions, which facilitates the characterization of the functional roles of target proteins in metabolism. Bio2Rxn further links the enzyme function prediction with more than 300,000 enzymatic reactions, which were manually curated by more than 100 people over the past 9 years from more than 580,000 publications. AVAILABILITY Bio2Rxn is available at: http://design.rxnfinder.org/bio2rxn/. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.

[1]  Jae Yong Ryu,et al.  Deep learning enables high-quality and high-throughput prediction of enzyme commission numbers , 2019, Proceedings of the National Academy of Sciences.

[2]  Dong-Sheng Cao,et al.  RxnFinder: biochemical reaction search engines using molecular structures, molecular fragments and reaction similarity , 2011, Bioinform..

[3]  Yu Tian,et al.  Data-driven rational biosynthesis design: from molecules to cell factories , 2020, Briefings Bioinform..

[4]  Maria Jesus Martin,et al.  ECPred: a tool for the prediction of the enzymatic functions of protein sequences based on the EC nomenclature , 2018, BMC Bioinformatics.

[5]  Yang Zhang,et al.  COFACTOR: improved protein function prediction by combining structure, sequence and protein–protein interaction information , 2017, Nucleic Acids Res..

[6]  Jeffrey Skolnick,et al.  EFICAz2.5: application of a high-precision enzyme function predictor to 396 proteomes , 2012, Bioinform..

[7]  Shaozhen Ding,et al.  BCSExplorer: a customized biosynthetic chemical space explorer with multifunctional objective function analysis , 2020, Bioinform..

[8]  Lihua Li,et al.  DEEPre: sequence-based enzyme EC number prediction by deep learning , 2017, Bioinform..

[9]  Ren Long,et al.  Identification of Multi-Functional Enzyme with Multi-Label Classifier , 2016, PloS one.

[10]  Dietmar Schomburg,et al.  BrEPS 2.0: Optimization of sequence pattern prediction for enzyme annotation , 2017, PloS one.

[11]  Chetan Kumar,et al.  A top-down approach to classify enzyme functional classes and sub-classes using random forest , 2012, EURASIP J. Bioinform. Syst. Biol..

[12]  Jaques Reifman,et al.  Genome‐wide enzyme annotation with precision control: Catalytic families (CatFam) databases , 2009, Proteins.

[13]  Volker F. Wendisch,et al.  Propionate oxidation in Escherichia coli: evidence for operation of a methylcitrate cycle in bacteria , 1997, Archives of Microbiology.

[14]  John Parkinson,et al.  Improved enzyme annotation with EC-specific cutoffs using DETECT v2 , 2018, Bioinform..

[15]  A. Hochman,et al.  Purification and characterization of a catalase-peroxidase and a typical catalase from the bacterium Klebsiella pneumoniae. , 1991, Biochimica et biophysica acta.

[16]  Yu Tian,et al.  PrecursorFinder: a customized biosynthetic precursor explorer , 2019, Bioinform..