Selenzyme: enzyme selection tool for pathway design

Summary Synthetic biology applies the principles of engineering to biology in order to create biological functionalities not seen before in nature. One of the most exciting applications of synthetic biology is the design of new organisms with the ability to produce valuable chemicals including pharmaceuticals and biomaterials in a greener; sustainable fashion. Selecting the right enzymes to catalyze each reaction step in order to produce a desired target compound is, however, not trivial. Here, we present Selenzyme, a free online enzyme selection tool for metabolic pathway design. The user is guided through several decision steps in order to shortlist the best candidates for a given pathway step. The tool graphically presents key information about enzymes based on existing databases and tools such as: similarity of sequences and of catalyzed reactions; phylogenetic distance between source organism and intended host species; multiple alignment highlighting conserved regions, predicted catalytic site, and active regions and relevant properties such as predicted solubility and transmembrane regions. Selenzyme provides bespoke sequence selection for automated workflows in biofoundries. Availability and implementation The tool is integrated as part of the pathway design stage into the design-build-test-learn SYNBIOCHEM pipeline. The Selenzyme web server is available at http://selenzyme.synbiochem.co.uk. Supplementary information Supplementary data are available at Bioinformatics online.

[1]  Xin Gao,et al.  MRE: a web tool to suggest foreign enzymes for the biosynthesis pathway design with competing endogenous reactions in mind , 2016, Nucleic Acids Res..

[2]  Susumu Goto,et al.  PathPred: an enzyme-catalyzed metabolic pathway prediction server , 2010, Nucleic Acids Res..

[3]  Kai Blin,et al.  antiSMASH 3.0—a comprehensive resource for the genome mining of biosynthetic gene clusters , 2015, Nucleic Acids Res..

[4]  Carsten Kemena,et al.  Using the T-Coffee package to build multiple sequence alignments of protein, RNA, DNA sequences and 3D structures , 2011, Nature Protocols.

[5]  Gemma L. Holliday,et al.  EC-BLAST: A Tool to Automatically Search and Compare Enzyme Reactions , 2014, Nature Methods.

[6]  Rainer Breitling,et al.  Bioinformatics for the synthetic biology of natural products: integrating across the Design–Build–Test cycle , 2016, Natural product reports.

[7]  Douglas B. Kell,et al.  PartsGenie: an integrated tool for optimising and sharing synthetic biology parts , 2017 .

[8]  Pablo Carbonell,et al.  A retrosynthetic biology approach to metabolic pathway design for therapeutic production , 2011, BMC Systems Biology.

[9]  D. Kell Systems biology, metabolic modelling and metabolomics in drug discovery and development. , 2006, Drug discovery today.

[10]  Douglas B. Kell,et al.  Software review: the KNIME workflow environment and its applications in genetic programming and machine learning , 2015, Genetic Programming and Evolvable Machines.

[11]  Alain Viari,et al.  The CanOE Strategy: Integrating Genomic and Metabolic Contexts across Multiple Prokaryote Genomes to Find Candidate Genes for Orphan Enzymes , 2012, PLoS Comput. Biol..

[12]  V. Hatzimanikatis,et al.  ATLAS of Biochemistry: A Repository of All Possible Biochemical Reactions for Synthetic Biology and Metabolic Engineering Studies. , 2016, ACS synthetic biology.

[13]  Pablo Carbonell,et al.  Retropath: automated pipeline for embedded metabolic circuits. , 2014, ACS synthetic biology.

[14]  Pablo Carbonell,et al.  Semisupervised Gaussian Process for Automated Enzyme Search. , 2016, ACS synthetic biology.

[15]  Pablo Carbonell,et al.  RetroPath2.0: a retrosynthesis workflow for metabolic engineers , 2017, bioRxiv.

[16]  Sophia Ananiadou,et al.  biochem4j: Integrated and extensible biochemical knowledge through graph databases , 2017, PloS one.

[17]  Burkhard Rost,et al.  MSAViewer: interactive JavaScript visualization of multiple sequence alignments , 2016, Bioinform..

[18]  Michael R Berthold,et al.  KNIME for reproducible cross-domain analysis of life science data. , 2017, Journal of biotechnology.