Lessons from Two Design-Build-Test-Learn Cycles of Dodecanol Production in Escherichia coli Aided by Machine Learning.

The Design-Build-Test-Learn (DBTL) cycle, facilitated by exponentially improving capabilities in synthetic biology, is an increasingly adopted metabolic engineering framework that represents a more systematic and efficient approach to strain development than historical efforts in biofuels and biobased products. Here, we report on implementation of two DBTL cycles to optimize 1-dodecanol production from glucose using 60 engineered Escherichia coli MG1655 strains. The first DBTL cycle employed a simple strategy to learn efficiently from a relatively small number of strains (36), wherein only the choice of ribosome-binding sites and an acyl-ACP/acyl-CoA reductase were modulated in a single pathway operon including genes encoding a thioesterase (UcFatB1), an acyl-ACP/acyl-CoA reductase (Maqu_2507, Maqu_2220, or Acr1), and an acyl-CoA synthetase (FadD). Measured variables included concentrations of dodecanol and all proteins in the engineered pathway. We used the data produced in the first DBTL cycle to train several machine-learning algorithms and to suggest protein profiles for the second DBTL cycle that would increase production. These strategies resulted in a 21% increase in dodecanol titer in Cycle 2 (up to 0.83 g/L, which is more than 6-fold greater than previously reported batch values for minimal medium). Beyond specific lessons learned about optimizing dodecanol titer in E. coli, this study had findings of broader relevance across synthetic biology applications, such as the importance of sequencing checks on plasmids in production strains as well as in cloning strains, and the critical need for more accurate protein expression predictive tools.

[1]  M. Pollard,et al.  A specific acyl-ACP thioesterase implicated in medium-chain fatty acid production in immature cotyledons of Umbellularia californica. , 1991, Archives of biochemistry and biophysics.

[2]  C. Somerville,et al.  Isolation of mutants of Acinetobacter calcoaceticus deficient in wax ester synthesis and complementation of one mutation with a gene encoding a fatty acyl coenzyme A reductase , 1997, Journal of bacteriology.

[3]  Travis E. Oliphant,et al.  Python for Scientific Computing , 2007, Computing in Science & Engineering.

[4]  Carola Engler,et al.  A One Pot, One Step, Precision Cloning Method with High Throughput Capability , 2008, PloS one.

[5]  G. Siuzdak,et al.  Nanostructure-initiator mass spectrometry: a protocol for preparing and applying NIMS surfaces for high-sensitivity mass analysis , 2008, Nature Protocols.

[6]  Rich Caruana,et al.  An empirical evaluation of supervised learning in high dimensions , 2008, ICML '08.

[7]  D. G. Gibson,et al.  Enzymatic assembly of DNA molecules up to several hundred kilobases , 2009, Nature Methods.

[8]  Carola Engler,et al.  Golden Gate Shuffling: A One-Pot DNA Shuffling Method Based on Type IIs Restriction Enzymes , 2009, PloS one.

[9]  Christopher A. Voigt,et al.  Automated Design of Synthetic Ribosome Binding Sites to Precisely Control Protein Expression , 2009, Nature Biotechnology.

[10]  A. Schirmer,et al.  Microbial Biosynthesis of Alkanes , 2010, Science.

[11]  Connor J. Liu,et al.  Isolation and Characterization of Novel , 2010 .

[12]  J. Keasling,et al.  Microbial production of fatty-acid-derived fuels and chemicals from plant biomass , 2010, Nature.

[13]  Qiang Yang,et al.  A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[14]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[15]  R. M. Willis,et al.  Characterization of a fatty acyl-CoA reductase from Marinobacter aquaeolei VT8: a bacterial enzyme catalyzing the reduction of fatty acyl-CoA to fatty alcohol. , 2011, Biochemistry.

[16]  M. Hamberg,et al.  A prokaryotic acyl‐CoA reductase performing reduction of fatty acyl‐CoA to fatty alcohol , 2011, FEBS letters.

[17]  Nathan J Hillson,et al.  j5 DNA assembly design automation software. , 2012, ACS synthetic biology.

[18]  Patrik R. Jones,et al.  Carboxylic acid reductase is a versatile enzyme for the conversion of fatty acids into fuels and chemical commodities , 2012, Proceedings of the National Academy of Sciences.

[19]  Timothy S. Ham,et al.  Design, implementation and practice of JBEI-ICE: an open source biological part registry platform and tools , 2012, Nucleic acids research.

[20]  M. Jewett,et al.  Cell-free synthetic biology: thinking outside the cell. , 2012, Metabolic engineering.

[21]  Nathan J Hillson,et al.  DeviceEditor visual biological CAD canvas , 2012, Journal of Biological Engineering.

[22]  Yanning Zheng,et al.  Optimization of fatty alcohol biosynthesis pathway for selectively enhanced production of C12/14 and C16/18 fatty alcohols in engineered Escherichia coli , 2012, Microbial Cell Factories.

[23]  Xiaoming Tan,et al.  Fatty alcohol production in engineered E. coli expressing Marinobacter fatty acyl-CoA reductases , 2013, Applied Microbiology and Biotechnology.

[24]  Brian F Pfleger,et al.  Production of medium chain length fatty alcohols from glucose in Escherichia coli. , 2013, Metabolic engineering.

[25]  H. Salis,et al.  Translation rate is controlled by coupled trade-offs between site accessibility, selective RNA unfolding and sliding at upstream standby sites , 2013, Nucleic acids research.

[26]  Taichi E. Takasuka,et al.  Rapid kinetic characterization of glycosyl hydrolases based on oxime derivatization and nanostructure-initiator mass spectrometry (NIMS). , 2014, ACS chemical biology.

[27]  Ee-Been Goh,et al.  Substantial improvements in methyl ketone production in E. coli and insights on the pathway from in vitro studies. , 2014, Metabolic engineering.

[28]  Brendan MacLean,et al.  Panorama: A Targeted Proteomics Knowledge Base , 2014, Journal of proteome research.

[29]  Christopher A. Voigt,et al.  Algorithmic co-optimization of genetic constructs and growth conditions: application to 6-ACA, a potential nylon-6 precursor , 2015, Nucleic acids research.

[30]  Jay D Keasling,et al.  Development of an orthogonal fatty acid biosynthesis system in E. coli for oleochemical production. , 2015, Metabolic engineering.

[31]  T. Lee,et al.  Natural products as biofuels and bio-based chemicals: fatty acids and isoprenoids. , 2015, Natural product reports.

[32]  Matthew R. Pocock,et al.  SBOL Visual: A Graphical Language for Genetic Designs , 2015, PLoS biology.

[33]  Paul D. Adams,et al.  Standard Flow Liquid Chromatography for Shotgun Proteomics in Bioenergy Research , 2015, Front. Bioeng. Biotechnol..

[34]  J. Keasling,et al.  Principal component analysis of proteomics (PCAP) as a tool to direct metabolic engineering. , 2015, Metabolic engineering.

[35]  S. Yazdani,et al.  Identification of long chain specific aldehyde reductase and its use in enhanced fatty alcohol production in E. coli. , 2016, Metabolic engineering.

[36]  Randal S. Olson,et al.  Automating Biomedical Data Science Through Tree-Based Pipeline Optimization , 2016, EvoApplications.

[37]  Markus J. Herrgård,et al.  Predictable tuning of protein expression in bacteria , 2016, Nature Methods.

[38]  Nathan J Hillson,et al.  The Experiment Data Depot: A Web-Based Software Tool for Biological Experimental Data Storage, Sharing, and Visualization. , 2017, ACS synthetic biology.

[39]  Christopher P. Long,et al.  Comprehensive analysis of glucose and xylose metabolism in Escherichia coli under aerobic and anaerobic conditions by 13C metabolic flux analysis. , 2017, Metabolic engineering.

[40]  Oliver Rübel,et al.  OpenMSI Arrayed Analysis Toolkit: Analyzing Spatially Defined Samples Using Mass Spectrometry Imaging. , 2017, Analytical chemistry.

[41]  G. Stephanopoulos,et al.  Improving Metabolic Pathway Efficiency by Statistical Model-Based Multivariate Regulatory Metabolic Engineering. , 2017, ACS synthetic biology.

[42]  Jay D. Keasling,et al.  Isolation and characterization of novel mutations in the pSC101 origin that increase copy number , 2018, Scientific Reports.

[43]  J. Keasling,et al.  Improving methyl ketone production in Escherichia coli by heterologous expression of NADH‐dependent FabG , 2018, Biotechnology and bioengineering.

[44]  Carole Goble,et al.  An automated Design-Build-Test-Learn pipeline for enhanced microbial production of fine chemicals , 2018, Communications Biology.

[45]  Neil Swainston,et al.  Machine Learning of Designed Translational Control Allows Predictive Pathway Optimization in Escherichia coli. , 2019, ACS synthetic biology.