Sequence Specific Modeling of E. coli Cell-Free Protein Synthesis.

Cell-free protein synthesis (CFPS) is a widely used research tool in systems and synthetic biology. However, if CFPS is to become a mainstream technology for applications such as point of care manufacturing, we must understand the performance limits and costs of these systems. Toward this question, we used sequence specific constraint based modeling to evaluate the performance of E. coli cell-free protein synthesis. A core E. coli metabolic network, describing glycolysis, the pentose phosphate pathway, energy metabolism, amino acid biosynthesis, and degradation was augmented with sequence specific descriptions of transcription and translation and effective models of promoter function. Model parameters were largely taken from literature; thus the constraint based approach coupled the transcription and translation of the protein product, and the regulation of gene expression, with the availability of metabolic resources using only a limited number of adjustable model parameters. We tested this approach by simulating the expression of two model proteins: chloramphenicol acetyltransferase and dual emission green fluorescent protein, for which we have data sets; we then expanded the simulations to a range of additional proteins. Protein expression simulations were consistent with measurements for a variety of cases. The constraint based simulations confirmed that oxidative phosphorylation was active in the CAT cell-free extract, as without it there was no feasible solution within the experimental constraints of the system. We then compared the metabolism of theoretically optimal and experimentally constrained CFPS reactions, and developed parameter free correlations which could be used to estimate productivity as a function of carbon number and promoter type. Lastly, global sensitivity analysis identified the key metabolic processes that controlled CFPS productivity and energy efficiency. In summary, sequence specific constraint based modeling of CFPS offered a novel means to a priori estimate the performance of a cell-free system, using only a limited number of adjustable parameters. While we modeled the production of a single protein in this study, the approach could easily be extended to multiprotein synthetic circuits, RNA circuits, or the cell-free production of small molecule products.

[1]  M. Bagdasarian Gene cloning and expression , 1994 .

[2]  Paola Annoni,et al.  Variance based sensitivity analysis of model output. Design and estimator for the total sensitivity index , 2010, Comput. Phys. Commun..

[3]  Terence Hwa,et al.  Transcriptional regulation by the numbers: models. , 2005, Current opinion in genetics & development.

[4]  Bernhard O. Palsson,et al.  Metabolic flux balance analysis and the in silico analysis of Escherichia coli K-12 gene deletions , 2000, BMC Bioinformatics.

[5]  I. Grossmann,et al.  Recursive MILP model for finding all the alternate optima in LP models for metabolic networks , 2000 .

[6]  M. Jewett,et al.  Cell-free synthetic biology: thinking outside the cell. , 2012, Metabolic engineering.

[7]  B. Palsson,et al.  Theory for the systemic definition of metabolic pathways and their use in interpreting metabolic function from a pathway-oriented perspective. , 2000, Journal of theoretical biology.

[8]  I. Sobol Global sensitivity indices for nonlinear mathematical models and their Monte Carlo estimates , 2001 .

[9]  B. Palsson,et al.  Constraining the metabolic genotype–phenotype relationship using a phylogeny of in silico methods , 2012, Nature Reviews Microbiology.

[10]  B. Palsson,et al.  Genome-scale Reconstruction of Metabolic Network in Bacillus subtilis Based on High-throughput Phenotyping and Gene Essentiality Data* , 2007, Journal of Biological Chemistry.

[11]  R. Mahadevan,et al.  The effects of alternate optimal solutions in constraint-based genome-scale metabolic models. , 2003, Metabolic engineering.

[12]  Edward J. O'Brien,et al.  Genome-scale models of metabolism and gene expression extend and refine growth phenotype prediction , 2013, Molecular systems biology.

[13]  Adam M. Feist,et al.  A genome-scale metabolic reconstruction for Escherichia coli K-12 MG1655 that accounts for 1260 ORFs and thermodynamic information , 2007, Molecular systems biology.

[14]  Joseph D Puglisi,et al.  Quantitative polysome analysis identifies limitations in bacterial cell-free protein synthesis. , 2005, Biotechnology and bioengineering.

[15]  U. Sauer,et al.  Systematic evaluation of objective functions for predicting intracellular fluxes in Escherichia coli , 2007, Molecular systems biology.

[16]  James R. Swartz,et al.  Transforming biochemical engineering with cell-free biology , 2012 .

[17]  J. H. Matthaei,et al.  Characteristics and stabilization of DNAase-sensitive protein synthesis in E. coli extracts. , 1961, Proceedings of the National Academy of Sciences of the United States of America.

[18]  Dong-Myung Kim,et al.  Efficient production of a bioactive, multiple disulfide‐bonded protein using modified extracts of Escherichia coli , 2004, Biotechnology and bioengineering.

[19]  Markus J. Herrgård,et al.  Integrating high-throughput and computational data elucidates bacterial networks , 2004, Nature.

[20]  Stefan Klumpp,et al.  A Model for Sigma Factor Competition in Bacterial Cells , 2014, PLoS Comput. Biol..

[21]  F. De Filippis,et al.  A Selected Core Microbiome Drives the Early Stages of Three Popular Italian Cheese Manufactures , 2014, PloS one.

[22]  James J. Collins,et al.  Portable, On-Demand Biomolecular Manufacturing , 2016, Cell.

[23]  George M. Church,et al.  Improved Cell-Free RNA and Protein Synthesis System , 2014, PloS one.

[24]  Jeffrey D. Orth,et al.  In silico method for modelling metabolism and gene product expression at genome scale , 2012, Nature Communications.

[25]  Dong-Myung Kim,et al.  Regeneration of adenosine triphosphate from glycolytic intermediates for cell-free protein synthesis. , 2001, Biotechnology and bioengineering.

[26]  A. Spirin,et al.  A continuous cell-free translation system capable of producing polypeptides in high yield. , 1988, Science.

[27]  M. Jewett,et al.  Mimicking the Escherichia coli cytoplasmic environment activates long‐lived and efficient cell‐free protein synthesis , 2004, Biotechnology and bioengineering.

[28]  Vincent Noireaux,et al.  The All E. coli TX-TL Toolbox 2.0: A Platform for Cell-Free Synthetic Biology. , 2016, ACS synthetic biology.

[29]  B. Palsson,et al.  Stoichiometric flux balance models quantitatively predict growth and metabolic by-product secretion in wild-type Escherichia coli W3110 , 1994, Applied and environmental microbiology.

[30]  M. Jewett,et al.  Substrate replenishment extends protein synthesis with an in vitro translation system designed to mimic the cytoplasm , 2004, Biotechnology and bioengineering.

[31]  Bonnie E. Shook-Sa,et al.  . CC-BY-NC-ND 4 . 0 International licenseIt is made available under a is the author / funder , who has granted medRxiv a license to display the preprint in perpetuity , 2021 .

[32]  Roger L. Chang,et al.  Structural Systems Biology Evaluation of Metabolic Thermotolerance in Escherichia coli , 2013, Science.

[33]  Ines Thiele,et al.  Three-Dimensional Structural View of the Central Metabolic Network of Thermotoga maritima , 2009, Science.

[34]  Adam M. Feist,et al.  Reconstruction of biochemical networks in microorganisms , 2009, Nature Reviews Microbiology.

[35]  D. Fell,et al.  A general definition of metabolic pathways useful for systematic organization and analysis of complex metabolic networks , 2000, Nature Biotechnology.

[36]  B. Palsson,et al.  The Escherichia coli MG1655 in silico metabolic genotype: its definition, characteristics, and capabilities. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[37]  W. Wiechert 13C metabolic flux analysis. , 2001, Metabolic engineering.

[38]  T. Kigawa,et al.  Cell-free synthesis and amino acid-selective stable isotope labeling of proteins for NMR analysis , 1995, Journal of biomolecular NMR.

[39]  Bernhard Ø Palsson,et al.  Sequence-based analysis of metabolic demands for protein synthesis in prokaryotes. , 2003, Journal of theoretical biology.

[40]  Christopher A. Voigt,et al.  Genetic programs constructed from layered logic gates in single cells , 2012, Nature.

[41]  J E Bailey,et al.  Genetically structured models forlac promoter–operator function in the Escherichia coli chromosome and in multicopy plasmids: Lac operator function , 1984, Biotechnology and bioengineering.

[42]  Kentaro Shimizu,et al.  Mechanism of the difference in the binding affinity of E. coli tRNAGln to glutaminyl-tRNA synthetase caused by noninterface nucleotides in variable loop. , 2007, Biophysical journal.

[43]  Joshua J. Hamilton,et al.  Quantitative assessment of thermodynamic constraints on the solution space of genome-scale metabolic models. , 2013, Biophysical journal.

[44]  J. Swartz,et al.  An Economical Method for Cell‐Free Protein Synthesis using Glucose and Nucleoside Monophosphates , 2008, Biotechnology progress.

[45]  Julius B. Lucks,et al.  Generating effective models and parameters for RNA genetic circuits , 2015, bioRxiv.

[46]  Marshall W. Nirenberg,et al.  The dependence of cell-free protein synthesis in E. coli upon naturally occurring or synthetic polyribonucleotides , 1961, Proceedings of the National Academy of Sciences.

[47]  James R. Swartz,et al.  Production and stabilization of the trimeric influenza hemagglutinin stem domain for potentially broadly protective influenza vaccines , 2013, Proceedings of the National Academy of Sciences.

[48]  Michael C Jewett,et al.  An integrated cell-free metabolic platform for protein production and synthetic biology , 2008, Molecular systems biology.

[49]  C. Sánchez,et al.  Flux balance analysis in the production of clavulanic acid by Streptomyces clavuligerus , 2015, Biotechnology progress.

[50]  U. Sauer,et al.  13C-based metabolic flux analysis , 2009, Nature Protocols.