Total synthesis of Escherichia coli with a recoded genome

Nature uses 64 codons to encode the synthesis of proteins from the genome, and chooses 1 sense codon—out of up to 6 synonyms—to encode each amino acid. Synonymous codon choice has diverse and important roles, and many synonymous substitutions are detrimental. Here we demonstrate that the number of codons used to encode the canonical amino acids can be reduced, through the genome-wide substitution of target codons by defined synonyms. We create a variant of Escherichia coli with a four-megabase synthetic genome through a high-fidelity convergent total synthesis. Our synthetic genome implements a defined recoding and refactoring scheme—with simple corrections at just seven positions—to replace every known occurrence of two sense codons and a stop codon in the genome. Thus, we recode 18,214 codons to create an organism with a 61-codon genome; this organism uses 59 codons to encode the 20 amino acids, and enables the deletion of a previously essential transfer RNA.High-fidelity convergent total synthesis is used to produce Escherichia coli with a 61-codon synthetic genome that uses 59 codons to encode all of the canonical amino acids.

[1]  Julius Fredens,et al.  Defining synonymous codon compression schemes by genome recoding , 2016, Nature.

[2]  Iain G. Johnston,et al.  The Essential Genome of Escherichia coli K-12 , 2017, mBio.

[3]  Jianhui Gong,et al.  Deep functional analysis of synII, a 770-kilobase synthetic yeast chromosome , 2017, Science.

[4]  J. Lederberg,et al.  Gene Recombination in Escherichia Coli , 1946, Nature.

[5]  Farren J. Isaacs,et al.  Precise manipulation of bacterial chromosomes by conjugative assembly genome engineering , 2014, Nature Protocols.

[6]  Yizhi Cai,et al.  Design of a synthetic yeast genome , 2017, Science.

[7]  Jianhui Gong,et al.  Engineering the ribosomal DNA in a megabase synthetic chromosome , 2017, Science.

[8]  M Yarus,et al.  Rates of aminoacyl-tRNA selection at 29 sense codons in vivo. , 1989, Journal of molecular biology.

[9]  Thomas H Segall-Shapiro,et al.  Creation of a Bacterial Cell Controlled by a Chemically Synthesized Genome , 2010, Science.

[10]  F. Neidhardt,et al.  Escherichia Coli and Salmonella: Typhimurium Cellular and Molecular Biology , 1987 .

[11]  Peter G. Schultz,et al.  Genomically Recoded Organisms Expand Biological Functions , 2013, Science.

[12]  S. Brenner,et al.  General Nature of the Genetic Code for Proteins , 1961, Nature.

[13]  Joshua B. Plotkin,et al.  Codon usage influences fitness through RNA toxicity , 2018, Proceedings of the National Academy of Sciences.

[14]  Yan Wang,et al.  “Perfect” designer chromosome V and behavior of a ring derivative , 2017, Science.

[15]  Zoya Ignatova,et al.  Transient ribosomal attenuation coordinates protein synthesis and co-translational folding , 2009, Nature Structural &Molecular Biology.

[16]  Helga Thorvaldsdóttir,et al.  Integrative Genomics Viewer (IGV): high-performance genomics data visualization and exploration , 2012, Briefings Bioinform..

[17]  Timothy B. Stockwell,et al.  Complete Chemical Synthesis, Assembly, and Cloning of a Mycoplasma genitalium Genome , 2008, Science.

[18]  Sangya Pundir,et al.  UniProt Protein Knowledgebase. , 2017, Methods in molecular biology.

[19]  M. Mann,et al.  MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification , 2008, Nature Biotechnology.

[20]  Gene-Wei Li,et al.  The anti-Shine-Dalgarno sequence drives translational pausing and codon choice in bacteria , 2012, Nature.

[21]  M. Mann,et al.  Andromeda: a peptide search engine integrated into the MaxQuant environment. , 2011, Journal of proteome research.

[22]  J. Belasco,et al.  RNase E autoregulates its synthesis by controlling the degradation rate of its own mRNA in Escherichia coli: unusual sensitivity of the rne transcript to RNase E activity. , 1995, Genes & development.

[23]  Steven L Salzberg,et al.  Fast gapped-read alignment with Bowtie 2 , 2012, Nature Methods.

[24]  M. Sørensen,et al.  Absolute in vivo translation rates of individual codons in Escherichia coli. The two glutamic acid codons GAA and GAG are translated with a threefold difference in rate. , 1991, Journal of molecular biology.

[25]  J. Chin,et al.  Tagging and Enriching Proteins Enables Cell-Specific Proteomics , 2016, Cell chemical biology.

[26]  D. Endy,et al.  Refactoring bacteriophage T7 , 2005, Molecular systems biology.

[27]  George M. Church,et al.  Design, synthesis, and testing toward a 57-codon genome , 2016, Science.

[28]  J. Belasco,et al.  An evolutionarily conserved RNA stem-loop functions as a sensor that directs feedback regulation of RNase E gene expression. , 2000, Genes & development.

[29]  Pamela A. Silver,et al.  Large-scale recoding of a bacterial genome by iterative recombineering of synthetic DNA , 2017, Nucleic acids research.

[30]  V. Noskov,et al.  Exploring transformation-associated recombination cloning for selective isolation of genomic regions. , 2004, Methods in molecular biology.

[31]  Farren J. Isaacs,et al.  Precise Manipulation of Chromosomes in Vivo Enables Genome-Wide Codon Replacement , 2011, Science.

[32]  Atsushi Yamaguchi,et al.  Reassignment of a rare sense codon to a non-canonical amino acid in Escherichia coli , 2015, Nucleic acids research.

[33]  Feng Gao,et al.  Bug mapping and fitness testing of chemically synthesized chromosome X , 2017, Science.

[34]  A. Yamaguchi,et al.  Highly reproductive Escherichia coli cells with no specific assignment to the UAG codon , 2015, Scientific Reports.

[35]  Dieter Söll,et al.  Emergent rules for codon choice elucidated by editing rare arginine codons in Escherichia coli , 2016, Proceedings of the National Academy of Sciences.

[36]  Joel S. Bader,et al.  Synthetic chromosome arms function in yeast and generate phenotypic diversity by design , 2011, Nature.

[37]  F. Claverie-Martin,et al.  Analysis of the altered mRNA stability (ams) gene from Escherichia coli. Nucleotide sequence, transcriptional analysis, and homology of its product to MRP3, a mitochondrial ribosomal protein from Neurospora crassa. , 1991, The Journal of biological chemistry.

[38]  E. Corey,et al.  The Logic of Chemical Synthesis , 1989 .

[39]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[40]  Judy Qiu,et al.  Total Synthesis of a Functional Designer Eukaryotic Chromosome , 2014, Science.

[41]  Marco Tripodi,et al.  Labeling and identifying cell-specific proteomes in the mouse brain , 2017, Nature Biotechnology.

[42]  Karsten Zengler,et al.  The transcription unit architecture of the Escherichia coli genome , 2009, Nature Biotechnology.

[43]  F. Blattner,et al.  Emergent Properties of Reduced-Genome Escherichia coli , 2006, Science.

[44]  H. K. Dai,et al.  Synthesis, debugging, and effects of synthetic chromosome consolidation: synVI and beyond , 2017, Science.

[45]  D. Söll,et al.  Codon Bias as a Means to Fine-Tune Gene Expression. , 2015, Molecular cell.

[46]  J. Belasco,et al.  RNase E autoregulates its synthesis in Escherichia coli by binding directly to a stem‐loop in the rne 5′ untranslated region , 2009, Molecular microbiology.

[47]  D. G. Gibson,et al.  Design and synthesis of a minimal bacterial genome , 2016, Science.

[48]  Adam Paul Arkin,et al.  Evaluation of 244,000 synthetic sequences reveals design principles to optimize translation in Escherichia coli , 2018, Nature Biotechnology.

[49]  Shigeyuki Yokoyama,et al.  Codon reassignment in the Escherichia coli genetic code , 2010, Nucleic acids research.

[50]  David Tollervey,et al.  Coding-Sequence Determinants of Gene Expression in Escherichia coli , 2009, Science.

[51]  Svein Valla,et al.  A New and Improved Host-Independent Plasmid System for RK2-Based Conjugal Transfer , 2014, PloS one.

[52]  Jason W. Chin,et al.  Expanding and reprogramming the genetic code , 2017, Nature.

[53]  Lippincott-Schwartz,et al.  Supporting Online Material Materials and Methods Som Text Figs. S1 to S8 Table S1 Movies S1 to S3 a " Silent " Polymorphism in the Mdr1 Gene Changes Substrate Specificity Corrected 30 November 2007; See Last Page , 2022 .

[54]  J. Chin,et al.  Proteome labeling and protein identification in specific tissues and at specific developmental stages in an animal , 2014, Nature Biotechnology.

[55]  I. Matsumura,et al.  Rational Design of a Plasmid Origin That Replicates Efficiently in Both Gram-Positive and Gram-Negative Bacteria , 2010, PloS one.