Faculty Opinions recommendation of Multiplexed gene synthesis in emulsions for exploring protein functional landscapes.

Improving our ability to construct and functionally characterize DNA sequences would broadly accelerate progress in biology. Here, we introduce DropSynth, a scalable, low-cost method to build thousands of defined gene-length constructs in a pooled (multiplexed) manner. DropSynth uses a library of barcoded beads that pull down the oligonucleotides necessary for a gene’s assembly, which are then processed and assembled in water-in-oil emulsions. We used DropSynth to build >7,000 synthetic genes that encode phylogenetically diverse homologs of two essential genes in E. coli. We then tested, in multiplex, the ability of phosphopantetheine adenylyltransferase homologs to complement a knockout E. coli strain, revealing core functional motifs and reasons underlying homolog incompatibility. DropSynth coupled with multiplexed functional assays allows us to rationally explore sequence-function relationships at unprecedented scale.

One Sentence Summary: A gene synthesis method, DropSynth, allows for the synthesis and characterization of thousands of genes in a pooled format.
