Verification of genetic engineering in yeasts with nanopore whole genome sequencing

Yeast genomes can be assembled from sequencing data, but genome integrations and episomal plasmids often fail to be resolved with accuracy, completeness, and contiguity. Resolution of these features is critical for many synthetic biology applications, including strain quality control and identifying engineering in unknown samples. Here, we report an integrated workflow, named Prymetime, that uses sequencing reads from inexpensive NGS platforms, assembly and error correction software, and a list of synthetic biology parts to achieve accurate whole genome sequences of yeasts with engineering annotated. To build the workflow, we first determined which sequencing methods and software packages returned an accurate, complete, and contiguous genome of an engineered S. cerevisiae strain with two similar plasmids and an integrated pathway. We then developed a sequence feature annotation step that labels synthetic biology parts from a standard list of yeast engineering sequences or from a custom sequence list. We validated the workflow by sequencing a collection of 15 engineered yeasts built from different parent S. cerevisiae and nonconventional yeast strains. We show that each integrated pathway and episomal plasmid can be correctly assembled and annotated, even in strains that have part repeats and multiple similar plasmids. Interestingly, Prymetime was able to identify deletions and unintended integrations that were subsequently confirmed by other methods. Furthermore, the whole genomes are accurate, complete, and contiguous. To illustrate this clearly, we used a publicly available S. cerevisiae CEN.PK113 reference genome and the accompanying reads to show that a Prymetime genome assembly is equivalent to the reference using several standard metrics. Finally, we used Prymetime to resequence the nonconventional yeasts Y. lipolytica Po1f and K. phaffii CBS 7435, producing an improved genome assembly for each strain. Thus, our workflow can achieve accurate, complete, and contiguous whole genome sequences of yeast strains before and after engineering. Therefore, Prymetime enables NGS-based strain quality control through assembly and identification of engineering features.

[1]  Matthew Deaner,et al.  Recent advancements in fungal-derived fuel and chemical production and commercialization. , 2019, Current opinion in biotechnology.

[2]  E. Birney,et al.  Velvet: algorithms for de novo short read assembly using de Bruijn graphs. , 2008, Genome research.

[3]  Guangbo Liu,et al.  High-throughput transformation of Saccharomyces cerevisiae using liquid handling robots , 2017, PloS one.

[4]  Jens Nielsen,et al.  De novo sequencing, assembly and analysis of the genome of the laboratory strain Saccharomyces cerevisiae CEN.PK113-7D, a model for modern industrial biotechnology , 2012, Microbial Cell Factories.

[5]  Sergey Koren,et al.  Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation , 2016, bioRxiv.

[6]  William C. Deloache,et al.  A Highly Characterized Yeast Toolkit for Modular, Multipart Assembly. , 2015, ACS synthetic biology.

[7]  F. de la Cruz,et al.  Plasmid Diversity and Adaptation Analyzed by Massive Sequencing of Escherichia coli Plasmids , 2014, Microbiology spectrum.

[8]  J. Keasling,et al.  Microbial engineering for the production of advanced biofuels , 2012, Nature.

[9]  Eric M. Young,et al.  Synthetic biology for bio-derived structural materials , 2019, Current Opinion in Chemical Engineering.

[10]  Hal S Alper,et al.  Synthetic Biology Expands the Industrial Potential of Yarrowia lipolytica. , 2018, Trends in biotechnology.

[11]  David Hernández,et al.  De novo bacterial genome sequencing: millions of very short reads assembled on a desktop computer. , 2008, Genome research.

[12]  Sergio Arredondo-Alonso,et al.  On the (im)possibility of reconstructing plasmids from whole-genome short-read sequencing data , 2017, Microbial genomics.

[13]  Jacob Beal,et al.  Technological challenges and milestones for writing genomes , 2019, Science.

[14]  Jef D. Boeke,et al.  Karyotype engineering by chromosome fusion leads to reproductive isolation in yeast , 2018, Nature.

[15]  Martin Dragosits,et al.  Adaptive laboratory evolution – principles and applications for biotechnology , 2013, Microbial Cell Factories.

[16]  Johannes A. Roubos,et al.  CRISPR/Cpf1 enables fast and simple genome editing of Saccharomyces cerevisiae , 2017, Yeast.

[17]  Rui Gan,et al.  A Pressure Test to Make 10 Molecules in 90 Days: External Evaluation of Methods to Engineer Biology. , 2018, Journal of the American Chemical Society.

[18]  Huimin Zhao,et al.  Automated multiplex genome-scale engineering in yeast , 2017, Nature Communications.

[19]  E. Boersma,et al.  Prevention of Catheter-Related Bacteremia with a Daily Ethanol Lock in Patients with Tunnelled Catheters: A Randomized, Placebo-Controlled Trial , 2010, PloS one.

[20]  Sergey Koren,et al.  Aggressive assembly of pyrosequencing reads with mates , 2008, Bioinform..

[21]  Claude Thermes,et al.  The Third Revolution in Sequencing Technology. , 2018, Trends in genetics : TIG.

[22]  N. Perna,et al.  progressiveMauve: Multiple Genome Alignment with Gene Gain, Loss and Rearrangement , 2010, PloS one.

[23]  Heng Li,et al.  Minimap and miniasm: fast mapping and de novo assembly for noisy long sequences , 2015, Bioinform..

[24]  Alexander Goesmann,et al.  High-quality genome sequence of Pichia pastoris CBS7435. , 2011, Journal of biotechnology.

[25]  K. Thorn,et al.  Optimized cassettes for fluorescent protein tagging in Saccharomyces cerevisiae , 2004, Yeast.

[26]  J. Christopher Love,et al.  Comparative genomics and transcriptomics of Pichia pastoris , 2016, BMC Genomics.

[27]  R. Müller,et al.  Yeast vectors for the controlled expression of heterologous proteins in different genetic backgrounds. , 1995, Gene.

[28]  Brigitte Gasser,et al.  Curation of the genome annotation of Pichia pastoris (Komagataella phaffii) CBS7435 from gene level to protein function. , 2016, FEMS yeast research.

[29]  Jean-Marc Daran,et al.  FnCpf1: a novel and efficient genome editing tool for Saccharomyces cerevisiae , 2017, Nucleic acids research.

[30]  Matthias G. Steiger,et al.  Metabolic engineering of Pichia pastoris. , 2018, Metabolic engineering.

[31]  Niranjan Nagarajan,et al.  Fast and accurate de novo genome assembly from long uncorrected reads. , 2017, Genome research.

[32]  Jing Wang,et al.  High-Level Production of Beta-Carotene in Saccharomyces cerevisiae by Successive Transformation with Carotenogenic Genes from Xanthophyllomyces dendrorhous , 2007, Applied and Environmental Microbiology.

[33]  T. Ellis,et al.  Improved betulinic acid biosynthesis using synthetic yeast chromosome recombination and semi-automated rapid LC-MS screening , 2020, Nature Communications.

[34]  Jay D Keasling,et al.  Industrial brewing yeast engineered for the production of primary flavor determinants in hopped beer , 2018, Nature Communications.

[35]  Jules Beekwilder,et al.  Capturing of the monoterpene olefin limonene produced in Saccharomyces cerevisiae , 2014, Yeast.

[36]  J. Landolin,et al.  Assembling large genomes with single-molecule sequencing and locality-sensitive hashing , 2014, Nature Biotechnology.

[37]  Michael Roberts,et al.  The MaSuRCA genome assembler , 2013, Bioinform..

[38]  Merja Penttilä,et al.  Yeast oligo-mediated genome engineering (YOGE). , 2013, ACS synthetic biology.

[39]  Erchin Serpedin,et al.  Review of General Algorithmic Features for Genome Assemblers for Next Generation Sequencers , 2012, Genom. Proteom. Bioinform..

[40]  Vladimir Jiranek,et al.  Genome Sequence of Australian Indigenous Wine Yeast Torulaspora delbrueckii COFT1 Using Nanopore Sequencing , 2018, Genome Announcements.

[41]  Timothy K Lu,et al.  Synthetic biology and microbioreactor platforms for programmable production of biologics at the point-of-care , 2016, Nature Communications.

[42]  Dmitry Antipov,et al.  hybridSPAdes: an algorithm for hybrid assembly of short and long reads , 2016, Bioinform..

[43]  Chad A. Cowan,et al.  Low incidence of off-target mutations in individual CRISPR-Cas9 and TALEN targeted human stem cell clones detected by whole-genome sequencing. , 2014, Cell stem cell.

[44]  J. Hegemann,et al.  Delete and repeat: a comprehensive toolkit for sequential gene knockout in the budding yeast Saccharomyces cerevisiae. , 2011, Methods in molecular biology.

[45]  R. Schiestl,et al.  High-efficiency yeast transformation using the LiAc/SS carrier DNA/PEG method , 2007, Nature Protocols.

[46]  Christopher A. Voigt,et al.  Formation of Nitrogenase NifDK Tetramers in the Mitochondria of Saccharomyces cerevisiae , 2017, ACS synthetic biology.

[47]  E. Kerkhoven,et al.  Barriers and opportunities in bio-based production of hydrocarbons , 2018, Nature Energy.

[48]  Yu Lin,et al.  Assembly of long, error-prone reads using repeat graphs , 2018, Nature Biotechnology.

[49]  C. Smolke,et al.  Complete biosynthesis of opioids in yeast , 2015, Science.

[50]  Lei Zhang,et al.  Microbial synthesis of propane by engineering valine pathway and aldehyde-deformylating oxygenase , 2016, Biotechnology for Biofuels.

[51]  Irene M Ong,et al.  Genome Sequence and Analysis of a Stress-Tolerant, Wild-Derived Strain of Saccharomyces cerevisiae Used in Biofuels Research , 2016, G3: Genes, Genomes, Genetics.

[52]  Leszek P. Pryszcz,et al.  Draft Genome Sequences of the Highly Halotolerant Strain Zygosaccharomyces rouxii ATCC 42981 and the Novel Allodiploid Strain Zygosaccharomyces sapae ATB301T Obtained Using the MinION Platform , 2018, Microbiology Resource Announcements.

[53]  Andrea Porzel,et al.  Elucidation of the biosynthesis of carnosic acid and its reconstitution in yeast , 2016, Nature Communications.

[54]  P. Puccetti,et al.  Phagocytic killing of Candida albicans by different murine effector cells. , 1983, Sabouraudia.

[55]  S. Salzberg,et al.  Versatile and open software for comparing large genomes , 2004, Genome Biology.

[56]  Brigitte Gasser,et al.  A yeast for all seasons - Is Pichia pastoris a suitable chassis organism for future bioproduction? , 2018, FEMS microbiology letters.

[57]  Alison L. Van Eenennaam,et al.  Genomic and phenotypic analyses of six offspring of a genome-edited hornless bull , 2019, Nature Biotechnology.

[58]  Adam M. Feist,et al.  Laboratory evolution reveals regulatory and metabolic trade-offs of glycerol utilization in Saccharomyces cerevisiae. , 2018, Metabolic engineering.

[59]  Bernat Gel,et al.  karyoploteR: an R/Bioconductor package to plot customizable genomes displaying arbitrary data , 2017, bioRxiv.

[60]  J. Keasling,et al.  Semi-synthetic artemisinin: a model for the use of synthetic biology in pharmaceutical development , 2014, Nature Reviews Microbiology.

[61]  Francesco Vezzi,et al.  De novo assembly of Dekkera bruxellensis: a multi technology approach using short and long-read sequencing and optical mapping , 2015, GigaScience.

[62]  Zengyi Shao,et al.  DNA assembler, an in vivo genetic method for rapid construction of biochemical pathways , 2008, Nucleic acids research.

[63]  Mick Watson,et al.  Errors in long-read assemblies can critically affect protein prediction , 2019, Nature Biotechnology.

[64]  B A Blount,et al.  Rapid host strain improvement by in vivo rearrangement of a synthetic yeast chromosome , 2018, Nature Communications.

[65]  C. Madzak,et al.  Strong hybrid promoters and integrative expression/secretion vectors for quasi-constitutive expression of heterologous proteins in the yeast Yarrowia lipolytica. , 2000, Journal of molecular microbiology and biotechnology.

[66]  Kentaro K. Shimizu,et al.  Reference-guided de novo assembly approach improves genome reconstruction for related species , 2017, BMC Bioinformatics.

[67]  Christina A. Cuomo,et al.  Pilon: An Integrated Tool for Comprehensive Microbial Variant Detection and Genome Assembly Improvement , 2014, PloS one.

[68]  Jean Peccoud,et al.  Challenges and opportunities for strain verification by whole-genome sequencing , 2019, Scientific Reports.

[69]  Thomas Abeel,et al.  Nanopore sequencing enables near-complete de novo assembly of Saccharomyces cerevisiae reference strain CEN.PK113-7D , 2017, bioRxiv.

[70]  Jean-Marc Daran,et al.  The genome sequence of the popular hexose-transport-deficient Saccharomyces cerevisiae strain EBY.VW4000 reveals LoxP/Cre-induced translocations and gene loss. , 2015, FEMS yeast research.

[71]  Jay D Keasling,et al.  CrEdit: CRISPR mediated multi-loci gene integration in Saccharomyces cerevisiae , 2015, Microbial Cell Factories.

[72]  Nan Li,et al.  Comparison of the two major classes of assembly algorithms: overlap-layout-consensus and de-bruijn-graph. , 2012, Briefings in functional genomics.

[73]  R. Sikorski,et al.  A system of shuttle vectors and yeast host strains designed for efficient manipulation of DNA in Saccharomyces cerevisiae. , 1989, Genetics.

[74]  Tom Ellis,et al.  Biosynthesis of the antibiotic nonribosomal peptide penicillin in baker's yeast , 2016, Nature Communications.

[75]  J. Nielsen,et al.  Advances in metabolic engineering of yeast Saccharomyces cerevisiae for production of chemicals , 2014, Biotechnology journal.

[76]  Joseph P Noel,et al.  A Red Algal Bourbonane Sesquiterpene Synthase Defined by Microgram-Scale NMR-Coupled Crystalline Sponge X-ray Diffraction Analysis. , 2017, Journal of the American Chemical Society.

[77]  Brian P. Anton,et al.  Complete Genome Sequence of the Engineered Escherichia coli SHuffle Strains and Their Wild-Type Parents , 2016, Genome Announcements.

[78]  Hal S. Alper,et al.  Draft Genome Sequence of the Oleaginous Yeast Yarrowia lipolytica PO1f, a Commonly Used Metabolic Engineering Host , 2014, Genome Announcements.

[79]  Nasir Ali,et al.  Current Nucleic Acid Extraction Methods and Their Implications to Point-of-Care Diagnostics , 2017, BioMed research international.

[80]  Peter Jackson,et al.  Rewriting yeast central carbon metabolism for industrial isoprenoid production , 2016, Nature.

[81]  Wendell A. Lim,et al.  Improved Blue, Green, and Red Fluorescent Protein Tagging Vectors for S. cerevisiae , 2013, PloS one.

[82]  Ryan R. Wick,et al.  Unicycler: resolving bacterial genome assemblies from short and long sequencing reads , 2016, bioRxiv.

[83]  D. Stillman,et al.  Yeast vectors for integration at the HO locus. , 2001, Nucleic acids research.

[84]  J. Boeke,et al.  Designer deletion strains derived from Saccharomyces cerevisiae S288C: A useful set of strains and plasmids for PCR‐mediated gene disruption and other applications , 1998, Yeast.

[85]  Tadayuki Imanaka,et al.  Metabolic engineering of oleaginous yeast Yarrowia lipolytica for limonene overproduction , 2016, Biotechnology for Biofuels.

[86]  Jens Nielsen,et al.  Screening of 2A peptides for polycistronic gene expression in yeast , 2018, FEMS yeast research.

[87]  Rodney Rothstein,et al.  Elevated recombination rates in transcriptionally active DNA , 1989, Cell.

[88]  Evgeny M. Zdobnov,et al.  BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs , 2015, Bioinform..

[89]  Steven J. M. Jones,et al.  Abyss: a Parallel Assembler for Short Read Sequence Data Material Supplemental Open Access , 2022 .

[90]  I. Nookaew,et al.  Complete genomic and transcriptional landscape analysis using third-generation sequencing: a case study of Saccharomyces cerevisiae CEN.PK113-7D , 2018, Nucleic acids research.

[91]  Jie Zhang,et al.  Whole genome sequencing reveals rare off‐target mutations and considerable inherent genetic or/and somaclonal variations in CRISPR/Cas9‐edited cotton plants , 2018, Plant biotechnology journal.

[92]  Jacob Beal,et al.  Organizing genome engineering for the gigabase scale , 2020, Nature Communications.

[93]  T. Lu,et al.  Tunable and Multifunctional Eukaryotic Transcription Factors Based on CRISPR/Cas , 2013, ACS synthetic biology.

[94]  G. Stephanopoulos,et al.  Metabolic engineering in the host Yarrowia lipolytica. , 2018, Metabolic engineering.

[95]  E. Young,et al.  Genetic engineering of host organisms for pharmaceutical synthesis. , 2018, Current opinion in biotechnology.

[96]  Alexey A. Gurevich,et al.  QUAST: quality assessment tool for genome assemblies , 2013, Bioinform..

[97]  C. Thermes,et al.  Library preparation methods for next-generation sequencing: tone down the bias. , 2014, Experimental cell research.

[98]  Karl Friehs,et al.  Non-canonical integration events in Pichia pastoris encountered during standard transformation analysed with genome sequencing , 2016, Scientific Reports.

[99]  P. Suñé,et al.  Positive Outcomes Influence the Rate and Time to Publication, but Not the Impact Factor of Publications of Clinical Trial Results , 2013, PloS one.

[100]  M. Dante,et al.  Multifunctional yeast high-copy-number shuttle vectors. , 1992, Gene.

[101]  Gerrit Eggink,et al.  Production of protein-based polymers in Pichia pastoris , 2019, Biotechnology advances.

[102]  Patrick Wincker,et al.  High-Quality de Novo Genome Assembly of the Dekkera bruxellensis Yeast Using Nanopore MinION Sequencing , 2017, G3: Genes, Genomes, Genetics.

[103]  George M. Church,et al.  Genome engineering in Saccharomyces cerevisiae using CRISPR-Cas systems , 2013, Nucleic acids research.

[104]  M. Yandell,et al.  A beginner's guide to eukaryotic genome annotation , 2012, Nature Reviews Genetics.

[105]  Srikrishna Subramanian,et al.  Complete genome sequence and comparative genomics of the probiotic yeast Saccharomyces boulardii , 2017, Scientific Reports.

[106]  M. Baker 1,500 scientists lift the lid on reproducibility , 2016, Nature.

[107]  Pamela Silver,et al.  Faculty of 1000 evaluation for A pressure test to make 10 molecules in 90 days: external evaluation of methods to engineer biology. , 2018 .

[108]  Haley R Pipkins,et al.  Polyamine transporter potABCD is required for virulence of encapsulated but not nonencapsulated Streptococcus pneumoniae , 2017, PloS one.

[109]  Christopher A. Voigt,et al.  Iterative algorithm-guided design of massive strain libraries, applied to itaconic acid production in yeast. , 2018, Metabolic engineering.

[110]  Jing Li,et al.  De novo yeast genome assemblies from MinION, PacBio and MiSeq platforms , 2017, Scientific Reports.

[111]  Ian Wheeldon,et al.  Synthetic RNA Polymerase III Promoters Facilitate High-Efficiency CRISPR-Cas9-Mediated Genome Editing in Yarrowia lipolytica. , 2016, ACS synthetic biology.

[112]  Robert Mans,et al.  Under pressure: evolutionary engineering of yeast strains for improved performance in fuels and chemicals production. , 2018, Current opinion in biotechnology.