Comparative genomics of biotechnologically important yeasts

Significance The highly diverse Ascomycete yeasts have enormous biotechnological potential. Collectively, these yeasts convert a broad range of substrates into useful compounds, such as ethanol, lipids, and vitamins, and can grow in extremes of temperature, salinity, and pH. We compared 29 yeast genomes with the goal of correlating genetics to useful traits. In one rare species, we discovered a genetic code that translates CUG codons to alanine rather than canonical leucine. Genome comparison enabled correlation of genes to useful metabolic properties and showed the synteny of the mating-type locus to be conserved over a billion years of evolution. Our study provides a roadmap for future biotechnological exploitations. Ascomycete yeasts are metabolically diverse, with great potential for biotechnology. Here, we report the comparative genome analysis of 29 taxonomically and biotechnologically important yeasts, including 16 newly sequenced. We identify a genetic code change, CUG-Ala, in Pachysolen tannophilus in the clade sister to the known CUG-Ser clade. Our well-resolved yeast phylogeny shows that some traits, such as methylotrophy, are restricted to single clades, whereas others, such as l-rhamnose utilization, have patchy phylogenetic distributions. Gene clusters, with variable organization and distribution, encode many pathways of interest. Genomics can predict some biochemical traits precisely, but the genomic basis of others, such as xylose utilization, remains unresolved. Our data also provide insight into early evolution of ascomycetes. We document the loss of H3K9me2/3 heterochromatin, the origin of ascomycete mating-type switching, and panascomycete synteny at the MAT locus. These data and analyses will facilitate the engineering of efficient biosynthetic and degradative pathways and gateways for genomic manipulation.

[1]  Martin Kollmar,et al.  A novel nuclear genetic code alteration in yeasts and the evolution of codon reassignment in eukaryotes , 2016, bioRxiv.

[2]  Peter Ruhdal Conversion of the biodiesel by-product glycerol by the non-conventional yeast Pachysolen tannophilus , 2016 .

[3]  C. Trinh,et al.  Activating and Elucidating Metabolism of Complex Sugars in Yarrowia lipolytica , 2015, Applied and Environmental Microbiology.

[4]  T. Jeffries,et al.  Genomics and the making of yeast biodiversity. , 2015, Current opinion in genetics & development.

[5]  Seán S. ÓhÉigeartaigh,et al.  Clade- and species-specific features of genome evolution in the Saccharomycetaceae , 2015, FEMS yeast research.

[6]  C. T. Hittinger,et al.  Temperature and host preferences drive the diversification of Saccharomyces and other yeasts: a survey and the discovery of eight new yeast species. , 2015, FEMS yeast research.

[7]  N. Kyrpides,et al.  Complete genome sequence of Planctomyces brasiliensis type strain (DSM 5305T), phylogenomic analysis and reclassification of Planctomycetes including the descriptions of Gimesia gen. nov., Planctopirus gen. nov. and Rubinisphaera gen. nov. and emended descriptions of the order Planctomycetales and t , 2014, Standards in genomic sciences.

[8]  Antonis Rokas,et al.  The Evolution of Fungal Metabolic Pathways , 2014, PLoS genetics.

[9]  Y. Kaneko,et al.  Inversion of the Chromosomal Region between Two Mating Type Loci Switches the Mating Type in Hansenula polymorpha , 2014, PLoS genetics.

[10]  Kevin P. Byrne,et al.  Mating-type switching by chromosomal inversion in methylotrophic yeasts suggests an origin for the three-locus Saccharomyces cerevisiae system , 2014, Proceedings of the National Academy of Sciences.

[11]  Martin Kollmar,et al.  Molecular Phylogeny of Sequenced Saccharomycetes Reveals Polyphyly of the Alternative Yeast Codon Usage , 2014, Genome biology and evolution.

[12]  D. Hibbett,et al.  Latent homology and convergent regulatory evolution underlies the repeated emergence of yeasts , 2014, Nature Communications.

[13]  Inna Dubchak,et al.  MycoCosm portal: gearing up for 1000 fungal genomes , 2013, Nucleic Acids Res..

[14]  C. Kurtzman,et al.  Relationships among genera of the Saccharomycotina (Ascomycota) from multigene phylogenetic analysis of type species. , 2013, FEMS yeast research.

[15]  R. Gibbs,et al.  Mind the Gap: Upgrading Genomes with Pacific Biosciences RS Long-Read Sequencing Technology , 2012, PloS one.

[16]  M. Blackwell,et al.  Multilocus Phylogenetic Study of the Scheffersomyces Yeast Clade and Characterization of the N-Terminal Region of Xylose Reductase Gene , 2012, PloS one.

[17]  Mikko Arvas,et al.  Characterisation of the gene cluster for l-rhamnose catabolism in the yeast Scheffersomyces (Pichia) stipitis. , 2012, Gene.

[18]  Y. Ju,et al.  Lipid production from Yarrowia lipolytica Po1g grown in sugarcane bagasse hydrolysate. , 2011, Bioresource technology.

[19]  H. Klenk,et al.  Codivergence of Mycoviruses with Their Hosts , 2011, PloS one.

[20]  Alla Lapidus,et al.  Comparative genomics of xylose-fermenting fungi for enhanced biofuel production , 2011, Proceedings of the National Academy of Sciences.

[21]  L. Rusche,et al.  Reinventing Heterochromatin in Budding Yeasts: Sir2 and the Origin Recognition Complex Take Center Stage , 2011, Eukaryotic Cell.

[22]  Ioannis Xenarios,et al.  T-Coffee: a web server for the multiple sequence alignment of protein and RNA sequences using structural information and homology extension , 2011, Nucleic Acids Res..

[23]  Gabriela R. Moura,et al.  The genetic code of the fungal CTG clade. , 2011, Comptes rendus biologies.

[24]  Hideki Tohda,et al.  New insights into galactose metabolism by Schizosaccharomyces pombe: isolation and characterization of a galactose-assimilating mutant. , 2011, Journal of bioscience and bioengineering.

[25]  A. Gnirke,et al.  High-quality draft assemblies of mammalian genomes from massively parallel sequence data , 2010, Proceedings of the National Academy of Sciences.

[26]  Lynne A. Goodwin,et al.  The Genome Sequence of Methanohalophilus mahii SLPT Reveals Differences in the Energy Metabolism among Members of the Methanosarcinaceae Inhabiting Freshwater and Saline Environments , 2010, Archaea.

[27]  B. Dujon Yeast evolutionary genomics , 2010, Nature Reviews Genetics.

[28]  J. Degnan,et al.  Fast and consistent estimation of species trees using supermatrix rooted triples. , 2010, Molecular biology and evolution.

[29]  Alla Lapidus,et al.  Gap Resolution: A Software Package for Improving Newbler Genome Assemblies , 2009 .

[30]  E. Birney,et al.  Velvet: algorithms for de novo short read assembly using de Bruijn graphs. , 2008, Genome research.

[31]  F. Dietrich,et al.  The Reacquisition of Biotin Prototrophy in Saccharomyces cerevisiae Involved Horizontal Gene Transfer, Gene Duplication and Gene Clustering , 2007, Genetics.

[32]  Yong-Su Jin,et al.  Genome sequence of the lignocellulose-bioconverting and xylose-fermenting yeast Pichia stipitis , 2007, Nature Biotechnology.

[33]  J. Gatesy,et al.  The supermatrix approach to systematics. , 2007, Trends in ecology & evolution.

[34]  R. Thornton,et al.  Transformation of a glucose negative mutant ofPachysolen tannophilus with a plasmid carrying the cloned hexokinase PII gene fromSaccharomyces cerevisiae , 1989, Biotechnology Letters.

[35]  Antonis Rokas,et al.  Parallel inactivation of multiple GAL pathway genes and ecological diversification in yeasts. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[36]  Merja Penttilä,et al.  Endogenous Xylose Pathway in Saccharomyces cerevisiae , 2004, Applied and Environmental Microbiology.

[37]  J. Piškur,et al.  Horizontal gene transfer promoted evolution of the ability to propagate under anaerobic conditions in yeasts , 2004, Molecular Genetics and Genomics.

[38]  S Blair Hedges,et al.  BMC Evolutionary Biology BioMed Central , 2003 .

[39]  C. Claudel-Renard,et al.  Enzyme-specific profiles for genome annotation: PRIAM. , 2003, Nucleic acids research.

[40]  Anton J. Enright,et al.  An efficient algorithm for large-scale detection of protein families. , 2002, Nucleic acids research.

[41]  M. Tuite,et al.  The non‐standard genetic code of Candida spp.: an evolving genetic code or a novel mechanism for adaptation? , 1997, Molecular microbiology.

[42]  S. Eddy,et al.  tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. , 1997, Nucleic acids research.

[43]  Richard Chamberlin,et al.  Ribosome-mediated incorporation of a non-standard amino acid into a peptide through expansion of the genetic code , 1992, Nature.

[44]  G. Wegner Emerging applications of the methylotrophic yeasts. , 1990, FEMS microbiology reviews.

[45]  Hiroshi Honda,et al.  The codon CUG is read as serine in an asporogenic yeast Candida cylindracea , 1989, Nature.

[46]  M. Johnston A model fungal gene regulatory mechanism: the GAL genes of Saccharomyces cerevisiae. , 1987, Microbiological reviews.

[47]  Teun Boekhout,et al.  The yeasts : a taxonomic study , 1972 .