Nuclear genetic codes with a different meaning of the UAG and the UAA codon

BackgroundDepartures from the standard genetic code in eukaryotic nuclear genomes are known for only a handful of lineages and only a few genetic code variants seem to exist outside the ciliates, the most creative group in this regard. Most frequent code modifications entail reassignment of the UAG and UAA codons, with evidence for at least 13 independent cases of a coordinated change in the meaning of both codons. However, no change affecting each of the two codons separately has been documented, suggesting the existence of underlying evolutionary or mechanistic constraints.ResultsHere, we present the discovery of two new variants of the nuclear genetic code, in which UAG is translated as an amino acid while UAA is kept as a termination codon (along with UGA). The first variant occurs in an organism noticed in a (meta)transcriptome from the heteropteran Lygus hesperus and demonstrated to be a novel insect-dwelling member of Rhizaria (specifically Sainouroidea). This first documented case of a rhizarian with a non-canonical genetic code employs UAG to encode leucine and represents an unprecedented change among nuclear codon reassignments. The second code variant was found in the recently described anaerobic flagellate Iotanema spirale (Metamonada: Fornicata). Analyses of transcriptomic data revealed that I. spirale uses UAG to encode glutamine, similarly to the most common variant of a non-canonical code known from several unrelated eukaryotic groups, including hexamitin diplomonads (also a lineage of fornicates). However, in these organisms, UAA also encodes glutamine, whereas it is the primary termination codon in I. spirale. Along with phylogenetic evidence for distant relationship of I. spirale and hexamitins, this indicates two independent genetic code changes in fornicates.ConclusionsOur study documents, for the first time, that evolutionary changes of the meaning of UAG and UAA codons in nuclear genomes can be decoupled and that the interpretation of the two codons by the cytoplasmic translation apparatus is mechanistically separable. The latter conclusion has interesting implications for possibilities of genetic code engineering in eukaryotes. We also present a newly developed generally applicable phylogeny-informed method for inferring the meaning of reassigned codons.

[1]  Y. Inagaki,et al.  Convergence and constraint in eukaryotic release factor 1 (eRF1) domain 1: the evolution of stop codon specificity. , 2002, Nucleic acids research.

[2]  C. G. Schrago,et al.  Expanded phylogenetic analyses of the class Heterotrichea (Ciliophora, Postciliodesmatophora) using five molecular markers and morphological data. , 2016, Molecular phylogenetics and evolution.

[3]  H. Gross,et al.  Identity elements of human tRNA(Leu): structural requirements for converting human tRNA(Ser) into a leucine acceptor in vitro. , 1995, Nucleic acids research.

[4]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[5]  Tereza Ševčíková,et al.  An Unprecedented Non-canonical Nuclear Genetic Code with All Three Termination Codons Reassigned as Sense Codons , 2016, Current Biology.

[6]  P. Keeling,et al.  Environmental PCR survey to determine the distribution of a non-canonical genetic code in uncultivable oxymonads. , 2007, Environmental microbiology.

[7]  Sarah R. Smith,et al.  The Marine Microbial Eukaryote Transcriptome Sequencing Project (MMETSP): Illuminating the Functional Diversity of Eukaryotic Life in the Oceans through Transcriptome Sequencing , 2014, PLoS biology.

[8]  R Giegé,et al.  Universal rules and idiosyncratic features in tRNA identity. , 1998, Nucleic acids research.

[9]  Alan Brown,et al.  Structural basis for stop codon recognition in eukaryotes , 2015, Nature.

[10]  P. Lewis,et al.  Gene Arrangement Convergence, Diverse Intron Content, and Genetic Code Modifications in Mitochondrial Genomes of Sphaeropleales (Chlorophyta) , 2014, Genome biology and evolution.

[11]  Godelieve Gheysen,et al.  A unique genetic code change in the mitochondrial genome of the parasitic nematode Radopholus similis , 2009, BMC Research Notes.

[12]  I. Čepička,et al.  Ultrastructure and Molecular Phylogeny of Iotanema spirale gen. nov. et sp. nov., a New Lineage of Endobiotic Fornicata with Strikingly Simplified Ultrastructure , 2017, The Journal of eukaryotic microbiology.

[13]  C. Slamovits,et al.  Evolutionary Origins of Rhizarian Parasites. , 2016, Molecular biology and evolution.

[14]  Koichi Ito,et al.  How protein reads the stop codon and terminates translation , 1998, Genes to cells : devoted to molecular & cellular mechanisms.

[15]  M. Tuite,et al.  The non‐standard genetic code of Candida spp.: an evolving genetic code or a novel mechanism for adaptation? , 1997, Molecular microbiology.

[16]  Eduardo Villalobo,et al.  A New Noncanonical Nuclear Genetic Code Translation of UAA into Glutamate , 2003, Current Biology.

[17]  Mikael Olsson Table , 2019, CSS3 Quick Syntax Reference.

[18]  Tomas Johansson,et al.  Key biosynthetic gene subfamily recruited for pheromone production prior to the extensive radiation of Lepidoptera , 2008, BMC Evolutionary Biology.

[19]  Helga Thorvaldsdóttir,et al.  Integrative Genomics Viewer (IGV): high-performance genomics data visualization and exploration , 2012, Briefings Bioinform..

[20]  Shōzō Ōsawa,et al.  Evolution of the genetic code , 1995 .

[21]  Alexandros Stamatakis,et al.  RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies , 2014, Bioinform..

[22]  M. Nowacki,et al.  Genetic Codes with No Dedicated Stop Codon: Context-Dependent Translation Termination , 2016, Cell.

[23]  J. Preer,et al.  Deviation from the universal code shown by the gene for surface protein 51A in Paramecium , 1985, Nature.

[24]  T. Ohama,et al.  UAG is a sense codon in several chlorophycean mitochondria , 1996, Current Genetics.

[25]  P. Keeling,et al.  Genomics: Evolution of the Genetic Code , 2016, Current Biology.

[26]  F. Zhao,et al.  Phylogenomics of non-model ciliates based on transcriptomic analyses , 2015, Protein & Cell.

[27]  B. Leander,et al.  Characterisation of a non-canonical genetic code in the oxymonad Streblomastix strix. , 2003, Journal of molecular biology.

[28]  Jie Xiong,et al.  Phylogenomic analyses reveal subclass Scuticociliatia as the sister group of subclass Hymenostomatia within class Oligohymenophorea. , 2015, Molecular phylogenetics and evolution.

[29]  D. Scott An Annotated Listing of Host Plants of Lygus hesperus Knight , 1977 .

[30]  Jan P. Meier-Kolthoff,et al.  Comparative genomics of biotechnologically important yeasts , 2016, Proceedings of the National Academy of Sciences.

[31]  Gaurav Vaidya,et al.  SequenceMatrix: concatenation software for the fast assembly of multi‐gene datasets with character set and codon information , 2011, Cladistics : the international journal of the Willi Hennig Society.

[32]  Dieter Söll,et al.  Genetic code flexibility in microorganisms: novel mechanisms and impact on physiology , 2015, Nature Reviews Microbiology.

[33]  Dapeng Xu,et al.  The All-Data-Based Evolutionary Hypothesis of Ciliated Protists with a Revised Classification of the Phylum Ciliophora (Eukaryota, Alveolata) , 2016, Scientific Reports.

[34]  Daniel Stubbs,et al.  PhyloBayes MPI: phylogenetic reconstruction with infinite mixtures of profiles in a parallel environment. , 2013, Systematic biology.

[35]  Edward Susko,et al.  An amino acid substitution-selection model adjusts residue fitness to improve phylogenetic estimation. , 2014, Molecular biology and evolution.

[36]  Tsutomu Suzuki,et al.  Convergent evolution of AUA decoding in bacteria and archaea , 2014, RNA biology.

[37]  W. Doolittle,et al.  A non‐canonical genetic code in an early diverging eukaryotic lineage. , 1996, The EMBO journal.

[38]  Manuel A. S. Santos,et al.  Non-Standard Genetic Codes Define New Concepts for Protein Engineering , 2015, Life.

[39]  A. Korostelev Structural aspects of translation termination on the ribosome. , 2011, RNA.

[40]  K. Katoh,et al.  MAFFT Multiple Sequence Alignment Software Version 7: Improvements in Performance and Usability , 2013, Molecular biology and evolution.

[41]  I. Stansfield,et al.  Terminating eukaryote translation: domain 1 of release factor eRF1 functions in stop codon recognition. , 2000, RNA.

[42]  J. Fabrick,et al.  Sequencing and De Novo Assembly of the Western Tarnished Plant Bug (Lygus hesperus) Transcriptome , 2013, PloS one.

[43]  Sohta A. Ishikawa,et al.  A deviant genetic code in the green alga-derived plastid in the dinoflagellate Lepidodinium chlorophorum. , 2011, Molecular phylogenetics and evolution.

[44]  Steven L Salzberg,et al.  Fast gapped-read alignment with Bowtie 2 , 2012, Nature Methods.

[45]  H. Beier,et al.  Three Tetrahymena tRNA(Gln) isoacceptors as tools for studying unorthodox codon recognition and codon context effects during protein synthesis in vitro. , 1994, Nucleic acids research.

[46]  Y. Inagaki,et al.  A wide diversity of previously undetected free-living relatives of diplomonads isolated from marine/saline habitats. , 2010, Environmental microbiology.

[47]  Björn Usadel,et al.  Trimmomatic: a flexible trimmer for Illumina sequence data , 2014, Bioinform..

[48]  De novo construction of an expanded transcriptome assembly for the western tarnished plant bug, Lygus hesperus , 2016, GigaScience.

[49]  A. Simpson,et al.  Creneis carolina gen. et sp. nov. (Heterolobosea), a novel marine anaerobic protist with strikingly derived morphology and life cycle. , 2014, Protist.

[50]  Paul D. Shaw,et al.  Using Tablet for visual exploration of second-generation sequencing data , 2013, Briefings Bioinform..

[51]  Martin Kollmar,et al.  A novel nuclear genetic code alteration in yeasts and the evolution of codon reassignment in eukaryotes , 2016, bioRxiv.

[52]  Y. Kuchino,et al.  Dramatic events in ciliate evolution: alteration of UAA and UAG termination codons to glutamine codons due to anticodon mutations in two Tetrahymena tRNAsGln , 1986, The EMBO journal.

[53]  Matthew W. Brown,et al.  Coprophilic amoebae and flagellates, including Guttulinopsis, Rosculus and Helkesimastix, characterise a divergent and diverse rhizarian radiation and contribute to a large diversity of faecal-associated protists. , 2016, Environmental microbiology.

[54]  F. Dini,et al.  Large-scale phylogenomic analysis reveals the phylogenetic position of the problematic taxon Protocruzia and unravels the deep phylogenetic affinities of the ciliate lineages. , 2014, Molecular phylogenetics and evolution.

[55]  B. Lang,et al.  Mitochondrial tRNAs in the lower fungus Spizellomyces punctatus: tRNA editing and UAG 'stop' codons recognized as leucine. , 1997, Nucleic acids research.

[56]  Andrew J. Roger,et al.  A Eukaryote without a Mitochondrial Organelle , 2016, Current Biology.

[57]  Marcel Martin Cutadapt removes adapter sequences from high-throughput sequencing reads , 2011 .

[58]  Romain Derelle,et al.  Bacterial proteins pinpoint a single eukaryotic root , 2015, Proceedings of the National Academy of Sciences.

[59]  Mark A. Miller,et al.  Creating the CIPRES Science Gateway for inference of large phylogenetic trees , 2010, 2010 Gateway Computing Environments Workshop (GCE).

[60]  P. Keeling,et al.  Untangling the early diversification of eukaryotes: a phylogenomic study of the evolutionary origins of Centrohelida, Haptophyta and Cryptista , 2016, Proceedings of the Royal Society B: Biological Sciences.

[61]  Y. Inagaki,et al.  Multigene phylogenies of diverse Carpediemonas-like organisms identify the closest relatives of 'amitochondriate' diplomonads and retortamonads. , 2012, Protist.

[62]  S. Karpov,et al.  Obligately phagotrophic aphelids turned out to branch with the earliest-diverging fungi. , 2013, Protist.

[63]  S. Martinis,et al.  tRNA synthetase: tRNA aminoacylation and beyond , 2014, Wiley interdisciplinary reviews. RNA.

[64]  Laura F. Landweber,et al.  Rewiring the keyboard: evolvability of the genetic code , 2001, Nature Reviews Genetics.

[65]  A. Simpson,et al.  Molecular phylogeny of diplomonads and enteromonads based on SSU rRNA, alpha-tubulin and HSP90 genes: Implications for the evolutionary history of the double karyomastigont of diplomonads , 2008, BMC Evolutionary Biology.

[66]  Laura F. Landweber,et al.  The molecular basis of nuclear genetic code change in ciliates , 2001, Current Biology.

[67]  N. Friedman,et al.  Trinity: reconstructing a full-length transcriptome without a genome from RNA-Seq data , 2011, Nature Biotechnology.

[68]  Marco Mariotti,et al.  Novel Ciliate Genetic Code Variants Including the Reassignment of All Three Stop Codons to Sense Codons in Condylostoma magnum , 2016, Molecular biology and evolution.

[69]  P. Keeling,et al.  Complex phylogenetic distribution of a non-canonical genetic code in green algae , 2010, BMC Evolutionary Biology.

[70]  D. Bedwell,et al.  Identification of eRF1 residues that play critical and complementary roles in stop codon recognition. , 2012, RNA.

[71]  B. K. Davis Evolution of the genetic code. , 1999, Progress in biophysics and molecular biology.

[72]  O. Namy,et al.  New insights into stop codon recognition by eRF1 , 2015, Nucleic acids research.

[73]  A. von Haeseler,et al.  IQ-TREE: A Fast and Effective Stochastic Algorithm for Estimating Maximum-Likelihood Phylogenies , 2014, Molecular biology and evolution.

[74]  M. Gorovsky,et al.  An unusual genetic code in nuclear genes of Tetrahymena. , 1985, Proceedings of the National Academy of Sciences of the United States of America.

[75]  François-Joseph Lapointe,et al.  Clanistics: a multi-level perspective for harvesting unrooted gene trees. , 2010, Trends in microbiology.

[76]  Matthew W. Brown,et al.  Aggregative Multicellularity Evolved Independently in the Eukaryotic Supergroup Rhizaria , 2012, Current Biology.

[77]  Alexander C. J. Roth,et al.  Measuring codon usage bias , 2012 .