Widespread polycistronic gene expression in green algae

Significance Historically, it has been understood that for gene expression in eukaryotes, each messenger RNA encodes a single protein. With the recent development of technologies to sequence full-length transcripts en masse, we have discovered hundreds of examples in two species of green algae where two, three, or more proteins are translated from a single transcript. These “polycistronic” transcripts are found in diverse species throughout the green algal lineage, which highlights their biological importance. We have leveraged these findings to coexpress pairs of genes on polycistronic transcripts in vitro, which should facilitate efforts to engineer algae for research and industrial applications. Polycistronic gene expression, common in prokaryotes, was thought to be extremely rare in eukaryotes. The development of long-read sequencing of full-length transcript isomers (Iso-Seq) has facilitated a reexamination of that dogma. Using Iso-Seq, we discovered hundreds of examples of polycistronic expression of nuclear genes in two divergent species of green algae: Chlamydomonas reinhardtii and Chromochloris zofingiensis. Here, we employ a range of independent approaches to validate that multiple proteins are translated from a common transcript for hundreds of loci. A chromatin immunoprecipitation analysis using trimethylation of lysine 4 on histone H3 marks confirmed that transcription begins exclusively at the upstream gene. Quantification of polyadenylated [poly(A)] tails and poly(A) signal sequences confirmed that transcription ends exclusively after the downstream gene. Coexpression analysis found nearly perfect correlation for open reading frames (ORFs) within polycistronic loci, consistent with expression in a shared transcript. For many polycistronic loci, terminal peptides from both ORFs were identified from proteomics datasets, consistent with independent translation. Synthetic polycistronic gene pairs were transcribed and translated in vitro to recapitulate the production of two distinct proteins from a common transcript. The relative abundance of these two proteins can be modified by altering the Kozak-like sequence of the upstream gene. Replacement of the ORFs with selectable markers or reporters allows production of such heterologous proteins, speaking to utility in synthetic biology approaches. Conservation of a significant number of polycistronic gene pairs between C. reinhardtii, C. zofingiensis, and five other species suggests that this mechanism may be evolutionarily ancient and biologically important in the green algal lineage.

[1]  Jie Zhou,et al.  Multi-strategic RNA-seq analysis reveals a high-resolution transcriptional landscape in cotton , 2019, Nature Communications.

[2]  S. Merchant,et al.  A Series of Fortunate Events: Introducing Chlamydomonas as a Reference Organism. , 2019, The Plant cell.

[3]  A. Knoll,et al.  Neoproterozoic origin and multiple transitions to macroscopic growth in green seaweeds , 2019, Proceedings of the National Academy of Sciences.

[4]  Crysten E. Blaby-Haas,et al.  Comparative and Functional Algal Genomics. , 2019, Annual review of plant biology.

[5]  K. Niyogi,et al.  Regulation of Oxygenic Photosynthesis during Trophic Transitions in the Green Alga Chromochloris zofingiensis[OPEN] , 2019, Plant Cell.

[6]  M. Hirschey,et al.  Sensing Mitochondrial Acetyl-CoA to Tune Respiration , 2019, Trends in Endocrinology & Metabolism.

[7]  Lu-Ning Liu,et al.  Engineering and Modulating Functional Cyanobacterial CO2-Fixing Organelles , 2018, Front. Plant Sci..

[8]  Vera Meyer,et al.  Polycistronic gene expression in Aspergillus niger , 2017, Microbial Cell Factories.

[9]  Katherine M. Tucker,et al.  Analysis of SDHAF3 in familial and sporadic pheochromocytoma and paraganglioma , 2017, BMC Cancer.

[10]  J. Pittman,et al.  Metal bioremediation by CrMTP4 over-expressing Chlamydomonas reinhardtii in comparison to natural wastewater-tolerant microalgae strains , 2017 .

[11]  G. Menschaert,et al.  eIF1 modulates the recognition of suboptimal translation initiation sites and steers gene expression via uORFs , 2017, Nucleic Acids Research.

[12]  K. Niyogi,et al.  Chromosome-level genome assembly and transcriptome of the green alga Chromochloris zofingiensis illuminates astaxanthin production , 2017, Proceedings of the National Academy of Sciences.

[13]  R. Heermann,et al.  TOM9.2 Is a Calmodulin-Binding Protein Critical for TOM Complex Assembly but Not for Mitochondrial Protein Import in Arabidopsis thaliana. , 2017, Molecular plant.

[14]  J. Pringle,et al.  Robust Transgene Expression from Bicistronic mRNA in the Green Alga Chlamydomonas reinhardtii , 2016, G3: Genes, Genomes, Genetics.

[15]  M. Guarnieri,et al.  Simultaneous production of triacylglycerol and high-value carotenoids by the astaxanthin-producing oleaginous green microalga Chlorella zofingiensis. , 2016, Bioresource technology.

[16]  F. Cross Tying Down Loose Ends in the Chlamydomonas Genome: Functional Significance of Abundant Upstream Open Reading Frames , 2015, G3: Genes, Genomes, Genetics.

[17]  A. Visel,et al.  Lineage-specific chromatin signatures reveal a regulator of lipid metabolism in microalgae , 2015, Nature Plants.

[18]  Xiandong Meng,et al.  Widespread Polycistronic Transcripts in Fungi Revealed by Single-Molecule mRNA Sequencing , 2015, PloS one.

[19]  Steven J. Marygold,et al.  Gene Model Annotations for Drosophila melanogaster: The Rule-Benders , 2015, G3: Genes, Genomes, Genetics.

[20]  H. Angerer Eukaryotic LYR Proteins Interact with Mitochondrial Protein Complexes , 2015, Biology.

[21]  Kriston L. McGary,et al.  Physical linkage of metabolic genes in fungi is an adaptation against the accumulation of toxic intermediate compounds , 2013, Proceedings of the National Academy of Sciences.

[22]  G. Giuliano,et al.  A Chlamydomonas-Derived Human Papillomavirus 16 E7 Vaccine Induces Specific Tumor Protection , 2013, PloS one.

[23]  Helga Thorvaldsdóttir,et al.  Integrative Genomics Viewer (IGV): high-performance genomics data visualization and exploration , 2012, Briefings Bioinform..

[24]  René H Wijffels,et al.  The impact of nitrogen starvation on the dynamics of triacylglycerol accumulation in nine microalgae strains. , 2012, Bioresource technology.

[25]  Richard J Jackson,et al.  Termination and post-termination events in eukaryotic translation. , 2012, Advances in protein chemistry and structural biology.

[26]  T. Blumenthal,et al.  Trans‐splicing , 2011, Wiley interdisciplinary reviews. RNA.

[27]  Chun Liang,et al.  Unique Features of Nuclear mRNA Poly(A) Signals and Alternative Polyadenylation in Chlamydomonas reinhardtii , 2008, Genetics.

[28]  M. Ryan,et al.  Occurrence, function and evolutionary origins of ‘2A-like’ sequences in virus genomes , 2008, The Journal of General Virology.

[29]  J. Heesemann,et al.  Improvement of reporter activity by IRES-mediated polycistronic reporter system , 2008, Nucleic acids research.

[30]  Sara L. Zimmer,et al.  The Chlamydomonas Genome Reveals the Evolution of Key Animal and Plant Functions , 2007, Science.

[31]  E. Gálová,et al.  Chlamydomonas reinhardtii: a convenient model system for the study of DNA repair in photoautotrophic eukaryotes , 2007, Current Genetics.

[32]  J. L. Petersen,et al.  REX1, a Novel Gene Required for DNA Repair* , 2003, Journal of Biological Chemistry.

[33]  D. Sachs,et al.  Expression of MHC Class II DQ α/β Heterodimers from Recombinant Polycistronic Retroviral Genomes , 2003, Surgery Today.

[34]  M. Kozak,et al.  Pushing the limits of the scanning mechanism for initiation of translation , 2002, Gene.

[35]  S. Merchant,et al.  Reciprocal Expression of Two Candidate Di-Iron Enzymes Affecting Photosystem I and Light-Harvesting Complex Accumulation , 2002, The Plant Cell Online.

[36]  T A Gray,et al.  An imprinted, mammalian bicistronic transcript encodes two independent proteins. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[37]  T. Fujita,et al.  Cloning of a polycistronic cDNA from tomato encoding γ-glutamyl kinase and γ-glutamyl phosphate reductase , 1997 .

[38]  D. Pauli,et al.  An unusual split Drosophila heat shock gene expressed during embryogenesis, pupation and in testis. , 1988, Journal of molecular biology.