Unique Features of Nuclear mRNA Poly(A) Signals and Alternative Polyadenylation in Chlamydomonas reinhardtii

To understand nuclear mRNA polyadenylation mechanisms in the model alga Chlamydomonas reinhardtii, we generated a data set of 16,952 in silico-verified poly(A) sites from EST sequencing traces, based on Chlamydomonas Genome Assembly v.3.1. Analysis of this data set revealed a unique and complex polyadenylation signal profile that sets Chlamydomonas apart from other organisms. In contrast to the high AU content in the 3′-UTRs of other organisms, Chlamydomonas shows a high guanylate content that transitions to a high cytidylate content around the poly(A) site. The average length of the 3′-UTR is 595 nucleotides (nt), significantly longer than that of Arabidopsis and rice. The dominant poly(A) signal, UGUAA, was found in 52% of the near-upstream elements, and its occurrence may be positively correlated with higher gene expression levels. The UGUAA signal also exists in Arabidopsis and in some mammalian genes, but mainly in the far-upstream elements, suggesting a shift in function. The C-rich region after poly(A) sites, with its unique signal elements, is a characteristic downstream element that is lacking in higher plants. We also found a high level of alternative polyadenylation in the Chlamydomonas genome: up to 33% of the 4057 genes analyzed have at least two unique poly(A) sites, and ∼1% of these genes have poly(A) sites residing in predicted coding sequences, introns, and 5′-UTRs. These alternative poly(A) sites potentially contribute to transcriptome diversity and gene expression regulation.
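As a rough illustration of how a figure such as the share of near-upstream elements containing UGUAA could be tallied, the following Python sketch scans a window upstream of each poly(A) site for the signal. It is not the authors' pipeline: the function names, the input convention (each sequence ends immediately before the cleavage site), and the window bounds of 5 to 35 nt are assumptions chosen for illustration only.

```python
from typing import Iterable, Tuple


def has_signal(upstream: str, signal: str = "UGUAA",
               window: Tuple[int, int] = (5, 35)) -> bool:
    """Report whether `signal` lies entirely inside the upstream window.

    `upstream` is assumed to end immediately before the cleavage site, so its
    last character is position -1. `window` = (near, far) gives the distances
    in nt from the site that bound the region (hypothetical values, not taken
    from the abstract).
    """
    near, far = window
    seq = upstream.upper().replace("T", "U")  # accept DNA-style EST input
    n = len(seq)
    start = max(0, n - far)        # index of position -far
    end = max(0, n - near + 1)     # one past the index of position -near
    return signal in seq[start:end]


def signal_fraction(upstreams: Iterable[str], signal: str = "UGUAA",
                    window: Tuple[int, int] = (5, 35)) -> float:
    """Fraction of sequences whose upstream window contains `signal`."""
    seqs = list(upstreams)
    if not seqs:
        return 0.0
    hits = sum(has_signal(s, signal, window) for s in seqs)
    return hits / len(seqs)
```

For example, `signal_fraction(list_of_upstream_sequences)` returns a value between 0 and 1; the result depends directly on how the window is defined, so any comparison with the reported 52% would require the window definition used in the study.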
