Genome-wide landscape of polyadenylation in Arabidopsis provides evidence for extensive alternative polyadenylation

Alternative polyadenylation (APA) has been shown to play an important role in gene expression regulation in animals and plants. However, the extent of sense and antisense APA at the genome level is not known. We developed a deep-sequencing protocol that queries the junctions of 3′UTR and poly(A) tails and confidently maps the poly(A) tags to the annotated genome. The results of this mapping show that 70% of Arabidopsis genes use more than one poly(A) site, excluding microheterogeneity. Analysis of the poly(A) tags reveal extensive APA in introns and coding sequences, results of which can significantly alter transcript sequences and their encoding proteins. Although the interplay of intron splicing and polyadenylation potentially defines poly(A) site uses in introns, the polyadenylation signals leading to the use of CDS protein-coding region poly(A) sites are distinct from the rest of the genome. Interestingly, a large number of poly(A) sites correspond to putative antisense transcripts that overlap with the promoter of the associated sense transcript, a mode previously demonstrated to regulate sense gene expression. Our results suggest that APA plays a far greater role in gene expression in plants than previously expected.

[1]  Q. Li,et al.  Alternative polyadenylation and gene expression regulation in plants , 2011, Wiley interdisciplinary reviews. RNA.

[2]  G. Micklem,et al.  Poly(A) Signals Located near the 5′ End of Genes Are Silenced by a General Mechanism That Prevents Premature 3′-End Processing , 2010, Molecular and Cellular Biology.

[3]  Sebastian D. Mackowiak,et al.  The Landscape of C. elegans 3′UTRs , 2010, Science.

[4]  M. Nalls,et al.  Evidence for natural antisense transcript-mediated inhibition of microRNA function , 2010, Genome Biology.

[5]  Casey R. Richardson,et al.  Analysis of Antisense Expression by Whole Genome Tiling Microarrays and siRNAs Suggests Mis-Annotation of Arabidopsis Orphan Protein-Coding Genes , 2010, PloS one.

[6]  G. Simpson,et al.  The spen family protein FPA controls alternative cleavage and polyadenylation of RNA. , 2010, Developmental cell.

[7]  C. Lister,et al.  Targeted 3′ Processing of Antisense Transcripts Triggers Arabidopsis FLC Chromatin Silencing , 2010, Science.

[8]  B. Tian,et al.  Reprogramming of 3′ Untranslated Regions of mRNAs by Alternative Polyadenylation in Generation of Pluripotent Stem Cells from Different Cell Types , 2009, PloS one.

[9]  C. Wahlestedt,et al.  Regulatory roles of natural antisense transcripts , 2009, Nature Reviews Molecular Cell Biology.

[10]  C. Mayr,et al.  Widespread Shortening of 3′UTRs by Alternative Cleavage and Polyadenylation Activates Oncogenes in Cancer Cells , 2009, Cell.

[11]  M. Settles,et al.  Large-scale analysis of antisense transcription in wheat using the Affymetrix GeneChip Wheat Genome Array , 2009, BMC Genomics.

[12]  B. Tian,et al.  Progressive lengthening of 3′ untranslated regions of mRNAs by alternative polyadenylation during mouse embryonic development , 2009, Proceedings of the National Academy of Sciences.

[13]  L. Steinmetz,et al.  Bidirectional promoters generate pervasive transcription in yeast , 2009, Nature.

[14]  Christophe Malabat,et al.  Widespread bidirectional promoters are the major source of cryptic transcripts in yeast , 2009, Nature.

[15]  Steven W. Flavell,et al.  Genome-Wide Analysis of MEF2 Transcriptional Program Reveals Synaptic Target Genes and Neuronal Activity-Dependent Polyadenylation Site Selection , 2008, Neuron.

[16]  Ramanjulu Sunkar,et al.  Genome-wide identification and analysis of small RNAs originated from natural antisense transcripts in Oryza sativa. , 2008, Genome research.

[17]  S. Baldauf,et al.  Evolution of nonstop, no-go and nonsense-mediated mRNA decay and their termination factor-derived components , 2008, BMC Evolutionary Biology.

[18]  Q. Li,et al.  A Polyadenylation Factor Subunit Implicated in Regulating Oxidative Signaling in Arabidopsis thaliana , 2008, PloS one.

[19]  Guoli Ji,et al.  Genome level analysis of rice mRNA 3′-end processing signals and alternative polyadenylation , 2008, Nucleic acids research.

[20]  Bin Liu,et al.  Genome-wide analysis for discovery of rice microRNAs reveals natural antisense microRNAs (nat-miRNAs) , 2008, Proceedings of the National Academy of Sciences.

[21]  A. Fernie Faculty Opinions recommendation of Genome-wide analysis of mRNA decay rates and their determinants in Arabidopsis thaliana. , 2008 .

[22]  Guennaelle Dieppois,et al.  Antisense RNA Stabilization Induces Transcriptional Gene Silencing via Histone Deacetylation in S. cerevisiae , 2007, Cell.

[23]  Tanya Z. Berardini,et al.  The Arabidopsis Information Resource (TAIR): gene structure and function annotation , 2007, Nucleic Acids Res..

[24]  L. Maquat,et al.  Quality control of eukaryotic mRNA: safeguarding cells from abnormal mRNA function. , 2007, Genes & development.

[25]  Stefan R. Henz,et al.  Distinct Expression Patterns of Natural Antisense Transcripts in Arabidopsis1[C][W] , 2007, Plant Physiology.

[26]  J. Pelletier,et al.  Translation of nonSTOP mRNA is repressed post‐initiation in mammalian cells , 2007, The EMBO journal.

[27]  A. Reddy Alternative splicing of pre-messenger RNAs in plants in the genomic era. , 2007, Annual review of plant biology.

[28]  T. Inada,et al.  Translation of the poly(A) tail plays crucial roles in nonstop mRNA surveillance via translation repression and protein destabilization by proteasome in yeast. , 2007, Genes & development.

[29]  Bin Tian,et al.  Widespread mRNA polyadenylation events in introns indicate dynamic interplay between polyadenylation and splicing. , 2007, Genome research.

[30]  Bin Tian,et al.  PolyA_DB 2: mRNA polyadenylation sites in vertebrate genes , 2007, Nucleic Acids Res..

[31]  Qingshun Quinn Li,et al.  Compilation of mRNA Polyadenylation Signals in Arabidopsis Revealed a New Signal Element and Potential Secondary Structures1[w] , 2005, Plant Physiology.

[32]  D. Westhead,et al.  Natural antisense transcripts with coding capacity in Arabidopsis may have a regulatory role that is not linked to double-stranded RNA degradation , 2005, Genome Biology.

[33]  G. Phillips,et al.  Identification of transcribed sequences in Arabidopsis thaliana by using high-resolution genome tiling arrays. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[34]  Bin Tian,et al.  A large-scale analysis of mRNA polyadenylation of human and mouse genes , 2005, Nucleic acids research.

[35]  Shivakundan Singh Tej,et al.  Analysis of the transcriptional complexity of Arabidopsis thaliana by massively parallel signature sequencing , 2004, Nature Biotechnology.

[36]  Shoshi Kikuchi,et al.  Antisense transcripts with rice full-length cDNAs , 2003, Genome Biology.

[37]  V. Quesada,et al.  FY Is an RNA 3′ End-Processing Factor that Interacts with FCA to Control the Arabidopsis Floral Transition , 2003, Cell.

[38]  Kathryn A. O’Donnell,et al.  An mRNA Surveillance Mechanism That Eliminates Transcripts Lacking Termination Codons , 2002, Science.

[39]  Roy Parker,et al.  Exosome-Mediated Recognition and Degradation of mRNAs Lacking a Termination Codon , 2002, Science.

[40]  C R Cantor,et al.  In silico detection of control signals: mRNA 3'-end-processing sequences in diverse species. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[41]  T. Hohn,et al.  Proximity to the promoter inhibits recognition of cauliflower mosaic virus polyadenylation signal , 1990, Nature.

[42]  J. Harms,et al.  THE LANDSCAPE OF , 2010 .

[43]  M. Schuler Splice site requirements and switches in plants. , 2008, Current topics in microbiology and immunology.

[44]  A. Hunt Messenger RNA 3' end formation in plants. , 2008, Current topics in microbiology and immunology.

[45]  A. Hunt Messenger RNA 3′-end Formation and the Regulation of Gene Expression , 2007 .

[46]  C. Bassett Regulation of gene expression in plants: the role of transcript structure and processing. , 2007 .

[47]  Nick James,et al.  NASCArrays: a repository for microarray data generated by NASC's transcriptomics service , 2004, Nucleic Acids Res..