Genome level analysis of rice mRNA 3′-end processing signals and alternative polyadenylation

The position of a poly(A) site of eukaryotic mRNA is determined by sequence signals in pre-mRNA and a group of polyadenylation factors. To reveal rice poly(A) signals at a genome level, we constructed a dataset of 55 742 authenticated poly(A) sites and characterized the poly(A) signals. This resulted in identifying the typical tripartite cis-elements, including FUE, NUE and CE, as previously observed in Arabidopsis. The average size of the 3′-UTR was 289 nucleotides. When mapped to the genome, however, 15% of these poly(A) sites were found to be located in the currently annotated intergenic regions. Moreover, an extensive alternative polyadenylation profile was evident where 50% of the genes analyzed had more than one unique poly(A) site (excluding microheterogeneity sites), and 13% had four or more poly(A) sites. About 4% of the analyzed genes possessed alternative poly(A) sites at their introns, 5′-UTRs, or protein coding regions. The authenticity of these alternative poly(A) sites was partially confirmed using MPSS data. Analysis of nucleotide profile and signal patterns indicated that there may be a different set of poly(A) signals for those poly(A) sites found in the coding regions. Based on the features of rice poly(A) signals, an updated algorithm termed PASS-Rice was designed to predict poly(A) sites.

[1]  Shivakundan Singh Tej,et al.  Analysis of the transcriptional complexity of Arabidopsis thaliana by massively parallel signature sequencing , 2004, Nature Biotechnology.

[2]  J. Graber,et al.  A multispecies comparison of the metazoan 3'-processing downstream elements and the CstF-64 RNA recognition motif , 2006, BMC Genomics.

[3]  C. Moore,et al.  Analysis of RNA cleavage at the adenovirus‐2 L3 polyadenylation site. , 1986, The EMBO journal.

[4]  Rithy K. Roth,et al.  Gene expression analysis by massively parallel signature sequencing (MPSS) on microbead arrays , 2000, Nature Biotechnology.

[5]  C R Cantor,et al.  Genomic detection of new yeast pre-mRNA 3'-end-processing signals. , 1999, Nucleic acids research.

[6]  Haibo Zhang,et al.  Biased alternative polyadenylation in human tissues , 2005, Genome Biology.

[7]  C R Cantor,et al.  In silico detection of control signals: mRNA 3'-end-processing sequences in diverse species. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[8]  B. Tian,et al.  Bioinformatic identification of candidate cis-regulatory elements involved in human mRNA polyadenylation. , 2005, RNA.

[9]  R. Macknight,et al.  Components of the Arabidopsis autonomous floral promotion pathway, FCA and FY, are conserved in monocots. , 2005, Functional plant biology : FPB.

[10]  J. van Helden,et al.  Statistical analysis of yeast genomic downstream sequences reveals putative polyadenylation signals. , 2000, Nucleic acids research.

[11]  Stephen M. Mount,et al.  Comprehensive analysis of alternative splicing in rice and comparative analyses with Arabidopsis , 2006, BMC Genomics.

[12]  Jing Zhao,et al.  Formation of mRNA 3′ Ends in Eukaryotes: Mechanism, Regulation, and Interrelationships with Other Steps in mRNA Synthesis , 1999, Microbiology and Molecular Biology Reviews.

[13]  Xiaohui Wu,et al.  Predictive modeling of plant messenger RNA polyadenylation sites , 2007, BMC Bioinformatics.

[14]  Robert M. Miura,et al.  Prediction of mRNA polyadenylation sites by support vector machine , 2006, Bioinform..

[15]  K. Venkataraman,et al.  Analysis of a noncanonical poly(A) site reveals a tripartite mechanism for vertebrate poly(A) site recognition. , 2005, Genes & development.

[16]  Ying Lu,et al.  Sequence analysis of mRNA polyadenylation signals of rice genes , 2006 .

[17]  Temple F. Smith,et al.  Probabilistic prediction of Saccharomyces cerevisiae mRNA 3'-processing sites. , 2002, Nucleic acids research.

[18]  Xiaohui Wu,et al.  Modeling Plant mRNA Poly(A) Sites: Software Design and Implementation , 2007 .

[19]  Qingshun Quinn Li,et al.  Compilation of mRNA Polyadenylation Signals in Arabidopsis Revealed a New Signal Element and Potential Secondary Structures1[w] , 2005, Plant Physiology.

[20]  D. Gautheret,et al.  The disparate nature of "intergenic" polyadenylation sites. , 2006, RNA.

[21]  Martin Serrano,et al.  Nucleic Acids Research Advance Access published October 18, 2007 ChemBank: a small-molecule screening and , 2007 .

[22]  D. Weigel,et al.  Conservation and Divergence of FCA Function between Arabidopsis and Rice , 2005, Plant Molecular Biology.

[23]  G. Gilmartin Eukaryotic mRNA 3' processing: a common means to different ends. , 2005, Genes & development.

[24]  Q. Li,et al.  Calmodulin Interacts with and Regulates the RNA-Binding Activity of an Arabidopsis Polyadenylation Factor Subunit1[OA] , 2006, Plant Physiology.

[25]  T. Hohn,et al.  The contribution of AAUAAA and the upstream element UUUGUA to the efficiency of mRNA 3′‐end formation in plants. , 1994, The EMBO journal.

[26]  M. Wickens,et al.  Point mutations in AAUAAA and the poly (A) addition site: effects on the accuracy and efficiency of cleavage and polyadenylation in vitro. , 1990, Nucleic acids research.

[27]  G. Crooks,et al.  WebLogo: a sequence logo generator. , 2004, Genome research.

[28]  V. Quesada,et al.  Regulated RNA processing in the control of Arabidopsis flowering. , 2005, The International journal of developmental biology.

[29]  Heleń M. Rothnie,et al.  Plant mRNA 3′-end formation , 1996, Plant Molecular Biology.

[30]  B. Meyers,et al.  An expression atlas of rice mRNAs and small RNAs , 2007, Nature Biotechnology.

[31]  Hongwei Zhao,et al.  Arabidopsis PCFS4, a homologue of yeast polyadenylation factor Pcf11p, regulates FCA alternative processing and promotes flowering time. , 2008, The Plant journal : for cell and molecular biology.

[32]  J. Fütterer,et al.  Polyadenylation in Rice Tungro Bacilliform Virus:cis-Acting Signals and Regulation , 2001, Journal of Virology.

[33]  D. Bartel MicroRNAs Genomics, Biogenesis, Mechanism, and Function , 2004, Cell.

[34]  V. Quesada,et al.  FY Is an RNA 3′ End-Processing Factor that Interacts with FCA to Control the Arabidopsis Floral Transition , 2003, Cell.

[35]  Q. Li,et al.  A near-upstream element in a plant polyadenylation signal consists of more than six nucleotides , 1995, Plant Molecular Biology.

[36]  Xiaohong Zhu,et al.  The Bifunctional LKR/SDH Locus of Plants Also Encodes a Highly Active Monofunctional Lysine-Ketoglutarate Reductase Using a Polyadenylation Signal Located within an Intron1,212 , 2002, Plant Physiology.

[37]  D. Bartel,et al.  Computational identification of plant microRNAs and their targets, including a stress-induced miRNA. , 2004, Molecular cell.

[38]  T. Shenk,et al.  Functional analysis of point mutations in the AAUAAA motif of the SV40 late polyadenylation signal. , 1989, Nucleic acids research.

[39]  M. Peterson Mechanisms controlling production of membrane and secreted immunoglobulin during B cell development , 2007, Immunologic research.

[40]  Sihua Peng,et al.  An exploration of 3'-end processing signals and their tissue distribution in Oryza sativa. , 2007, Gene.

[41]  J. Wilusz,et al.  Cleavage site determinants in the mammalian polyadenylation signal. , 1995, Nucleic acids research.

[42]  D. Gautheret,et al.  Conservation of alternative polyadenylation patterns in mammalian genes , 2006, BMC Genomics.

[43]  W. J. Kent,et al.  BLAT--the BLAST-like alignment tool. , 2002, Genome research.

[44]  C. Bassett Regulation of gene expression in plants: the role of transcript structure and processing. , 2007 .

[45]  M. Morgante,et al.  Global expression analysis of nucleotide binding site-leucine rich repeat-encoding and related genes in Arabidopsis , 2007, BMC Plant Biology.

[46]  Q. Li,et al.  The Polyadenylation of RNA in Plants , 1997, Plant physiology.

[47]  A. Kerstan,et al.  Alternative polyadenylation events contribute to the induction of NF-ATc in effector T cells. , 1999, Immunity.

[48]  A. Furger,et al.  Integrating mRNA Processing with Transcription , 2002, Cell.