Genome-wide analysis of the associations between polyadenylation sites and repeated sequences in Arabidopsis thaliana

In eukaryotes, polyadenylation [poly(A)] is a critical step in gene expression and plays an important role in gene regulation. Repeated sequences are widely distributed in eukaryotic genomes and make up an important part of them. They fall into two categories: interspersed repeats, which derive mainly from transposable elements (TEs), and tandem repeats (TRs). Here we study the associations between poly(A) sites and these two main categories of repeated sequences, TEs and TRs, across the whole genome of Arabidopsis thaliana. The results suggest potential associations between poly(A) sites and repeated sequences. TEs are more closely linked with poly(A) sites than TRs are, especially for subfamilies of short interspersed elements (SINEs) and long interspersed elements (LINEs). The association of poly(A) sites with TEs shows a significant strand bias, whereas no such bias was found for TRs. The distributions of poly(A) site associations with TEs and with TRs across chromosomes share some similarities. This study provides general insight into the associations between poly(A) sites and repeated sequences in Arabidopsis thaliana and other plants.
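The abstract does not describe how associations were computed. As a rough illustration only, the sketch below shows one way a poly(A) site could be matched to the repeats that contain it and how same-strand versus opposite-strand associations could be tallied. The tuple layouts, the function names `associate_sites` and `strand_counts`, and the 1-based inclusive coordinates are all assumptions made for this example, not the authors' method.

```python
from bisect import bisect_right
from collections import Counter


def associate_sites(sites, repeats):
    """Pair each poly(A) site with every repeat whose interval contains it.

    sites:   list of (chrom, pos, strand)                 -- hypothetical layout
    repeats: list of (chrom, start, end, strand, family)  -- hypothetical layout
    Coordinates are assumed to be 1-based and inclusive.
    """
    # Group repeats by chromosome and sort them by start coordinate.
    by_chrom = {}
    for rep in repeats:
        by_chrom.setdefault(rep[0], []).append(rep)
    starts = {}
    for chrom, reps in by_chrom.items():
        reps.sort(key=lambda r: r[1])
        starts[chrom] = [r[1] for r in reps]

    pairs = []
    for site in sites:
        chrom, pos, _strand = site
        reps = by_chrom.get(chrom, [])
        # Only repeats starting at or before the site can contain it.
        upper = bisect_right(starts.get(chrom, []), pos)
        # Simple scan of the candidates; not optimized for large inputs.
        for rep in reps[:upper]:
            if rep[2] >= pos:
                pairs.append((site, rep))
    return pairs


def strand_counts(pairs):
    """Count associations where site and repeat lie on the same or opposite strand."""
    counts = Counter()
    for site, rep in pairs:
        counts["same" if site[2] == rep[3] else "opposite"] += 1
    return counts


# Toy data, invented purely for illustration.
example_sites = [("Chr1", 150, "+"), ("Chr1", 500, "-"), ("Chr2", 50, "+")]
example_repeats = [
    ("Chr1", 100, 200, "+", "SINE"),
    ("Chr1", 450, 600, "+", "LINE"),
    ("Chr2", 100, 200, "-", "TR"),
]
example_pairs = associate_sites(example_sites, example_repeats)
example_counts = strand_counts(example_pairs)
```

A strand bias of the kind reported for TEs would show up as a marked imbalance between the "same" and "opposite" counts; whether such an imbalance is significant would need a statistical test, which this sketch leaves out.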
