Widespread splicing of repetitive element loci into coding regions of gene transcripts.

We performed a thorough characterization of expressed repetitive element loci (RE) in the human orbitofrontal cortex (OFC) using directional RNA sequencing data. Considering only sequencing reads that map uniquely onto the human genome, we found that the overwhelming majority of intronic and exonic RE are expressed in the same orientation as the gene in which they reside. Our mapping approach enabled the identification of novel RE transcripts that are differentially expressed between the OFC and peripheral blood lymphocytes. Further analysis revealed that RE are extensively spliced into coding regions of gene transcripts, yielding thousands of novel mRNA variants with altered coding potential. Lower-frequency splicing of RE into untranslated regions of gene transcripts was also observed. The same pattern of RE splicing was detected in the brains of Drosophila, zebrafish, mouse, rat, dog and rabbit. RE splicing occurs largely at canonical GT-AG splice junctions, with LINE and SINE elements forming the most RE splice junctions in the human OFC. This type of splicing usually gives rise to a minor splice variant of the endogenous gene, and in silico analysis suggests that RE splicing has the potential to introduce novel open reading frames. Reanalysis of previously published sequencing data from the mouse cerebellum revealed that thousands of RE splice variants are associated with translating ribosomes. Our results demonstrate that RE expression is more complex than previously envisioned and raise the possibility that RE splicing might generate functional protein isoforms.
