A genome-wide map of hyper-edited RNA reveals numerous new sites

Adenosine-to-inosine editing is one of the most frequent post-transcriptional modifications, manifested as A-to-G mismatches when comparing RNA sequences with their source DNA. Recently, a number of RNA-seq data sets have been screened for the presence of A-to-G editing, and hundreds of thousands of editing sites identified. Here we show that existing screens missed the majority of sites by ignoring reads with excessive (‘hyper’) editing that do not easily align to the genome. We show that careful alignment and examination of the unmapped reads in RNA-seq studies reveal numerous new sites, usually many more than originally discovered, and in precisely those regions that are most heavily edited. Specifically, we discover 327,096 new editing sites in the heavily studied Illumina Human BodyMap data and more than double the number of detected sites in several published screens. We also identify thousands of new sites in mouse, rat, opossum and fly. Our results establish that hyper-editing events account for the majority of editing sites.

[1]  G. Church,et al.  Evidence for large diversity in the human transcriptome created by Alu RNA editing , 2009, Nucleic acids research.

[2]  H. Hakonarson,et al.  ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data , 2010, Nucleic acids research.

[3]  Ernesto Picardi,et al.  REDItools: high-throughput RNA editing detection made easy , 2013, Bioinform..

[4]  A. Scadden,et al.  Tudor-SN and ADAR1 are components of cytoplasmic stress granules. , 2012, RNA.

[5]  P. Anderson,et al.  RNA granules: post-transcriptional and epigenetic modulators of gene expression , 2009, Nature Reviews Molecular Cell Biology.

[6]  T. Matise,et al.  Widespread RNA editing of embedded alu elements in the human transcriptome. , 2004, Genome research.

[7]  G. Carmichael,et al.  Altered nuclear retention of mRNAs containing inverted repeats in human embryonic stem cells: functional role of a nuclear noncoding RNA. , 2009, Molecular cell.

[8]  Jin Billy Li,et al.  RADAR: a rigorously annotated database of A-to-I RNA editing , 2013, Nucleic Acids Res..

[9]  G. Church,et al.  Genome-Wide Identification of Human RNA Editing Sites by Parallel DNA Capturing and Sequencing , 2009, Science.

[10]  S Rozen,et al.  Primer3 on the WWW for general users and for biologist programmers. , 2000, Methods in molecular biology.

[11]  George M Church,et al.  Deciphering the functions and regulation of brain-enriched A-to-I RNA editing , 2013, Nature Neuroscience.

[12]  B. Bass,et al.  Inosine exists in mRNA at tissue‐specific levels and is most abundant in brain mRNA , 1998, The EMBO journal.

[13]  Zipora Y. Fligelman,et al.  Systematic identification of abundant A-to-I editing sites in the human transcriptome , 2004, Nature Biotechnology.

[14]  A. Scadden The RISC subunit Tudor-SN binds to hyper-edited double-stranded RNA and promotes its cleavage , 2005, Nature Structural &Molecular Biology.

[15]  Ayelet T. Lamm,et al.  Competition between ADAR and RNAi pathways for an extensive class of RNA targets , 2011, Nature Structural &Molecular Biology.

[16]  K. Nishikura Functions and regulation of RNA editing by ADAR deaminases. , 2010, Annual review of biochemistry.

[17]  Thomas L. Madden,et al.  BLAST 2 Sequences, a new tool for comparing protein and nucleotide sequences. , 1999, FEMS microbiology letters.

[18]  Jin Billy Li,et al.  Edinburgh Research Explorer Identifying Rna Editing Sites Using Rna Sequencing Data Alone , 2022 .

[19]  B. Williams,et al.  RNA editing in the human ENCODE RNA-seq data , 2012, Genome research.

[20]  G. Church,et al.  Large-scale DNA editing of retrotransposons accelerates mammalian genome evolution. , 2011, Nature communications.

[21]  B. Bass,et al.  C. elegans and H. sapiens mRNAs with edited 3' UTRs are present on polysomes. , 2008, RNA.

[22]  B. Bass,et al.  RNA hairpins in noncoding regions of human brain and Caenorhabditis elegans mRNA are edited by adenosine deaminases that act on RNA , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[23]  Ana Kozomara,et al.  miRBase: annotating high confidence microRNAs using deep sequencing data , 2013, Nucleic Acids Res..

[24]  Pavel V. Baranov,et al.  DARNED: a DAtabase of RNa EDiting in humans , 2010, Bioinform..

[25]  Brenda L. Bass,et al.  Predicting sites of ADAR editing in double-stranded RNA , 2011, Nature communications.

[26]  Alexander Rich,et al.  Widespread A-to-I RNA Editing of Alu-Containing mRNAs in the Human Transcriptome , 2004, PLoS biology.

[27]  A. Scadden Inosine-Containing dsRNA Binds a Stress-Granule-like Complex and Downregulates Gene Expression In trans , 2007, Molecular cell.

[28]  G. Silberberg,et al.  Alu elements shape the primate transcriptome by cis-regulation of RNA editing , 2014, Genome Biology.

[29]  Erez Y. Levanon,et al.  Widespread occurrence of antisense transcription in the human genome , 2003, Nature Biotechnology.

[30]  Wenwei Zhang,et al.  Comprehensive analysis of RNA-Seq data reveals extensive RNA editing in a human transcriptome , 2012, Nature Biotechnology.

[31]  Mary Goldman,et al.  The UCSC Genome Browser database: update 2011 , 2010, Nucleic Acids Res..

[32]  C. Burge,et al.  Evolutionary Dynamics of Gene and Isoform Regulation in Mammalian Tissues , 2012, Science.

[33]  Pavel V. Baranov,et al.  Darned in 2013: inclusion of model organisms and linking with Wikipedia , 2012, Nucleic Acids Res..

[34]  C. Burge,et al.  Conserved Seed Pairing, Often Flanked by Adenosines, Indicates that Thousands of Human Genes are MicroRNA Targets , 2005, Cell.

[35]  Brenda L Bass,et al.  RNA editing by adenosine deaminases that act on RNA. , 2002, Annual review of biochemistry.

[36]  Jin Billy Li,et al.  Comment on “Widespread RNA and DNA Sequence Differences in the Human Transcriptome” , 2012, Science.

[37]  Yishay Pinto,et al.  Mammalian conserved ADAR targets comprise only a small fragment of the human editosome , 2014, Genome Biology.

[38]  S. Bergmann,et al.  The evolution of gene expression levels in mammalian organs , 2011, Nature.

[39]  Yukio Kawahara,et al.  A-to-I RNA Editing and Human Disease , 2006, RNA biology.

[40]  Erez Y. Levanon,et al.  Identification of Widespread Ultra-Edited Human RNAs , 2011, PLoS genetics.

[41]  Eli Eisenberg,et al.  RNA editing level in the mouse is determined by the genomic repeat repertoire. , 2006, RNA.

[42]  Richard Durbin,et al.  High levels of RNA-editing site conservation amongst 15 laboratory mouse strains , 2012, Genome Biology.

[43]  Colin N. Dewey,et al.  RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome , 2011, BMC Bioinformatics.

[44]  R. Sorek,et al.  Is abundant A-to-I RNA editing primate-specific? , 2004, Trends in genetics : TIG.

[45]  Michael Q. Zhang,et al.  Regulating Gene Expression through RNA Nuclear Retention , 2005, Cell.

[46]  Philipp Kapranov,et al.  Genome-wide analysis of A-to-I RNA editing by single-molecule sequencing in Drosophila , 2013, Nature Structural &Molecular Biology.

[47]  Jin Billy Li,et al.  Accurate identification of human Alu and non-Alu RNA editing sites , 2012, Nature Methods.

[48]  Yiannis A. Savva,et al.  The ADAR protein family , 2012, Genome Biology.

[49]  Richard Wooster,et al.  A survey of RNA editing in human brain. , 2004, Genome research.

[50]  Aaron R. Quinlan,et al.  Bioinformatics Applications Note Genome Analysis Bedtools: a Flexible Suite of Utilities for Comparing Genomic Features , 2022 .

[51]  Kazuko Nishikura,et al.  Adenosine-to-inosine RNA editing and human disease , 2013, Genome Medicine.

[52]  Michael M. Mwangi,et al.  Transcriptome-wide sequencing reveals numerous APOBEC1 mRNA-editing targets in transcript 3′ UTRs , 2012, Nature Structural &Molecular Biology.

[53]  Eli Eisenberg,et al.  A-to-I RNA editing occurs at over a hundred million genomic sites, located in a majority of human genes , 2014, Genome research.

[54]  M. Rosbash,et al.  Nascent-seq indicates widespread cotranscriptional RNA editing in Drosophila. , 2012, Molecular cell.

[55]  C. Smith,et al.  Specific cleavage of hyper‐edited dsRNAs , 2001, The EMBO journal.

[56]  E. Levanon,et al.  Identification of RNA editing sites in the SNP database , 2005, Nucleic acids research.

[57]  David Haussler,et al.  The UCSC genome browser database: update 2007 , 2006, Nucleic Acids Res..

[58]  N. A. Temiz,et al.  APOBEC3B is an enzymatic source of mutation in breast cancer , 2013, Nature.

[59]  Jae-Hyung Lee,et al.  Accurate identification of A-to-I RNA editing in human by transcriptome sequencing. , 2012, Genome research.

[60]  S. Batalov,et al.  Antisense Transcription in the Mammalian Transcriptome , 2005, Science.

[61]  Michael Zuker,et al.  Mfold web server for nucleic acid folding and hybridization prediction , 2003, Nucleic Acids Res..

[62]  P. Seeburg,et al.  Modulation of microRNA processing and expression through RNA editing by ADAR deaminases , 2006, Nature Structural &Molecular Biology.

[63]  Aamira Tariq,et al.  Transcript Diversification in the Nervous System: A to I RNA Editing in CNS Function and Disease Development , 2012, Front. Neurosci..

[64]  W. J. Kent,et al.  BLAT--the BLAST-like alignment tool. , 2002, Genome research.

[65]  Michael M. Mwangi,et al.  Transcriptome-wide sequencing reveals numerous APOBEC1 mRNA editing targets in transcript 3′ UTRs , 2010, Nature Structural &Molecular Biology.