Investigating RNA editing in deep transcriptome datasets with REDItools and REDIportal

RNA editing is a widespread post-transcriptional mechanism able to modify transcripts through insertions/deletions or base substitutions. It is prominent in mammals, in which millions of adenosines are deaminated to inosines by members of the ADAR family of enzymes. A-to-I RNA editing has a plethora of biological functions, but its detection in large-scale transcriptome datasets is still an unsolved computational task. To this aim, we developed REDItools, the first software package devoted to the RNA editing profiling in RNA-sequencing (RNAseq) data. It has been successfully used in human transcriptomes, proving the tissue and cell type specificity of RNA editing as well as its pervasive nature. Outcomes from large-scale REDItools analyses on human RNAseq data have been collected in our specialized REDIportal database, containing more than 4.5 million events. Here we describe in detail two bioinformatic procedures based on our computational resources, REDItools and REDIportal. In the first procedure, we outline a workflow to detect RNA editing in the human cell line NA12878, for which transcriptome and whole genome data are available. In the second procedure, we show how to identify dysregulated editing at specific recoding sites in post-mortem brain samples of Huntington disease donors. On a 64-bit computer running Linux with ≥32 GB of random-access memory (RAM), both procedures should take ~76 h, using 4 to 24 cores. Our protocols have been designed to investigate RNA editing in different organisms with available transcriptomic and/or genomic reads. Scripts to complete both procedures and a docker image are available at https://github.com/BioinfoUNIBA/REDItools . This protocol describes bioinformatics procedures to detect RNA editing in RNA-sequencing datasets using REDItools and REDIportal. REDItools is a software package to profile RNA editing, while known editing sites are collected in the REDIportal database.

[1]  Ernesto Picardi,et al.  REDItools: high-throughput RNA editing detection made easy , 2013, Bioinform..

[2]  Sean Chun-Chang Chen,et al.  The Cancer Editome Atlas: A Resource for Exploratory Analysis of the Adenosine-to-Inosine RNA Editome in Cancer. , 2019, Cancer research.

[3]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[4]  R. Benne,et al.  Major transcript of the frameshifted coxll gene from trypanosome mitochondria contains four nucleotides that are not encoded in the DNA , 1986, Cell.

[5]  Yang Li,et al.  A-to-I RNA editing is developmentally regulated and generally adaptive for sexual reproduction in Neurospora crassa , 2017, Proceedings of the National Academy of Sciences.

[6]  David Sankoff,et al.  A consolidation algorithm for genomes fractionated after higher order polyploidization , 2012, BMC Bioinformatics.

[7]  Yi Xing,et al.  Transcriptome sequencing reveals aberrant alternative splicing in Huntington's disease. , 2016, Human molecular genetics.

[8]  G. Pesole,et al.  Bioengineering and Biotechnology Methods Article Uncovering Rna Editing Sites in Long Non-coding Rnas , 2022 .

[9]  Jin Billy Li,et al.  RADAR: a rigorously annotated database of A-to-I RNA editing , 2013, Nucleic Acids Res..

[10]  G. Pesole,et al.  Whole transcriptome profiling of Late-Onset Alzheimer’s Disease patients provides insights into the molecular changes involved in the disease , 2018, Scientific Reports.

[11]  Feng Liu,et al.  Accurate identification of RNA editing sites from primitive sequence with deep neural networks , 2018, Scientific Reports.

[12]  Björn Usadel,et al.  Trimmomatic: a flexible trimmer for Illumina sequence data , 2014, Bioinform..

[13]  Serban Nacu,et al.  Fast and SNP-tolerant detection of complex variants and splicing in short reads , 2010, Bioinform..

[14]  M. Schaefer,et al.  "Mining the Epitranscriptome: Detection of RNA editing and RNA modifications". , 2019, Methods.

[15]  R. Emeson,et al.  Functions and mechanisms of RNA editing. , 2000, Annual review of genetics.

[16]  E. Levanon,et al.  Reduced levels of protein recoding by A-to-I RNA editing in Alzheimer's disease , 2016, RNA.

[17]  Steven L Salzberg,et al.  Fast gapped-read alignment with Bowtie 2 , 2012, Nature Methods.

[18]  David John,et al.  RNAEditor: easy detection of RNA editing events and the introduction of editing islands , 2016, Briefings Bioinform..

[19]  Xavier Estivill,et al.  A myriad of miRNA variants in control and Huntington’s disease brain regions detected by massively parallel sequencing , 2010, Nucleic acids research.

[20]  Eli Eisenberg,et al.  A-to-I RNA editing — immune protector and transcriptome diversifier , 2018, Nature Reviews Genetics.

[21]  Jia Gu,et al.  fastp: an ultra-fast all-in-one FASTQ preprocessor , 2018 .

[22]  Min-Su Kim,et al.  RDDpred: a condition-specific RNA-editing prediction model from RNA-seq data , 2016, BMC Genomics.

[23]  Ernesto Picardi,et al.  Profiling RNA editing in human tissues: towards the inosinome Atlas , 2015, Scientific Reports.

[24]  P. Deininger Alu elements: know the SINEs , 2011, Genome Biology.

[25]  Thomas R. Gingeras,et al.  STAR: ultrafast universal RNA-seq aligner , 2013, Bioinform..

[26]  Erez Y. Levanon,et al.  A genome-wide map of hyper-edited RNA reveals numerous new sites , 2014, Nature Communications.

[27]  Eli Eisenberg,et al.  RNA editing is abundant and correlates with task performance in a social bumblebee , 2019, Nature Communications.

[28]  Michael R. Johnson,et al.  Genome-wide analysis of differential RNA editing in epilepsy. , 2017, Genome research.

[29]  Chris Williams,et al.  RNA-SeQC: RNA-seq metrics for quality control and process optimization , 2012, Bioinform..

[30]  Steven L Salzberg,et al.  HISAT: a fast spliced aligner with low memory requirements , 2015, Nature Methods.

[31]  D. Conrad,et al.  Dynamic landscape and regulation of RNA editing in mammals , 2017, Nature.

[32]  Ernesto Picardi,et al.  Using REDItools to Detect RNA Editing Events in NGS Datasets , 2015, Current protocols in bioinformatics.

[33]  Janusz M. Bujnicki,et al.  MODOMICS: a database of RNA modification pathways. 2017 update , 2017, Nucleic Acids Res..

[34]  Luis M. Valor Transcription, Epigenetics and Ameliorative Strategies in Huntington’s Disease: a Genome-Wide Perspective , 2014, Molecular Neurobiology.

[35]  Wei Li,et al.  RSeQC: quality control of RNA-seq experiments , 2012, Bioinform..

[36]  Hui Yang,et al.  Genomic variant annotation and prioritization with ANNOVAR and wANNOVAR , 2015, Nature Protocols.

[37]  X. Xiao,et al.  Genome Sequence-Independent Identification of RNA Editing Sites , 2015, Nature Methods.

[38]  H. Hakonarson,et al.  ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data , 2010, Nucleic acids research.

[39]  Pei Zhang,et al.  RES-Scanner: a software package for genome-wide identification of RNA-editing sites , 2016, GigaScience.

[40]  Eli Eisenberg,et al.  Human cancer tissues exhibit reduced A-to-I editing of miRNAs coupled with elevated editing of their targets , 2017, Nucleic acids research.

[41]  Mukesh Jain,et al.  NGS QC Toolkit: A Toolkit for Quality Control of Next Generation Sequencing Data , 2012, PloS one.

[42]  Ernesto Picardi,et al.  REDIportal: a comprehensive database of A-to-I RNA editing events in humans , 2016, Nucleic Acids Res..

[43]  Rosario Distefano,et al.  ncRNA Editing: Functional Characterization and Computational Resources. , 2019, Methods in molecular biology.

[44]  Yishay Pinto,et al.  Mammalian conserved ADAR targets comprise only a small fragment of the human editosome , 2014, Genome Biology.

[45]  G. Pesole,et al.  RNA editing signature during myeloid leukemia cell differentiation , 2017, Leukemia.

[46]  Jin Billy Li,et al.  Accurate identification of human Alu and non-Alu RNA editing sites , 2012, Nature Methods.

[47]  Colin Kennedy Paediatric neurology: understanding risk and improving therapeutic choices , 2011, The Lancet Neurology.

[48]  Christoph Dieterich,et al.  JACUSA: site-specific identification of RNA editing events from replicate sequencing data , 2017, BMC Bioinformatics.

[49]  Angela Gallo,et al.  ADAR RNA editing in human disease; more to it than meets the I , 2017, Human Genetics.

[50]  Giorgio Valle,et al.  Large-scale detection and analysis of RNA editing in grape mtDNA by RNA deep-sequencing , 2010, Nucleic acids research.

[51]  Meng Wang,et al.  pblat: a multithread blat algorithm speeding up aligning sequences to genomes , 2019, BMC Bioinformatics.

[52]  Richard Durbin,et al.  Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .

[53]  Feng Zhang,et al.  SPRINT: an SNP-free toolkit for identifying RNA editing sites , 2017, Bioinform..

[54]  Marcel Martin Cutadapt removes adapter sequences from high-throughput sequencing reads , 2011 .

[55]  J. Olson,et al.  Regional and cellular gene expression changes in human Huntington's disease brain. , 2006, Human molecular genetics.

[56]  W. J. Kent,et al.  BLAT--the BLAST-like alignment tool. , 2002, Genome research.

[57]  Eric L Van Nostrand,et al.  Widespread RNA editing dysregulation in brains from autistic individuals , 2018, Nature Neuroscience.

[58]  Michael M. Mwangi,et al.  Transcriptome-wide sequencing reveals numerous APOBEC1 mRNA editing targets in transcript 3′ UTRs , 2010, Nature Structural &Molecular Biology.

[59]  Xun Xu,et al.  RED-ML: a novel, effective RNA editing detection method based on machine learning , 2017, GigaScience.

[60]  Ernesto Picardi,et al.  Massive transcriptome sequencing of human spinal cord tissues provides new insights into motor neuron degeneration in ALS , 2017, Scientific Reports.

[61]  Eli Eisenberg,et al.  Bioinformatic approaches for identification of A-to-I editing sites. , 2012, Current topics in microbiology and immunology.

[62]  Chris P. Ponting,et al.  The RNA-Editing Enzyme ADAR1 Controls Innate Immune Responses to RNA , 2014, Cell reports.

[63]  Ernesto Picardi,et al.  Single-cell transcriptomics reveals specific RNA editing signatures in the human brain. , 2017, RNA.

[64]  Hui Zhang,et al.  Identification of Symmetrical RNA Editing Events in the Mitochondria of Salvia miltiorrhiza by Strand-specific RNA Sequencing , 2017, Scientific Reports.

[65]  K. Nishikura,et al.  A-to-I editing of coding and non-coding RNAs by ADARs , 2015, Nature Reviews Molecular Cell Biology.

[66]  C A Ross,et al.  Decreased expression of striatal signaling genes in a mouse model of Huntington's disease. , 2000, Human molecular genetics.

[67]  A global reference for human genetic variation , 2015, Nature.

[68]  Ernesto Picardi,et al.  Dynamic inosinome profiles reveal novel patient stratification and gender-specific differences in glioblastoma , 2019, Genome Biology.

[69]  Salvatore Alaimo,et al.  Knowledge in the Investigation of A-to-I RNA Editing Signals , 2015, Front. Bioeng. Biotechnol..

[70]  C. Ross,et al.  Huntington's disease: from molecular pathogenesis to clinical treatment , 2011, The Lancet Neurology.

[71]  Qin Li,et al.  Illuminating spatial A-to-I RNA editing signatures within the Drosophila brain , 2018, Proceedings of the National Academy of Sciences.

[72]  Eli Eisenberg,et al.  A-to-I RNA editing occurs at over a hundred million genomic sites, located in a majority of human genes , 2014, Genome research.

[73]  Siu-Ming Yiu,et al.  SOAP2: an improved ultrafast tool for short read alignment , 2009, Bioinform..

[74]  Manuel Aranda,et al.  Condition-specific RNA editing in the coral symbiont Symbiodinium microadriaticum , 2017, PLoS genetics.

[75]  K. Nishikura,et al.  A-to-I editing of protein coding and noncoding RNAs , 2012, Critical reviews in biochemistry and molecular biology.

[76]  Jonas Korlach,et al.  The birth of the Epitranscriptome: deciphering the function of RNA modifications , 2012, Genome Biology.

[77]  Gabor T. Marth,et al.  A global reference for human genetic variation , 2015, Nature.

[78]  Ernesto Picardi,et al.  Elucidating the editome: bioinformatics approaches for RNA editing detection , 2019, Briefings Bioinform..

[79]  Pavel V. Baranov,et al.  Darned in 2013: inclusion of model organisms and linking with Wikipedia , 2012, Nucleic Acids Res..

[80]  Eli Eisenberg,et al.  Massive A-to-I RNA editing is common across the Metazoa and correlates with dsRNA abundance , 2017, Genome Biology.