Accurate mapping of tRNA reads

Motivation Many repetitive DNA elements are transcribed at appreciable expression levels. Mapping the corresponding RNA sequencing reads back to a reference genome is notoriously difficult and error-prone task, however. This is in particular true if chemical modifications introduce systematic mismatches, while at the same time the genomic loci are only approximately identical, as in the case of tRNAs. Results We therefore developed a dedicated mapping strategy to handle RNA-seq reads that map to tRNAs relying on a modified target genome in which known tRNA loci are masked and instead intronless tRNA precursor sequences are appended as artificial 'chromosomes'. In a first pass, reads that overlap the boundaries of mature tRNAs are extracted. In the second pass, the remaining reads are mapped to a tRNA-masked target that is augmented by representative mature tRNA sequences. Using both simulated and real life data we show that our best-practice workflow removes most of the mapping artefacts introduced by simpler mapping schemes and makes it possible to reliably identify many of chemical tRNA modifications in generic small RNA-seq data. Using simulated data the FDR is only 2%. We find compelling evidence for tissue specific differences of tRNA modification patterns. Availability and implementation The workflow is available both as a bash script and as a Galaxy workflow from https://github.com/AnneHoffmann/tRNA-read-mapping. Contact fabian@tbi.univie.ac.at. Supplementary information Supplementary data are available at Bioinformatics online.

[1]  Tim R. Mercer,et al.  The Human Mitochondrial Transcriptome , 2011, Cell.

[2]  Peter F. Stadler,et al.  Fast Mapping of Short Sequences with Mismatches, Insertions and Deletions Using Index Structures , 2009, PLoS Comput. Biol..

[3]  M. Helm,et al.  tRNA stabilization by modified nucleotides. , 2010, Biochemistry.

[4]  M. DePristo,et al.  The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. , 2010, Genome research.

[5]  J. Kidd,et al.  Discovery and characterization of Alu repeat sequences via precise local read assembly , 2015, bioRxiv.

[6]  A. Marchfelder,et al.  The making of tRNAs and more - RNase P and tRNase Z. , 2009, Progress in molecular biology and translational science.

[7]  Sebastian A. Leidel,et al.  Modify or die? - RNA modification defects in metazoans , 2014, RNA biology.

[8]  D. Liao,et al.  Concerted evolution: molecular mechanism and biological implications. , 1999, American journal of human genetics.

[9]  F. Cramer,et al.  The -C-C-A end of tRNA and its role in protein biosynthesis. , 1985, Progress in nucleic acid research and molecular biology.

[10]  Helga Thorvaldsdóttir,et al.  Integrative Genomics Viewer , 2011, Nature Biotechnology.

[11]  Tao Pan,et al.  Tissue-Specific Differences in Human Transfer RNA Expression , 2006, PLoS genetics.

[12]  V. de Crécy-Lagard,et al.  The Levels of a Universally Conserved tRNA Modification Regulate Cell Growth* , 2015, The Journal of Biological Chemistry.

[13]  Christiane Branlant,et al.  Identification of modified residues in RNAs by reverse transcription-based methods. , 2007, Methods in enzymology.

[14]  Mihai Pop,et al.  DNACLUST: accurate and efficient clustering of phylogenetic marker genes , 2011, BMC Bioinformatics.

[15]  Piet Herdewijn,et al.  A methyl group controls conformational equilibrium in human mitochondrial tRNA(Lys). , 2007, Journal of the American Chemical Society.

[16]  Steven L Salzberg,et al.  Fast gapped-read alignment with Bowtie 2 , 2012, Nature Methods.

[17]  Andreas Hildebrandt,et al.  The reverse transcription signature of N-1-methyladenosine in RNA-Seq is sequence dependent , 2015, Nucleic acids research.

[18]  Jian-Kang Zhu,et al.  Bioinformatics analysis suggests base modifications of tRNAs and miRNAs in Arabidopsis thaliana , 2009, BMC Genomics.

[19]  Todd M. Lowe,et al.  ARM-Seq: AlkB-facilitated RNA methylation sequencing reveals a complex landscape of modified tRNA fragments , 2015, Nature Methods.

[20]  R Giegé,et al.  Universal rules and idiosyncratic features in tRNA identity. , 1998, Nucleic acids research.

[21]  C. Sarkar,et al.  A-to-I editing in human miRNAs is enriched in seed sequence, influenced by sequence contexts and significantly hypoedited in glioblastoma multiforme , 2017, Scientific Reports.

[22]  A. Hopper,et al.  tRNA biology charges to the front. , 2010, Genes & development.

[23]  Y. Pilpel,et al.  An Evolutionarily Conserved Mechanism for Controlling the Efficiency of Protein Translation , 2010, Cell.

[24]  S. Eddy,et al.  tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. , 1997, Nucleic acids research.

[25]  R. Maraia,et al.  Factors That Shape Eukaryotic tRNAomes: Processing, Modification and Anticodon–Codon Use , 2017, Biomolecules.

[26]  P. Stadler,et al.  LOTTE-seq (Long hairpin oligonucleotide based tRNA high-throughput sequencing): specific selection of tRNAs with 3’-CCA end for high-throughput sequencing , 2019, RNA biology.

[27]  Yuri Motorin,et al.  Detecting RNA modifications in the epitranscriptome: predict and validate , 2017, Nature Reviews Genetics.

[28]  T. Pan,et al.  tRNA base methylation identification and quantification via high-throughput sequencing , 2016, RNA.

[29]  Helga Thorvaldsdóttir,et al.  Integrative Genomics Viewer (IGV): high-performance genomics data visualization and exploration , 2012, Briefings Bioinform..

[30]  Gunnar Rätsch,et al.  MMR: a tool for read multi-mapper resolution , 2015, bioRxiv.

[31]  R. Maraia,et al.  Plasticity and diversity of tRNA anticodon determinants of substrate recognition by eukaryotic A37 isopentenyltransferases. , 2011, RNA.

[32]  M. Biel,et al.  Isotope-Based Analysis of Modified tRNA Nucleosides Correlates Modification Density with Translational Efficiency , 2012, Angewandte Chemie.

[33]  Yuri Motorin,et al.  High-throughput sequencing for 1-methyladenosine (m(1)A) mapping in RNA. , 2016, Methods.

[34]  P. Agris,et al.  Highly conserved modified nucleosides influence Mg2+-dependent tRNA folding. , 2002, Nucleic acids research.

[35]  Yuk Yee Leung,et al.  HAMR: high-throughput annotation of modified ribonucleotides , 2013, RNA.

[36]  M. Mörl,et al.  tRNA Modifications: Impact on Structure and Thermal Adaptation , 2017, Biomolecules.

[37]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[38]  Aaron R. Quinlan,et al.  Bioinformatics Applications Note Genome Analysis Bedtools: a Flexible Suite of Utilities for Comparing Genomic Features , 2022 .

[39]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[40]  Knut Reinert,et al.  A novel and well-defined benchmarking method for second generation read mapping , 2011, BMC Bioinformatics.

[41]  Peter F. Stadler,et al.  tRNAdb 2009: compilation of tRNA sequences and tRNA genes , 2008, Nucleic Acids Res..

[42]  Andreas Hildebrandt,et al.  CoverageAnalyzer (CAn): A Tool for Inspection of Modification Signatures in RNA Sequencing Profiles , 2016, Biomolecules.

[43]  Data production leads,et al.  An integrated encyclopedia of DNA elements in the human genome , 2012 .

[44]  J. Michael Cherry,et al.  ENCODE data at the ENCODE portal , 2015, Nucleic Acids Res..

[45]  Paul F Agris,et al.  tRNA's wobble decoding of the genome: 40 years of modification. , 2007, Journal of molecular biology.

[46]  R. B. Richardson,et al.  Greater organ involution in highly proliferative tissues associated with the early onset and acceleration of ageing in humans , 2014, Experimental Gerontology.

[47]  Sam Griffiths-Jones,et al.  tRNA anticodon shifts in eukaryotic genomes , 2014, RNA.

[48]  I. Ruvinsky,et al.  Family Size and Turnover Rates among Several Classes of Small Non–Protein-Coding RNA Genes in Caenorhabditis Nematodes , 2012, Genome biology and evolution.

[49]  J. Alfonzo,et al.  Transfer RNA modifications: nature's combinatorial chemistry playground , 2013, Wiley interdisciplinary reviews. RNA.

[50]  B. Gregory,et al.  HAMR: High-Throughput Annotation of Modified Ribonucleotides. , 2018, Methods in molecular biology.

[51]  Casey M. Bergman,et al.  The Evolution of tRNA Genes in Drosophila , 2010, Genome biology and evolution.

[52]  Robert C. Edgar,et al.  BIOINFORMATICS APPLICATIONS NOTE , 2001 .

[53]  Janusz M Bujnicki,et al.  Distribution and frequencies of post-transcriptional modifications in tRNAs , 2014, RNA biology.

[54]  M. Vinayak,et al.  Queuosine modification of tRNA: its divergent role in cellular machinery. , 2009, Bioscience reports.

[55]  Thomas J. Begley,et al.  tRNA modifications regulate translation during cellular stress , 2014, FEBS letters.

[56]  Steve Hoffmann,et al.  Traces of post-transcriptional RNA modifications in deep sequencing data , 2011, Biological chemistry.

[57]  Toralf Kirsten,et al.  Genomic organization of eukaryotic tRNAs , 2010, BMC Genomics.

[58]  Carsten O. Daub,et al.  Probabilistic resolution of multi-mapping reads in massively parallel sequencing data using MuMRescueLite , 2009, Bioinform..

[59]  E. Wang,et al.  Tertiary structure base pairs between D- and TpsiC-loops of Escherichia coli tRNA(Leu) play important roles in both aminoacylation and editing. , 2003, Nucleic acids research.

[60]  A. Torres Enjoy the Silence: Nearly Half of Human tRNA Genes Are Silent , 2019, Bioinformatics and biology insights.

[61]  T. Mercer,et al.  The human mitochondrial transcriptome and the RNA‐binding proteins that regulate its expression , 2012, Wiley interdisciplinary reviews. RNA.

[62]  P. Farabaugh,et al.  Transfer RNA modifications that alter +1 frameshifting in general fail to affect -1 frameshifting. , 2003, RNA.

[63]  Herbert H. Tsang,et al.  Meta-analysis of small RNA-sequencing errors reveals ubiquitous post-transcriptional RNA modifications , 2009, Nucleic acids research.

[64]  Nancy Retzlaff,et al.  Orthologs, turn-over, and remolding of tRNAs in primates and fruit flies , 2016, BMC Genomics.

[65]  Martin Renqiang Min,et al.  An integrated encyclopedia of DNA elements in the human genome , 2012 .

[66]  Chengqi Yi,et al.  Efficient and quantitative high-throughput transfer RNA sequencing , 2015, Nature Methods.

[67]  J. Coller,et al.  tRNA Metabolism and Neurodevelopmental Disorders. , 2019, Annual review of genomics and human genetics.

[68]  Gregory Kucherov,et al.  Dynamic read mapping and online consensus calling for better variant detection , 2016, 1605.09070.

[69]  O. Reina,et al.  Inosine modifications in human tRNAs are incorporated at the precursor tRNA level , 2015, Nucleic acids research.