Systematic refinement of gene annotations by parsing mRNA 3' end sequencing datasets.

Alternative cleavage and polyadenylation generates mRNA 3' isoforms in a cell type-specific manner. Due to finite available RNA sequencing data of organisms with vast cell type complexity, currently available gene annotation resources are incomplete, which poses significant challenges to the comprehensive interpretation and quantification of transcriptomes. In this chapter, we introduce 3'GAmES, a stand-alone computational pipeline for the identification and quantification of novel mRNA 3'end isoforms from 3'mRNA sequencing data. 3'GAmES expands available repositories and improves comprehensive gene-tag counting by cost-effective 3' mRNA sequencing, faithfully mirroring whole-transcriptome RNAseq measurements. By employing R and bash shell scripts (assembled in a Singularity container) 3'GAmES systematically augments cell type-specific 3' ends of RNA polymerase II transcripts and increases the sensitivity of quantitative gene expression profiling by 3' mRNA sequencing. Public access: https://github.com/AmeresLab/3-GAmES.git.

[1]  J. Ule,et al.  High-Resolution RNA Maps Suggest Common Principles of Splicing and Polyadenylation Regulation by TDP-43 , 2017, Cell reports.

[2]  D. Bartel,et al.  Formation, Regulation and Evolution of Caenorhabditis elegans 3′UTRs , 2010, Nature.

[3]  Johannes Zuber,et al.  Thiol-linked alkylation of RNA to assess expression dynamics , 2017, Nature Methods.

[4]  Steven W. Flavell,et al.  Genome-Wide Analysis of MEF2 Transcriptional Program Reveals Synaptic Target Genes and Neuronal Activity-Dependent Polyadenylation Site Selection , 2008, Neuron.

[5]  Wei Chen,et al.  Alternative Polyadenylation: Methods, Findings, and Impacts , 2017, Genom. Proteom. Bioinform..

[6]  T. Babak,et al.  A quantitative atlas of polyadenylation in five mammals , 2012, Genome research.

[7]  K. Nishida,et al.  Mechanisms and consequences of alternative polyadenylation. , 2011, Molecules and Cells.

[8]  M. Hentze,et al.  3′ end mRNA processing: molecular mechanisms and implications for health and disease , 2008, The EMBO journal.

[9]  D. Gautheret,et al.  Patterns of variant polyadenylation signal usage in human genes. , 2000, Genome research.

[10]  Wei Sun,et al.  Global analysis of regulatory divergence in the evolution of mouse alternative polyadenylation , 2016, Molecular systems biology.

[11]  E. Wahle,et al.  Control of poly(A) tail length , 2011, Wiley interdisciplinary reviews. RNA.

[12]  Larry N. Singh,et al.  U1 snRNP Determines mRNA Length and Regulates Isoform Expression , 2012, Cell.

[13]  J. Graber,et al.  Signals for pre‐mRNA cleavage and polyadenylation , 2012, Wiley interdisciplinary reviews. RNA.

[14]  Arndt von Haeseler,et al.  Quantification of experimentally induced nucleotide conversions in high-throughput sequencing datasets , 2019, BMC Bioinformatics.

[15]  R. Bock Witnessing Genome Evolution: Experimental Reconstruction of Endosymbiotic and Horizontal Gene Transfer. , 2017, Annual review of genetics.

[16]  Astrid Gall,et al.  Ensembl 2020 , 2019, Nucleic Acids Res..

[17]  E. Liu,et al.  Next-generation DNA sequencing of paired-end tags (PET) for transcriptome and genome analyses. , 2009, Genome research.

[18]  Arndt von Haeseler,et al.  NextGenMap: fast and accurate read mapping in highly polymorphic genomes , 2013, Bioinform..

[19]  M. Zavolan,et al.  Alternative cleavage and polyadenylation in health and disease , 2019, Nature Reviews Genetics.

[20]  M. Zavolan,et al.  PolyASite 2.0: a consolidated atlas of polyadenylation sites from 3′ end sequencing , 2019, Nucleic Acids Res..

[21]  Ryan D. Morin,et al.  Next-generation tag sequencing for cancer gene expression profiling. , 2009, Genome research.

[22]  B. Tian,et al.  Alternative polyadenylation of mRNA precursors , 2016, Nature Reviews Molecular Cell Biology.

[23]  G. Yehia,et al.  Analysis of alterative cleavage and polyadenylation by 3′ region extraction and deep sequencing , 2012, Nature Methods.

[24]  P. Sharp,et al.  Proliferating Cells Express mRNAs with Shortened 3' Untranslated Regions and Fewer MicroRNA Target Sites , 2008, Science.

[25]  C. Hon,et al.  Quantification of stochastic noise of splicing and polyadenylation in Entamoeba histolytica , 2012, Nucleic acids research.

[26]  Melissa J. Moore,et al.  Pre-mRNA Processing Reaches Back toTranscription and Ahead to Translation , 2009, Cell.

[27]  B. Tian,et al.  Progressive lengthening of 3′ untranslated regions of mRNAs by alternative polyadenylation during mouse embryonic development , 2009, Proceedings of the National Academy of Sciences.

[28]  Maria Carmo-Fonseca,et al.  Dynamic transitions in RNA polymerase II density profiles during transcription termination , 2012, Genome research.

[29]  P. Moll,et al.  QuantSeq 3[prime] mRNA sequencing for RNA quantification , 2014 .

[30]  J. Yates,et al.  Molecular architecture of the human pre-mRNA 3' processing complex. , 2009, Molecular cell.

[31]  J. Hadfield,et al.  RNA sequencing: the teenage years , 2019, Nature Reviews Genetics.

[32]  Wen J. Li,et al.  Reference sequence (RefSeq) database at NCBI: current status, taxonomic expansion, and functional annotation , 2015, Nucleic Acids Res..

[33]  Vanessa Sochat,et al.  Singularity: Scientific containers for mobility of compute , 2017, PloS one.

[34]  D. Bartel,et al.  Extensive alternative polyadenylation during zebrafish development , 2012, Genome research.

[35]  Bin Tian,et al.  A large-scale analysis of mRNA polyadenylation of human and mouse genes , 2005, Nucleic acids research.

[36]  C. Mayr Regulation by 3'-Untranslated Regions. , 2017, Annual review of genetics.

[37]  Ralf Schmidt,et al.  A comprehensive analysis of 3′ end sequencing data sets reveals novel polyadenylation signals and the repressive role of heterogeneous ribonucleoprotein C on cleavage and polyadenylation , 2015, bioRxiv.

[38]  E. Birney,et al.  Mapping identifiers for the integration of genomic datasets with the R/Bioconductor package biomaRt , 2009, Nature Protocols.

[39]  Christine Mayr,et al.  Evolution and Biological Roles of Alternative 3'UTRs. , 2016, Trends in cell biology.

[40]  Andrew H. Beck,et al.  3′-End Sequencing for Expression Quantification (3SEQ) from Archival Tumor Samples , 2010, PloS one.