A comprehensive analysis of 3′ end sequencing data sets reveals novel polyadenylation signals and the repressive role of heterogeneous ribonucleoprotein C on cleavage and polyadenylation

Alternative polyadenylation (APA) is a general mechanism of transcript diversification in mammals, which has been recently linked to proliferative states and cancer. Different 3' untranslated region (3' UTR) isoforms interact with different RNA-binding proteins (RBPs), which modify the stability, translation, and subcellular localization of the corresponding transcripts. Although the heterogeneity of pre-mRNA 3' end processing has been established with high-throughput approaches, the mechanisms that underlie systematic changes in 3' UTR lengths remain to be characterized. Through a uniform analysis of a large number of 3' end sequencing data sets, we have uncovered 18 signals, six of which are novel, whose positioning with respect to pre-mRNA cleavage sites indicates a role in pre-mRNA 3' end processing in both mouse and human. With 3' end sequencing we have demonstrated that the heterogeneous ribonucleoprotein C (HNRNPC), which binds the poly(U) motif whose frequency also peaks in the vicinity of polyadenylation (poly(A)) sites, has a genome-wide effect on poly(A) site usage. HNRNPC-regulated 3' UTRs are enriched in ELAV-like RBP 1 (ELAVL1) binding sites and include those of the CD47 gene, which participate in the recently discovered mechanism of 3' UTR-dependent protein localization (UDPL). Our study thus establishes an up-to-date, high-confidence catalog of 3' end processing sites and poly(A) signals, and it uncovers an important role of HNRNPC in regulating 3' end processing. It further suggests that U-rich elements mediate interactions with multiple RBPs that regulate different stages in a transcript's life cycle.

[1]  Andrew H. Beck,et al.  3′-End Sequencing for Expression Quantification (3SEQ) from Archival Tumor Samples , 2010, PloS one.

[2]  Michael T. McManus,et al.  Pervasive Transcription of the Human Genome Produces Thousands of Previously Unidentified Long Intergenic Noncoding RNAs , 2013, PLoS genetics.

[3]  P. Kapranov,et al.  Comprehensive Polyadenylation Site Maps in Yeast and Human Reveal Pervasive Alternative Polyadenylation , 2010, Cell.

[4]  J. D. den Dunnen,et al.  RNA sequencing: from tag-based profiling to resolving complete transcript structure , 2014, Cellular and Molecular Life Sciences.

[5]  Julie L. Yang,et al.  Ubiquitously transcribed genes use alternative polyadenylation to achieve tissue-specific expression , 2013, Genes & development.

[6]  Wei Li,et al.  CFIm25 links Alternative Polyadenylation to Glioblastoma Tumor Suppression , 2014, Nature.

[7]  Mihaela Zavolan,et al.  Argonaute CLIP--a method to identify in vivo targets of miRNAs. , 2012, Methods.

[8]  L. Steinmetz,et al.  Alternative polyadenylation diversifies post‐transcriptional regulation by selective RNA–protein interactions , 2014, Molecular Systems Biology.

[9]  Terrence S. Furey,et al.  The UCSC Genome Browser Database: update 2006 , 2005, Nucleic Acids Res..

[10]  Auinash Kalsotra,et al.  Systematic Profiling of Poly(A)+ Transcripts Modulated by Core 3’ End Processing and Splicing Factors Reveals Regulatory Rules of Alternative Cleavage and Polyadenylation , 2015, PLoS genetics.

[11]  Michael Recce,et al.  PolyA_DB: a database for mammalian mRNA polyadenylation , 2004, Nucleic Acids Res..

[12]  Thomas R. Gingeras,et al.  STAR: ultrafast universal RNA-seq aligner , 2013, Bioinform..

[13]  J. Ule,et al.  CPSF 30 and Wdr 33 directly bind to AAUAAA in mammalian mRNA 3 9 processing , 2014 .

[14]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[15]  M. Zavolan,et al.  A quantitative analysis of CLIP methods for identifying binding sites of RNA-binding proteins , 2011, Nature Methods.

[16]  M. Wollerton,et al.  Polypyrimidine Tract Binding Protein Modulates Efficiency of Polyadenylation , 2004, Molecular and Cellular Biology.

[17]  M. Ohno,et al.  hnRNP C Tetramer Measures RNA Length to Classify RNA Polymerase II Transcripts for Export , 2012, Science.

[18]  C. Burge,et al.  3′ UTR-isoform choice has limited influence on the stability and translational efficiency of most mRNAs in mouse fibroblasts , 2013, Genome research.

[19]  Peter J. Shepard,et al.  Complex and dynamic landscape of RNA polyadenylation revealed by PAS-Seq. , 2011, RNA.

[20]  Jie Li,et al.  APASdb: a database describing alternative poly(A) sites and selection of heterogeneous cleavage sites downstream of poly(A) signals , 2014, Nucleic Acids Res..

[21]  Chuan He,et al.  N6-methyladenosine-dependent RNA structural switches regulate RNA-protein interactions , 2015, Nature.

[22]  M. Antoniou,et al.  A physical and functional link between splicing factors promotes pre-mRNA 3′ end processing , 2009, Nucleic acids research.

[23]  Li Liu,et al.  Regulation of lncRNA expression , 2014, Cellular & Molecular Biology Letters.

[24]  M. Wickens,et al.  Point mutations in AAUAAA and the poly (A) addition site: effects on the accuracy and efficiency of cleavage and polyadenylation in vitro. , 1990, Nucleic acids research.

[25]  Larry N. Singh,et al.  U1 snRNP Determines mRNA Length and Regulates Isoform Expression , 2012, Cell.

[26]  W. Keller,et al.  Human Fip1 is a subunit of CPSF that binds to U‐rich RNA elements and stimulates poly(A) polymerase , 2004, The EMBO journal.

[27]  Mary Goldman,et al.  The UCSC Genome Browser database: extensions and updates 2013 , 2012, Nucleic Acids Res..

[28]  J. Graber,et al.  Signals for pre‐mRNA cleavage and polyadenylation , 2012, Wiley interdisciplinary reviews. RNA.

[29]  Jeffrey G. Reifenberger,et al.  Direct RNA sequencing , 2009, Nature.

[30]  Bin Tian,et al.  PolyA_DB 2: mRNA polyadenylation sites in vertebrate genes , 2007, Nucleic Acids Res..

[31]  Jiahuai Han,et al.  Orphan nuclear receptor TR3 acts in autophagic cell death via mitochondrial signaling pathway. , 2014, Nature chemical biology.

[32]  Mary Goldman,et al.  The UCSC Genome Browser database: update 2011 , 2010, Nucleic Acids Res..

[33]  W. Lestourgeon,et al.  Solution structure of the symmetric coiled coil tetramer formed by the oligomerization domain of hnRNP C: implications for biological function. , 2005, Journal of molecular biology.

[34]  Wencheng Li,et al.  RBBP6 isoforms regulate the human polyadenylation machinery and modulate expression of mRNAs with AU-rich 3′ UTRs , 2014, Genes & development.

[35]  Thomas D. Wu,et al.  GMAP: a genomic mapping and alignment program for mRNA and EST sequence , 2005, Bioinform..

[36]  Cesare Furlanello,et al.  A promoter-level mammalian expression atlas , 2015 .

[37]  M. Zavolan,et al.  Reconstitution of CPSF active in polyadenylation: recognition of the polyadenylation signal by WDR33 , 2014, Genes & development.

[38]  Jeffrey Wilusz,et al.  Downstream sequence elements with different affinities for the hnRNP H/H' protein influence the processing efficiency of mammalian polyadenylation signals. , 2002, Nucleic acids research.

[39]  Yongsheng Shi,et al.  Alternative polyadenylation: new insights from global analyses. , 2012, RNA.

[40]  Mohsen Khorshid,et al.  CLIPZ: a database and analysis environment for experimentally determined binding sites of RNA-binding proteins , 2010, Nucleic Acids Res..

[41]  G. Crooks,et al.  WebLogo: a sequence logo generator. , 2004, Genome research.

[42]  N. Proudfoot,et al.  3′ Non-coding region sequences in eukaryotic messenger RNA , 1976, Nature.

[43]  Larry N. Singh,et al.  U1 snRNP protects pre-mRNAs from premature cleavage and polyadenylation , 2010, Nature.

[44]  P. Hordijk,et al.  Rac1‐induced cell migration requires membrane recruitment of the nuclear oncogene SET , 2007, The EMBO journal.

[45]  S. Liebhaber,et al.  αCP Poly(C) Binding Proteins Act as Global Regulators of Alternative Polyadenylation , 2013, Molecular and Cellular Biology.

[46]  E. Wang,et al.  Analysis and design of RNA sequencing experiments for identifying isoform regulation , 2010, Nature Methods.

[47]  D. Gautheret,et al.  Patterns of variant polyadenylation signal usage in human genes. , 2000, Genome research.

[48]  C. Bult,et al.  Functional annotation of a full-length mouse cDNA collection , 2001, Nature.

[49]  D. Bartel,et al.  Extensive alternative polyadenylation during zebrafish development , 2012, Genome research.

[50]  Bin Tian,et al.  A large-scale analysis of mRNA polyadenylation of human and mouse genes , 2005, Nucleic acids research.

[51]  G. Dreyfuss,et al.  Isolation of the heterogeneous nuclear RNA-ribonucleoprotein complex (hnRNP): a unique supramolecular assembly. , 1984, Proceedings of the National Academy of Sciences of the United States of America.

[52]  Xiaohui S. Xie,et al.  Fip1 regulates mRNA alternative polyadenylation to promote stem cell self‐renewal , 2014, The EMBO journal.

[53]  Yonggui Fu,et al.  Dynamic landscape of tandem 3′ UTRs during zebrafish development , 2012, Genome research.

[54]  Jonathan Hall,et al.  Structural and mechanistic insights into poly(uridine) tract recognition by the hnRNP C RNA recognition motif. , 2014, Journal of the American Chemical Society.

[55]  J. A. Steitz,et al.  HuR and mRNA stability , 2001, Cellular and Molecular Life Sciences CMLS.

[56]  Christine Mayr,et al.  Alternative 3'UTRs act as scaffolds to regulate membrane protein localization , 2015, Nature.

[57]  K. Venkataraman,et al.  Analysis of a noncanonical poly(A) site reveals a tripartite mechanism for vertebrate poly(A) site recognition. , 2005, Genes & development.

[58]  N. Proudfoot Ending the message: poly(A) signals then and now. , 2011, Genes & development.

[59]  D. Bartel,et al.  Poly(A)-tail profiling reveals an embryonic switch in translational control , 2014, Nature.

[60]  D. Bartel,et al.  Global analyses of the effect of different cellular contexts on microRNA targeting. , 2014, Molecular cell.

[61]  Bronwen L. Aken,et al.  GENCODE: The reference human genome annotation for The ENCODE Project , 2012, Genome research.

[62]  E. Lai,et al.  Widespread and extensive lengthening of 3′ UTRs in the mammalian brain , 2013, Genome research.

[63]  Peter F. Stadler,et al.  Fast Mapping of Short Sequences with Mismatches, Insertions and Deletions Using Index Structures , 2009, PLoS Comput. Biol..

[64]  B. Tian,et al.  Reprogramming of 3′ Untranslated Regions of mRNAs by Alternative Polyadenylation in Generation of Pluripotent Stem Cells from Different Cell Types , 2009, PloS one.

[65]  Wei Li,et al.  Dynamic analyses of alternative polyadenylation from RNA-seq reveal a 3′-UTR landscape across seven tumour types , 2014, Nature Communications.

[66]  R. Elkon,et al.  The Poly(A)-Binding Protein Nuclear 1 Suppresses Alternative Cleavage and Polyadenylation Sites , 2012, Cell.

[67]  C. Mayr,et al.  Widespread Shortening of 3′UTRs by Alternative Cleavage and Polyadenylation Activates Oncogenes in Cancer Cells , 2009, Cell.

[68]  J. Ule,et al.  Intergenic Alu exonisation facilitates the evolution of tissue-specific transcript ends , 2015, Nucleic acids research.

[69]  Patrice M. Milos,et al.  An in-depth map of polyadenylation sites in cancer , 2012, Nucleic acids research.

[70]  T. Shenk,et al.  The C proteins of heterogeneous nuclear ribonucleoprotein complexes interact with RNA sequences downstream of polyadenylation cleavage sites , 1988, Molecular and cellular biology.

[71]  A. Beyer,et al.  Identification and characterization of the packaging proteins of core 40S hnRNP particles , 1977, Cell.

[72]  Yongsheng Shi,et al.  The polyadenylation code: a unified model for the regulation of mRNA alternative poly adenylation , 2014, Journal of Zhejiang University SCIENCE B.

[73]  Aaron R. Quinlan,et al.  BIOINFORMATICS APPLICATIONS NOTE , 2022 .

[74]  B. Tian,et al.  Progressive lengthening of 3′ untranslated regions of mRNAs by alternative polyadenylation during mouse embryonic development , 2009, Proceedings of the National Academy of Sciences.

[75]  Y. Hayashizaki,et al.  Deep cap analysis gene expression (CAGE): genome-wide identification of promoters, quantification of their expression, and network inference. , 2008, BioTechniques.

[76]  J. Ule,et al.  iCLIP reveals the function of hnRNP particles in splicing at individual nucleotide resolution , 2010, Nature Structural &Molecular Biology.

[77]  Christopher B. Burge,et al.  Promoter directionality is controlled by U1 snRNP and polyadenylation signals , 2013, Nature.

[78]  C. Burd,et al.  The determinants of RNA-binding specificity of the heterogeneous nuclear ribonucleoprotein C proteins. , 1994, The Journal of biological chemistry.

[79]  John R Yates,et al.  CPSF30 and Wdr33 directly bind to AAUAAA in mammalian mRNA 3′ processing , 2014, Genes & development.

[80]  Brendan J. Frey,et al.  A compendium of RNA-binding motifs for decoding gene regulation , 2013, Nature.

[81]  C. Milcarek,et al.  The hnRNPs F and H2 bind to similar sequences to influence gene expression. , 2006, The Biochemical journal.

[82]  C R Cantor,et al.  In silico detection of control signals: mRNA 3'-end-processing sequences in diverse species. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[83]  Julian König,et al.  Direct Competition between hnRNP C and U2AF65 Protects the Transcriptome from the Exonization of Alu Elements , 2013, Cell.

[84]  Andreas R. Gruber,et al.  Cleavage factor Im is a key regulator of 3′ UTR length , 2012, RNA biology.

[85]  E. van Nimwegen,et al.  Global 3′ UTR shortening has a limited effect on protein abundance in proliferating T cells , 2014, Nature Communications.

[86]  G. Yehia,et al.  Analysis of alterative cleavage and polyadenylation by 3′ region extraction and deep sequencing , 2012, Nature Methods.

[87]  P. Sharp,et al.  Proliferating Cells Express mRNAs with Shortened 3' Untranslated Regions and Fewer MicroRNA Target Sites , 2008, Science.

[88]  T. Babak,et al.  A quantitative atlas of polyadenylation in five mammals , 2012, Genome research.

[89]  Laurent Gil,et al.  Ensembl 2013 , 2012, Nucleic Acids Res..

[90]  C. MacDonald,et al.  Reexamining the polyadenylation signal: were we wrong about AAUAAA? , 2002, Molecular and Cellular Endocrinology.

[91]  Xiaomin Zhao,et al.  A 57-Nucleotide Upstream Early Polyadenylation Element in Human Papillomavirus Type 16 Interacts with hFip1, CstF-64, hnRNP C1/C2, and Polypyrimidine Tract Binding Protein , 2005, Journal of Virology.

[92]  Mihaela Zavolan,et al.  Genome-wide analysis of pre-mRNA 3' end processing reveals a decisive role of human cleavage factor I in the regulation of 3' UTR length. , 2012, Cell reports.