Dynamic analyses of alternative polyadenylation from RNA-seq reveal a 3′-UTR landscape across seven tumour types

Alternative polyadenylation (APA) is a pervasive mechanism in the regulation of most human genes, and its implication in diseases including cancer is only beginning to be appreciated. Since conventional APA profiling has not been widely adopted, global cancer APA studies are very limited. Here we develop a novel bioinformatics algorithm (DaPars) for the de novo identification of dynamic APAs from standard RNA-seq. When applied to 358 TCGA Pan-Cancer tumour/normal pairs across seven tumour types, DaPars reveals 1,346 genes with recurrent and tumour-specific APAs. Most APA genes (91%) have shorter 3'-untranslated regions (3' UTRs) in tumours that can avoid microRNA-mediated repression, including glutaminase (GLS), a key metabolic enzyme for tumour proliferation. Interestingly, selected APA events add strong prognostic power beyond common clinical and molecular variables, suggesting their potential as novel prognostic biomarkers. Finally, our results implicate CstF64, an essential polyadenylation factor, as a master regulator of 3'-UTR shortening across multiple tumour types.

[1]  Kari Stefansson,et al.  A germline variant in the TP53 polyadenylation signal confers cancer susceptibility , 2011, Nature Genetics.

[2]  C. Burge,et al.  Conserved Seed Pairing, Often Flanked by Adenosines, Indicates that Thousands of Human Genes are MicroRNA Targets , 2005, Cell.

[3]  R. Elkon,et al.  Alternative cleavage and polyadenylation: extent, regulation and function , 2013, Nature Reviews Genetics.

[4]  J. Manley,et al.  Levels of polyadenylation factor CstF-64 control IgM heavy chain mRNA accumulation and other events associated with B cell differentiation. , 1998, Molecular cell.

[5]  Cole Trapnell,et al.  Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. , 2010, Nature biotechnology.

[6]  E. Hayden Activists sound alarm on tiered drug prices , 2014, Nature.

[7]  Anton J. Enright,et al.  Human MicroRNA Targets , 2004, PLoS biology.

[8]  Hongzhe Li,et al.  A change-point model for identifying 3′UTR switching by next-generation RNA sequencing , 2014, Bioinform..

[9]  Trevor Hastie,et al.  Regularization Paths for Generalized Linear Models via Coordinate Descent. , 2010, Journal of statistical software.

[10]  Ji Wan,et al.  Transcriptome-wide analyses of CstF64–RNA interactions in global regulation of mRNA alternative polyadenylation , 2012, Proceedings of the National Academy of Sciences.

[11]  Yonggui Fu,et al.  Genome-wide alternative polyadenylation in animals: insights from high-throughput technologies. , 2012, Journal of molecular cell biology.

[12]  Timothy L. Bailey,et al.  Gene expression Advance Access publication May 4, 2011 DREME: motif discovery in transcription factor ChIP-seq data , 2011 .

[13]  Wei Li,et al.  CFIm25 links Alternative Polyadenylation to Glioblastoma Tumor Suppression , 2014, Nature.

[14]  G. Yehia,et al.  Analysis of alterative cleavage and polyadenylation by 3′ region extraction and deep sequencing , 2012, Nature Methods.

[15]  Mihaela Zavolan,et al.  Genome-wide Analysis of Pre-mRNA 3 0 End Processing Reveals a Decisive Role of Human Cleavage Factor I in the Regulation of 3 0 UTR Length , 2012 .

[16]  Hanlee P. Ji,et al.  The MicroArray Quality Control (MAQC) project shows inter- and intraplatform reproducibility of gene expression measurements. , 2006, Nature biotechnology.

[17]  P. Sharp,et al.  Proliferating Cells Express mRNAs with Shortened 3' Untranslated Regions and Fewer MicroRNA Target Sites , 2008, Science.

[18]  Juw Won Park,et al.  MATS: a Bayesian framework for flexible detection of differential alternative splicing from RNA-Seq data , 2012, Nucleic acids research.

[19]  R. Elkon,et al.  Alternative Cleavage and Polyadenylation during Colorectal Cancer Development , 2012, Clinical Cancer Research.

[20]  Mark D. Robinson,et al.  edgeR: a Bioconductor package for differential expression analysis of digital gene expression data , 2009, Bioinform..

[21]  Tsung-Cheng Chang,et al.  c-Myc suppression of miR-23 enhances mitochondrial glutaminase and glutamine metabolism , 2009, Nature.

[22]  B. Tian,et al.  Progressive lengthening of 3′ untranslated regions of mRNAs by alternative polyadenylation during mouse embryonic development , 2009, Proceedings of the National Academy of Sciences.

[23]  W. Wong,et al.  Modeling non-uniformity in short-read rates in RNA-Seq data , 2010, Genome Biology.

[24]  Sandrine Dudoit,et al.  Evaluation of statistical methods for normalization and differential expression in mRNA-Seq experiments , 2010, BMC Bioinformatics.

[25]  J. Yates,et al.  Molecular architecture of the human pre-mRNA 3' processing complex. , 2009, Molecular cell.

[26]  T. Babak,et al.  A quantitative atlas of polyadenylation in five mammals , 2012, Genome research.

[27]  W. Souba,et al.  Cloning and analysis of unique human glutaminase isoforms generated by tissue-specific alternative splicing. , 1999, Physiological genomics.

[28]  Michael Recce,et al.  PolyA_DB: a database for mammalian mRNA polyadenylation , 2004, Nucleic Acids Res..

[29]  J. Matés,et al.  A novel glutaminase isoform in mammalian tissues , 2009, Neurochemistry International.

[30]  Adrian Wiestner,et al.  Point mutations and genomic deletions in CCND1 create stable truncated cyclin D1 mRNAs that are associated with increased proliferation rate and shorter survival. , 2007, Blood.

[31]  R. Elkon,et al.  E2F mediates enhanced alternative polyadenylation in proliferation , 2012, Genome Biology.

[32]  J. Márquez,et al.  Identification of two human glutaminase loci and tissue-specific expression of the two related genes , 2000, Mammalian Genome.

[33]  Peter J. Shepard,et al.  Complex and dynamic landscape of RNA polyadenylation revealed by PAS-Seq. , 2011, RNA.

[34]  Mikael Bodén,et al.  MEME Suite: tools for motif discovery and searching , 2009, Nucleic Acids Res..

[35]  E. Wang,et al.  Analysis and design of RNA sequencing experiments for identifying isoform regulation , 2010, Nature Methods.

[36]  D. Bartel,et al.  Extensive alternative polyadenylation during zebrafish development , 2012, Genome research.

[37]  Bin Tian,et al.  A large-scale analysis of mRNA polyadenylation of human and mouse genes , 2005, Nucleic acids research.

[38]  K. Bachman,et al.  Analysis of glutamine dependency in non-small cell lung cancer , 2012, Cancer biology & therapy.

[39]  Maqc Consortium The MicroArray Quality Control (MAQC) project shows inter- and intraplatform reproducibility of gene expression measurements , 2006, Nature Biotechnology.

[40]  L. Lim,et al.  MicroRNA targeting specificity in mammals: determinants beyond seed pairing. , 2007, Molecular cell.

[41]  R. Guigó,et al.  Modelling and simulating generic RNA-Seq experiments with the flux simulator , 2012, Nucleic acids research.

[42]  D. Hanahan,et al.  Hallmarks of Cancer: The Next Generation , 2011, Cell.

[43]  Tao Jiang,et al.  IsoLasso: A LASSO Regression Approach to RNA-Seq Based Transcriptome Assembly - (Extended Abstract) , 2011, RECOMB.

[44]  A. Cassago,et al.  Mitochondrial localization and structure-based phosphate activation mechanism of Glutaminase C with implications for cancer metabolism , 2012, Proceedings of the National Academy of Sciences.

[45]  C. Sander,et al.  Analysis of microRNA-target interactions across diverse cancer types , 2013, Nature Structural &Molecular Biology.

[46]  D. Sabatini,et al.  Cancer Cell Metabolism: Warburg and Beyond , 2008, Cell.

[47]  K. Kinzler,et al.  Cancer Genome Landscapes , 2013, Science.

[48]  Yongsheng Shi,et al.  Alternative polyadenylation: new insights from global analyses. , 2012, RNA.

[49]  Patrice M. Milos,et al.  An in-depth map of polyadenylation sites in cancer , 2012, Nucleic acids research.

[50]  J. Graber,et al.  Global changes in processing of mRNA 3' untranslated regions characterize clinically distinct cancer subtypes. , 2009, Cancer research.

[51]  Andrew M. Gross,et al.  Network-based stratification of tumor mutations , 2013, Nature Methods.

[52]  L. Paillard,et al.  AU-rich elements and associated factors: are there unifying principles? , 2006, Nucleic acids research.

[53]  S. Gabriel,et al.  Discovery and saturation analysis of cancer genes across 21 tumor types , 2014, Nature.

[54]  Mihaela Zavolan,et al.  Genome-wide analysis of pre-mRNA 3' end processing reveals a decisive role of human cleavage factor I in the regulation of 3' UTR length. , 2012, Cell reports.

[55]  Joseph K. Pickrell,et al.  Understanding mechanisms underlying human gene expression variation with RNA sequencing , 2010, Nature.

[56]  E. Lai,et al.  Widespread and extensive lengthening of 3′ UTRs in the mammalian brain , 2013, Genome research.

[57]  C. Mayr,et al.  Widespread Shortening of 3′UTRs by Alternative Cleavage and Polyadenylation Activates Oncogenes in Cancer Cells , 2009, Cell.

[58]  Gunnar Rätsch,et al.  rQuant.web: a tool for RNA-Seq-based transcript quantitation , 2010, Nucleic Acids Res..

[59]  K. Nishida,et al.  Mechanisms and consequences of alternative polyadenylation. , 2011, Molecules and Cells.

[60]  D. Gautheret,et al.  Patterns of variant polyadenylation signal usage in human genes. , 2000, Genome research.

[61]  R. Gill,et al.  Cox's regression model for counting processes: a large sample study : (preprint) , 1982 .