APASdb: a database describing alternative poly(A) sites and selection of heterogeneous cleavage sites downstream of poly(A) signals

Increasing amounts of genes have been shown to utilize alternative polyadenylation (APA) 3′-processing sites depending on the cell and tissue type and/or physiological and pathological conditions at the time of processing, and the construction of genome-wide database regarding APA is urgently needed for better understanding poly(A) site selection and APA-directed gene expression regulation for a given biology. Here we present a web-accessible database, named APASdb (http://mosas.sysu.edu.cn/utr), which can visualize the precise map and usage quantification of different APA isoforms for all genes. The datasets are deeply profiled by the sequencing alternative polyadenylation sites (SAPAS) method capable of high-throughput sequencing 3′-ends of polyadenylated transcripts. Thus, APASdb details all the heterogeneous cleavage sites downstream of poly(A) signals, and maintains near complete coverage for APA sites, much better than the previous databases using conventional methods. Furthermore, APASdb provides the quantification of a given APA variant among transcripts with different APA sites by computing their corresponding normalized-reads, making our database more useful. In addition, APASdb supports URL-based retrieval, browsing and display of exon-intron structure, poly(A) signals, poly(A) sites location and usage reads, and 3′-untranslated regions (3′-UTRs). Currently, APASdb involves APA in various biological processes and diseases in human, mouse and zebrafish.

[1]  Qunfeng Dong,et al.  Using WebGBrowse to visualize genome annotation on GBrowse. , 2010, Cold Spring Harbor protocols.

[2]  C. Lutz,et al.  Alternative polyadenylation: a twist on mRNA 3' end formation. , 2008, ACS chemical biology.

[3]  Sören Müller,et al.  APADB: a database for alternative polyadenylation and microRNA regulation events , 2014, Database J. Biol. Databases Curation.

[4]  S. Lewis,et al.  The generic genome browser: a building block for a model organism system database. , 2002, Genome research.

[5]  Jiahui Liang,et al.  development UTRs during zebrafish ′ Dynamic landscape of tandem 3 Material Supplemental , 2012 .

[6]  C. Mayr,et al.  Widespread Shortening of 3′UTRs by Alternative Cleavage and Polyadenylation Activates Oncogenes in Cancer Cells , 2009, Cell.

[7]  Ernesto Picardi,et al.  UTRdb and UTRsite (RELEASE 2010): a collection of sequences and regulatory motifs of the untranslated regions of eukaryotic mRNAs , 2009, Nucleic Acids Res..

[8]  Björn Rotter,et al.  Massive analysis of cDNA Ends (MACE) and miRNA expression profiling identifies proatherogenic pathways in chronic kidney disease , 2013, Epigenetics.

[9]  Z. Gong,et al.  Zebrafish mRNA sequencing deciphers novelties in transcriptome dynamics during maternal to zygotic transition. , 2011, Genome research.

[10]  Bin Tian,et al.  PolyA_DB 2: mRNA polyadenylation sites in vertebrate genes , 2007, Nucleic Acids Res..

[11]  T. Babak,et al.  A quantitative atlas of polyadenylation in five mammals , 2012, Genome research.

[12]  C. Lutz,et al.  Alternative mRNA polyadenylation in eukaryotes: an effective regulator of gene expression , 2011, Wiley interdisciplinary reviews. RNA.

[13]  D Gautheret,et al.  Identification of alternate polyadenylation sites and analysis of their tissue distribution using EST data. , 2001, Genome research.

[14]  Yonggui Fu,et al.  Genome-wide alternative polyadenylation in animals: insights from high-throughput technologies. , 2012, Journal of molecular cell biology.

[15]  Cole Trapnell,et al.  Ultrafast and memory-efficient alignment of short DNA sequences to the human genome , 2009, Genome Biology.

[16]  J. Beisson,et al.  Paramecium tetraurelia: the renaissance of an early unicellular model. , 2010, Cold Spring Harbor protocols.

[17]  E Pauws,et al.  Heterogeneity in polyadenylation cleavage sites in mammalian mRNA sequences: implications for SAGE analysis. , 2001, Nucleic acids research.

[18]  K. Nishida,et al.  Mechanisms and consequences of alternative polyadenylation. , 2011, Molecules and Cells.

[19]  Lincoln Stein,et al.  Using GBrowse 2.0 to visualize and share next-generation sequence data , 2013, Briefings Bioinform..

[20]  Bin Tian,et al.  A large-scale analysis of mRNA polyadenylation of human and mouse genes , 2005, Nucleic acids research.

[21]  H. Meijer,et al.  Mechanisms of translational control by the 3' UTR in development and differentiation. , 2005, Seminars in cell & developmental biology.

[22]  J. Manley,et al.  Mechanism and regulation of mRNA polyadenylation. , 1997, Genes & development.

[23]  S. Kuersten,et al.  The power of the 3′ UTR: translational control and development , 2003, Nature Reviews Genetics.

[24]  Donglin Liu,et al.  BIOINFORMATICS APPLICATIONS NOTE Databases and ontologies PACdb: PolyA Cleavage Site and 3 ′-UTR Database , 2022 .

[25]  Mary Goldman,et al.  The UCSC Genome Browser database: update 2011 , 2010, Nucleic Acids Res..

[26]  Chong-Jian Chen,et al.  Differential genome-wide profiling of tandem 3' UTRs among human breast cancer and normal cells by high-throughput sequencing. , 2011, Genome research.

[27]  Matthew R. Pocock,et al.  The Bioperl toolkit: Perl modules for the life sciences. , 2002, Genome research.

[28]  J. Manley,et al.  Strange bedfellows: polyadenylation factors at the promoter. , 2003, Genes & development.

[29]  Jürg Bähler,et al.  Genome-wide analysis of poly(A) site selection in Schizosaccharomyces pombe , 2013, RNA.

[30]  R. Knight,et al.  Regions and Fewer MicroRNA Target Sites Proliferating Cells Express mRNAs with Shortened 3 ' Untranslated , 2012 .

[31]  I. Mattaj,et al.  The influence of 5′ and 3′ end structures on pre-mRNA metabolism , 1995, Journal of Cell Science.

[32]  Michael Recce,et al.  PolyA_DB: a database for mammalian mRNA polyadenylation , 2004, Nucleic Acids Res..

[33]  Philip Lijnzaad,et al.  The Ensembl genome database project , 2002, Nucleic Acids Res..

[34]  Anlong Xu,et al.  A Global Analysis of Tandem 3′UTRs in Eosinophilic Chronic Rhinosinusitis with Nasal Polyps , 2012, PloS one.

[35]  David Haussler,et al.  The UCSC Genome Browser database: update 2010 , 2009, Nucleic Acids Res..

[36]  R. Henrik Nilsson,et al.  Finding needles in haystacks: linking scientific names, reference specimens and molecular data for Fungi , 2014, Database J. Biol. Databases Curation.