Update of the FANTOM web resource: expansion to provide additional transcriptome atlases

Abstract The FANTOM web resource (http://fantom.gsc.riken.jp/) was developed to provide easy access to the data produced by the FANTOM project. It contains the most complete and comprehensive sets of actively transcribed enhancers and promoters in the human and mouse genomes. We determined the transcription activities of these regulatory elements by CAGE (Cap Analysis of Gene Expression) for both steady and dynamic cellular states in all major and some rare cell types, consecutive stages of differentiation and responses to stimuli. We have expanded the resource by employing different assays, such as RNA-seq, short RNA-seq and a paired-end protocol for CAGE (CAGEscan), to provide new angles to study the transcriptome. That yielded additional atlases of long noncoding RNAs, miRNAs and their promoters. We have also expanded the CAGE analysis to cover rat, dog, chicken, and macaque species for a limited number of cell types. The CAGE data obtained from human and mouse were reprocessed to make them available on the latest genome assemblies. Here, we report the recent updates of both data and interfaces in the FANTOM web resource.

[1]  Nadav S. Bar,et al.  Landscape of transcription in human cells , 2012, Nature.

[2]  T. Meehan,et al.  An atlas of active enhancers across human cell types and tissues , 2014, Nature.

[3]  Jun Sese,et al.  ChIP‐Atlas: a data‐mining suite powered by full integration of public ChIP‐seq data , 2018, EMBO reports.

[4]  Piero Carninci,et al.  FANTOM5 transcriptome catalog of cellular states based on Semantic MediaWiki , 2016, Database J. Biol. Databases Curation.

[5]  E. Birney,et al.  Analysis of the mouse transcriptome based on functional annotation of 60,770 full-length cDNAs , 2002, Nature.

[6]  Sebastian D. Mackowiak,et al.  miRDeep2 accurately identifies known and hundreds of novel microRNA genes in seven animal clades , 2011, Nucleic acids research.

[7]  Aviv Regev,et al.  Comprehensive comparative analysis of 5’ end RNA sequencing methods , 2018, Nature Methods.

[8]  Martin S. Taylor,et al.  Genome-wide analysis of mammalian promoter architecture and evolution , 2006, Nature Genetics.

[9]  Piero Carninci,et al.  Monitoring transcription initiation activities in rat and dog , 2017, Scientific Data.

[10]  Yoshihide Hayashizaki,et al.  Interactive visualization and analysis of large-scale sequencing datasets using ZENBU , 2014, Nature Biotechnology.

[11]  Hiroshi Tanaka,et al.  FANTOM5 CAGE profiles of human and mouse samples , 2017, Scientific Data.

[12]  J. Ragoussis,et al.  A Large Fraction of Extragenic RNA Pol II Transcription Sites Overlap Enhancers , 2010, PLoS biology.

[13]  D. Karolchik,et al.  The UCSC Genome Browser database: 2016 update , 2015, bioRxiv.

[14]  Jordan A. Ramilowski,et al.  An atlas of human long non-coding RNAs with accurate 5′ ends , 2017, Nature.

[15]  Bronwen L. Aken,et al.  GENCODE: The reference human genome annotation for The ENCODE Project , 2012, Genome research.

[16]  A. Frankish,et al.  Towards a complete map of the human long non-coding RNA transcriptome , 2018, Nature Reviews Genetics.

[17]  Piero Carninci,et al.  Linking FANTOM5 CAGE peaks to annotations with CAGEscan , 2017, bioRxiv.

[18]  Thomas J. Ha,et al.  Transcribed enhancers lead waves of coordinated transcription in transitioning mammalian cells , 2015, Science.

[19]  Carsten O. Daub,et al.  Linking promoters to functional transcripts in small samples with nanoCAGE and CAGEscan , 2010, Nature Methods.

[20]  Piero Carninci,et al.  Unamplified Cap Analysis of Gene Expression on a Single-molecule Sequencer , 2022 .

[21]  Jay W. Shin,et al.  An integrated expression atlas of miRNAs and their promoters in human and mouse , 2017, Nature Biotechnology.

[22]  Ting Wang,et al.  Track data hubs enable visualization of user-defined genome-wide annotations on the UCSC Genome Browser , 2013, Bioinform..

[23]  Cesare Furlanello,et al.  A promoter-level mammalian expression atlas , 2015 .

[24]  Piero Carninci,et al.  Systematic analysis of transcription start sites in avian development , 2017, PLoS biology.

[25]  Piero Carninci,et al.  Transcription start site profiling of 15 anatomical regions of the Macaca mulatta central nervous system , 2017, Scientific Data.

[26]  Martin S. Taylor,et al.  The transcriptional network that controls growth arrest and differentiation in a human myeloid leukemia cell line , 2009, Nature Genetics.

[27]  Derek W Wright,et al.  Gateways to the FANTOM5 promoter level mammalian expression atlas , 2015, Genome Biology.

[28]  Robert D. Finn,et al.  Ensembl Genomes 2018: an integrated omics infrastructure for non-vertebrate species , 2017, Nucleic Acids Res..

[29]  S. Salzberg,et al.  The Transcriptional Landscape of the Mammalian Genome , 2005, Science.

[30]  Piero Carninci,et al.  FANTOM5 CAGE profiles of human and mouse reprocessed for GRCh38 and GRCm38 genome assemblies , 2017, Scientific Data.

[31]  Piero Carninci,et al.  Paradigm shifts in genomics through the FANTOM projects , 2015, Mammalian Genome.

[32]  Finn Drabløs,et al.  Update of the FANTOM web resource: high resolution transcriptome of diverse cell types in mammals , 2016, Nucleic Acids Res..

[33]  S. Dhanasekaran,et al.  The landscape of long noncoding RNAs in the human transcriptome , 2015, Nature Genetics.

[34]  Cole Trapnell,et al.  Integrative annotation of human large intergenic noncoding RNAs reveals global properties and specific subclasses. , 2011, Genes & development.

[35]  R. Wilson,et al.  Modernizing Reference Genome Assemblies , 2011, PLoS biology.

[36]  Jun S. Liu,et al.  The Genotype-Tissue Expression (GTEx) pilot analysis: Multitissue gene regulation in humans , 2015, Science.

[37]  C. Bult,et al.  Functional annotation of a full-length mouse cDNA collection , 2001, Nature.