Monitoring transcription initiation activities in rat and dog

The promoter landscape of several non-human model organisms is far from complete. As a part of FANTOM5 data collection, we generated 13 profiles of transcription initiation activities in dog and rat aortic smooth muscle cells, mesenchymal stem cells and hepatocytes by employing CAGE (Cap Analysis of Gene Expression) technology combined with single molecule sequencing. Our analyses show that the CAGE profiles recapitulate known transcription start sites (TSSs) consistently, in addition to uncover novel TSSs. Our dataset can be thus used with high confidence to support gene annotation in dog and rat species. We identified 28,497 and 23,147 CAGE peaks, or promoter regions, for rat and dog respectively, and associated them to known genes. This approach could be seen as a standard method for improvement of existing gene models, as well as discovery of novel genes. Given that the FANTOM5 data collection includes dog and rat matched cell types in human and mouse as well, this data would also be useful for cross-species studies.

[1]  Piero Carninci,et al.  On-the-fly selection of cell-specific enhancers, genes, miRNAs and proteins across the human body using SlideBase , 2016, Database J. Biol. Databases Curation.

[2]  K. Lindblad-Toh,et al.  Comparative genomics as a tool to understand evolution and disease , 2013, Genome research.

[3]  Jordan A. Ramilowski,et al.  An atlas of human long non-coding RNAs with accurate 5′ ends , 2017, Nature.

[4]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[5]  Carsten O. Daub,et al.  TagDust—a program to eliminate artifacts from next generation sequencing data , 2009, Bioinform..

[6]  Steven L Salzberg,et al.  HISAT: a fast spliced aligner with low memory requirements , 2015, Nature Methods.

[7]  S. Salzberg,et al.  The Transcriptional Landscape of the Mammalian Genome , 2005, Science.

[8]  Daniel J. Gaffney,et al.  A survey of best practices for RNA-seq data analysis , 2016, Genome Biology.

[9]  Anton J. Enright,et al.  Network visualization and analysis of gene expression data using BioLayout Express3D , 2009, Nature Protocols.

[10]  A. Sandelin,et al.  Metazoan promoters: emerging characteristics and insights into transcriptional regulation , 2012, Nature Reviews Genetics.

[11]  Mick Watson,et al.  Errors in RNA-Seq quantification affect genes of relevance to human disease , 2015, Genome Biology.

[12]  Jun Wang,et al.  The draft genome of Tibetan hulless barley reveals adaptive patterns to the high stressful Tibetan Plateau , 2015, Proceedings of the National Academy of Sciences.

[13]  Piero Carninci,et al.  CAGE (cap analysis of gene expression): a protocol for the detection of promoter and transcriptional networks. , 2012, Methods in molecular biology.

[14]  Thomas J. Ha,et al.  Transcribed enhancers lead waves of coordinated transcription in transitioning mammalian cells , 2015, Science.

[15]  Terrence S. Furey,et al.  The UCSC Genome Browser Database , 2003, Nucleic Acids Res..

[16]  Piero Carninci,et al.  FANTOM5 transcriptome catalog of cellular states based on Semantic MediaWiki , 2016, Database J. Biol. Databases Curation.

[17]  Piero Carninci,et al.  Unamplified Cap Analysis of Gene Expression on a Single-molecule Sequencer , 2022 .

[18]  Xiangqin Cui,et al.  Design and validation issues in RNA-seq experiments , 2011, Briefings Bioinform..

[19]  Martin S. Taylor,et al.  The transcriptional network that controls growth arrest and differentiation in a human myeloid leukemia cell line , 2009, Nature Genetics.

[20]  J. Harrow,et al.  Systematic evaluation of spliced alignment programs for RNA-seq data , 2013, Nature Methods.

[21]  Cesare Furlanello,et al.  A promoter-level mammalian expression atlas , 2015 .

[22]  Martin S. Taylor,et al.  Genome-wide analysis of mammalian promoter architecture and evolution , 2006, Nature Genetics.

[23]  angesichts der Corona-Pandemie,et al.  UPDATE , 1973, The Lancet.

[24]  Nadav S. Bar,et al.  Landscape of transcription in human cells , 2012, Nature.

[25]  Piero Carninci,et al.  Paradigm shifts in genomics through the FANTOM projects , 2015, Mammalian Genome.

[26]  Daniel W. A. Buchan,et al.  The tomato genome sequence provides insights into fleshy fruit evolution , 2012, Nature.

[27]  Rezvan Ehsani,et al.  EpiFactors: a comprehensive database of human epigenetic factors and complexes , 2015, Database J. Biol. Databases Curation.

[28]  Ariel S. Schwartz,et al.  An Atlas of Combinatorial Transcriptional Regulation in Mouse and Man , 2010, Cell.

[29]  Wen J. Li,et al.  Reference sequence (RefSeq) database at NCBI: current status, taxonomic expansion, and functional annotation , 2015, Nucleic Acids Res..

[30]  Martin S. Taylor,et al.  The frequent evolutionary birth and death of functional promoters in mouse and human , 2015, Genome research.

[31]  D. Karolchik,et al.  The UCSC Genome Browser database: 2016 update , 2015, bioRxiv.

[32]  E. Birney,et al.  Analysis of the mouse transcriptome based on functional annotation of 60,770 full-length cDNAs , 2002, Nature.

[33]  Johnf . Thompson,et al.  Single Molecule Sequencing with a HeliScope Genetic Analysis System , 2010, Current protocols in molecular biology.

[34]  David Haussler,et al.  The UCSC Genome Browser database: 2017 update , 2016, Nucleic Acids Res..

[35]  M. Gerstein,et al.  RNA-Seq: a revolutionary tool for transcriptomics , 2009, Nature Reviews Genetics.

[36]  Julie M Sheridan,et al.  edgeR: a versatile tool for the analysis of shRNA-seq and CRISPR-Cas9 genetic screens , 2014, F1000Research.

[37]  Aaron R. Quinlan,et al.  Bioinformatics Applications Note Genome Analysis Bedtools: a Flexible Suite of Utilities for Comparing Genomic Features , 2022 .

[38]  Derek W Wright,et al.  Gateways to the FANTOM5 promoter level mammalian expression atlas , 2015, Genome Biology.

[39]  T. Meehan,et al.  An atlas of active enhancers across human cell types and tissues , 2014, Nature.

[40]  David J. Arenillas,et al.  CAGEd-oPOSSUM: motif enrichment analysis from CAGE-derived TSSs , 2016, bioRxiv.

[41]  C. Glass,et al.  Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities. , 2010, Molecular cell.

[42]  Yoshihide Hayashizaki,et al.  Interactive visualization and analysis of large-scale sequencing datasets using ZENBU , 2014, Nature Biotechnology.