TBro: visualization and management of de novo transcriptomes

RNA sequencing (RNA-seq) has become a powerful tool to understand molecular mechanisms and/or developmental programs. It provides a fast, reliable and cost-effective method to access sets of expressed elements in a qualitative and quantitative manner. Especially for non-model organisms and in absence of a reference genome, RNA-seq data is used to reconstruct and quantify transcriptomes at the same time. Even SNPs, InDels, and alternative splicing events are predicted directly from the data without having a reference genome at hand. A key challenge, especially for non-computational personnal, is the management of the resulting datasets, consisting of different data types and formats. Here, we present TBro, a flexible de novo transcriptome browser, tackling this challenge. TBro aggregates sequences, their annotation, expression levels as well as differential testing results. It provides an easy-to-use interface to mine the aggregated data and generate publication-ready visualizations. Additionally, it supports users with an intuitive cart system, that helps collecting and analysing biological meaningful sets of transcripts. TBro’s modular architecture allows easy extension of its functionalities in the future. Especially, the integration of new data types such as proteomic quantifications or array-based gene expression data is straightforward. Thus, TBro is a fully featured yet flexible transcriptome browser that supports approaching complex biological questions and enhances collaboration of numerous researchers. Database URL: tbro.carnivorom.com

[1]  Thomas Nussbaumer,et al.  RNASeqExpressionBrowser - a web interface to browse and visualize high-throughput expression data , 2014, Bioinform..

[2]  D. Altman,et al.  STATISTICAL METHODS FOR ASSESSING AGREEMENT BETWEEN TWO METHODS OF CLINICAL MEASUREMENT , 1986, The Lancet.

[3]  Sergio Contrino,et al.  InterMine: a flexible data warehouse system for the integration and analysis of heterogeneous biological data , 2012, Bioinform..

[4]  Chuan-Xi Zhang,et al.  De novo characterization of a whitefly transcriptome and analysis of its gene expression during development , 2010, BMC Genomics.

[5]  Colin N. Dewey,et al.  De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis , 2013, Nature Protocols.

[6]  Matthew Fraser,et al.  InterProScan 5: genome-scale protein function classification , 2014, Bioinform..

[7]  Geet Duggal,et al.  Salmon: Accurate, Versatile and Ultrafast Quantification from RNA-seq Data using Lightweight-Alignment , 2015 .

[8]  Martin Vingron,et al.  Oases: robust de novo RNA-seq assembly across the dynamic range of expression levels , 2012, Bioinform..

[9]  Peter J. Woolf,et al.  GAGE: generally applicable gene set enrichment for pathway analysis , 2009, BMC Bioinformatics.

[10]  Chaozu He,et al.  RNA-Seq analysis and de novo transcriptome assembly of Hevea brasiliensis , 2011, Plant Molecular Biology.

[11]  Stephen P. Ficklin,et al.  Tripal: a construction toolkit for online genome databases , 2011, Database J. Biol. Databases Curation.

[12]  M. Gerstein,et al.  RNA-Seq: a revolutionary tool for transcriptomics , 2009, Nature Reviews Genetics.

[13]  S. Wiegand,et al.  TraV: A Genome Context Sensitive Transcriptome Browser , 2014, PloS one.

[14]  H. Chandler Database , 1985 .

[15]  Brad Fitzpatrick,et al.  Distributed caching with memcached , 2004 .

[16]  Chris Mungall,et al.  A Chado case study: an ontology-based modular schema for representing genome-associated biological information , 2007, ISMB/ECCB.

[17]  J M Bland,et al.  Statistical methods for assessing agreement between two methods of clinical measurement , 1986 .

[18]  Cole Trapnell,et al.  Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. , 2010, Nature biotechnology.

[19]  Colin N. Dewey,et al.  RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome , 2011, BMC Bioinformatics.

[20]  N. Friedman,et al.  Trinity: reconstructing a full-length transcriptome without a genome from RNA-Seq data , 2011, Nature Biotechnology.

[21]  B. Williams,et al.  Mapping and quantifying mammalian transcriptomes by RNA-Seq , 2008, Nature Methods.

[22]  Ning Ma,et al.  BLAST+: architecture and applications , 2009, BMC Bioinformatics.

[23]  Mark Stitt,et al.  Mercator: a fast and simple web server for genome scale functional annotation of plant sequence data. , 2014, Plant, cell & environment.

[24]  Stephen P. Ficklin,et al.  Tripal v1.1: a standards-based toolkit for construction of online genetic and genomic databases , 2013, Database J. Biol. Databases Curation.

[25]  Zhe-zhi Wang,et al.  De novo transcriptome sequencing in Salvia miltiorrhiza to identify genes involved in the biosynthesis of active ingredients. , 2011, Genomics.

[26]  Jonathan Vincent,et al.  dbWFA: a web-based database for functional annotation of Triticum aestivum transcripts , 2013, Database J. Biol. Databases Curation.

[27]  Akhilesh K. Tyagi,et al.  De Novo Assembly of Chickpea Transcriptome Using Short Reads for Gene Discovery and Marker Identification , 2011, DNA research : an international journal for rapid publication of reports on genes and genomes.