AgriSeqDB: an online RNA-Seq database for functional studies of agriculturally relevant plant species

Background The genome-wide expression profile of genes in different tissues/cell types and developmental stages is a vital component of many functional genomic studies. Transcriptome data obtained by RNA-sequencing (RNA-Seq) is often deposited in public databases that are made available via data portals. Data visualization is one of the first steps in assessment and hypothesis generation. However, these databases do not typically include visualization tools and establishing one is not trivial for users who are not computational experts. This, as well as the various formats in which data is commonly deposited, makes the processes of data access, sharing and utility more difficult. Our goal was to provide a simple and user-friendly repository that meets these needs for datasets from major agricultural crops. Description AgriSeqDB (https://expression.latrobe.edu.au/agriseqdb), is a database for viewing, analysing and interpreting developmental and tissue/cell-specific transcriptome data from several species, including major agricultural crops such as wheat, rice, maize, barley and tomato. The disparate manner in which public transcriptome data is often warehoused and the challenge of visualizing raw data are both major hurdles to data reuse. The popular eFP browser does an excellent job of presenting transcriptome data in an easily interpretable view, but previous implementation has been mostly on a case-by-case basis. Here we present an integrated visualisation database of transcriptome datasets from six species that did not previously have public-facing visualisations. We combine the eFP browser, for gene-by-gene investigation, with the Degust browser, which enables visualisation of all transcripts across multiple samples. The two visualisation interfaces launch from the same point, enabling users to easily switch between analysis modes. The tools allow users, even those without bioinformatics expertise, to mine into datasets and understand the behaviour of transcripts of interest across samples and time. We have also incorporated an additional graphic download option to simplify incorporation into presentations or publications. Conclusion Powered by eFP and Degust browsers, AgriSeqDB is a quick and easy-to-use platform for data analysis and visualization in five crops and Arabidopsis. Furthermore, it provides a tool that makes it easy for researchers to share their datasets, promoting research collaborations and dataset reuse.

[1]  Yuliya V. Karpievitch,et al.  Extensive transcriptomic and epigenomic remodelling occurs during Arabidopsis thaliana germination , 2017, Genome Biology.

[2]  Andrew G. Sharpe,et al.  The developmental transcriptome atlas of the biofuel crop Camelina sativa. , 2016, The Plant journal : for cell and molecular biology.

[3]  M. Gerstein,et al.  RNA-Seq: a revolutionary tool for transcriptomics , 2009, Nature Reviews Genetics.

[4]  Erik S. Ferlanti,et al.  ePlant: Visualizing and Exploring Multiple Levels of Data for Hypothesis Generation in Plant Biology[OPEN] , 2017, Plant Cell.

[5]  Nuno A. Fonseca,et al.  The RNASeq-er API—a gateway to systematically updated analysis of public RNA-seq data , 2017, Bioinform..

[6]  Robert Turgeon,et al.  The developmental dynamics of the maize leaf transcriptome , 2010, Nature Genetics.

[7]  Nicholas J Provart,et al.  RNA-Seq effectively monitors gene expression in Eutrema salsugineum plants growing in an extreme natural habitat and in controlled growth cabinet conditions , 2013, BMC Genomics.

[8]  Karl G. Kugler,et al.  Genome interplay in the grain transcriptome of hexaploid bread wheat , 2014, Science.

[9]  R. Burton,et al.  Isolation of tissues and preservation of RNA from intact, germinated barley grain , 2017, The Plant journal : for cell and molecular biology.

[10]  Manuele Bicego,et al.  The Grapevine Expression Atlas Reveals a Deep Transcriptome Shift Driving the Entire Plant into a Maturation Program[W][OA] , 2012, Plant Cell.

[11]  Nicholas J Provart,et al.  Developmental transcriptional profiling reveals key insights into Triticeae reproductive development. , 2013, The Plant journal : for cell and molecular biology.

[12]  J. Bohlmann,et al.  Cell‐type‐ and tissue‐specific transcriptomes of the white spruce (Picea glauca) bark unmask fine‐scale spatial patterns of constitutive and induced conifer defense , 2017, The Plant journal : for cell and molecular biology.

[13]  N. Provart,et al.  Expression atlas and comparative coexpression network analyses reveal important genes involved in the formation of lignified cell wall in Brachypodium distachyon. , 2017, The New phytologist.

[14]  Zhangjun Fei,et al.  High-resolution spatiotemporal transcriptome mapping of tomato fruit development and ripening , 2018, Nature Communications.

[15]  Lawrence Kelley,et al.  ePlant and the 3D Data Display Initiative: Integrative Systems Biology on the World Wide Web , 2011, PloS one.

[16]  D. Galbraith,et al.  Profiling translatomes of discrete cell populations resolves altered cellular priorities during hypoxia in Arabidopsis , 2009, Proceedings of the National Academy of Sciences.

[17]  Unraveling the complexity of transcriptomic, metabolomic and quality environmental response of tomato fruit , 2017, BMC Plant Biology.

[18]  B. Langmead,et al.  Cloud computing for genomic data analysis and collaboration , 2018, Nature Reviews Genetics.

[19]  M. Schmid,et al.  Cell type-specific transcriptome analysis in the early Arabidopsis thaliana embryo , 2014, Development.

[20]  Jinsheng Lai,et al.  Dynamic Transcriptome Landscape of Maize Embryo and Endosperm Development1[W][OPEN] , 2014, Plant Physiology.

[21]  Abdul Ahad,et al.  Analysis of gene expression patterns during seed coat development in Arabidopsis. , 2011, Molecular plant.

[22]  Matthew D. Schultz,et al.  Dynamic and rapid changes in the transcriptome and epigenome during germination and in developing rice (Oryza sativa) coleoptiles under anoxia and re‐oxygenation , 2017, The Plant journal : for cell and molecular biology.

[23]  Y. Chu,et al.  A Developmental Transcriptome Map for Allotetraploid Arachis hypogaea , 2016, Front. Plant Sci..

[24]  Justin Foong,et al.  Expansion and Diversification of the Populus R2R3-MYB Family of Transcription Factors1[W][OA] , 2008, Plant Physiology.

[25]  Nicholas J. Provart,et al.  An “Electronic Fluorescent Pictograph” Browser for Exploring and Analyzing Large-Scale Biological Data Sets , 2007, PloS one.

[26]  M. Muers Human disease: Genome-wide insights into lipid levels , 2009, Nature Reviews Genetics.

[27]  Z. Fei,et al.  Catalyzing plant science research with RNA-seq , 2013, Front. Plant Sci..

[28]  Xiangfeng Wang,et al.  RNA Sequencing of Laser-Capture Microdissected Compartments of the Maize Kernel Identifies Regulatory Modules Associated with Endosperm Cell Differentiation[OPEN] , 2015, Plant Cell.