EchinoDB, an application for comparative transcriptomics of deeply-sampled clades of echinoderms

BackgroundOne of our goals for the echinoderm tree of life project (http://echinotol.org) is to identify orthologs suitable for phylogenetic analysis from next-generation transcriptome data. The current dataset is the largest assembled for echinoderm phylogeny and transcriptomics. We used RNA-Seq to profile adult tissues from 42 echinoderm specimens from 24 orders and 37 families. In order to achieve sampling members of clades that span key evolutionary divergence, many of our exemplars were collected from deep and polar seas.DescriptionA small fraction of the transcriptome data we produced is being used for phylogenetic reconstruction. Thus to make a larger dataset available to researchers with a wide variety of interests, we made a web-based application, EchinoDB (http://echinodb.uncc.edu). EchinoDB is a repository of orthologous transcripts from echinoderms that is searchable via keywords and sequence similarity.ConclusionsFrom transcripts we identified 749,397 clusters of orthologous loci. We have developed the information technology to manage and search the loci their annotations with respect to the Sea Urchin (Strongylocentrotus purpuratus) genome. Several users have already taken advantage of these data for spin-off projects in developmental biology, gene family studies, and neuroscience. We hope others will search EchinoDB to discover datasets relevant to a variety of additional questions in comparative biology.

[1]  B. Livingston,et al.  Examination of the skeletal proteome of the brittle star Ophiocoma wendtii reveals overall conservation of proteins but variation in spicule matrix proteins , 2015, Proteome Science.

[2]  Le-Shin Wu,et al.  Trinity RNA-Seq assembler performance optimization , 2012, XSEDE '12.

[3]  Gregorio V. Linchangco,et al.  Phylogeny of Echinoderm Hemoglobins , 2015, PloS one.

[4]  Christopher E. Jones,et al.  Identification of a neuropeptide precursor protein that gives rise to a "cocktail" of peptides that bind Cu(II) and generate metal-linked dimers. , 2016, Biochimica et biophysica acta.

[5]  E. Davidson,et al.  Quantitative developmental transcriptomes of the sea urchin Strongylocentrotus purpuratus. , 2014, Developmental biology.

[6]  C. Stoeckert,et al.  OrthoMCL: identification of ortholog groups for eukaryotic genomes. , 2003, Genome research.

[7]  E. Birney,et al.  Pfam: the protein families database , 2013, Nucleic Acids Res..

[8]  Gregorio V. Linchangco,et al.  Phylotranscriptomic analysis uncovers a wealth of tissue inhibitor of metalloproteinases variants in echinoderms , 2015, Royal Society Open Science.

[9]  R. Gibbs,et al.  Gene structure in the sea urchin Strongylocentrotus purpuratus based on transcriptome analysis , 2012, Genome research.

[10]  Craig A. Stewart,et al.  Proceedings of the 1st Conference of the Extreme Science and Engineering Discovery Environment: Bridging from the eXtreme to the campus and beyond , 2012 .