InterStoreDB: a generic integration resource for genetic and genomic data.

Associating phenotypic traits and quantitative trait loci (QTL) to causative regions of the underlying genome is a key goal in agricultural research. InterStoreDB is a suite of integrated databases designed to assist in this process. The individual databases are species independent and generic in design, providing access to curated datasets relating to plant populations, phenotypic traits, genetic maps, marker loci and QTL, with links to functional gene annotation and genomic sequence data. Each component database provides access to associated metadata, including data provenance and parameters used in analyses, thus providing users with information to evaluate the relative worth of any associations identified. The databases include CropStoreDB, for management of population, genetic map, QTL and trait measurement data, SeqStoreDB for sequence-related data and AlignStoreDB, which stores sequence alignment information, and allows navigation between genetic and genomic datasets. Genetic maps are visualized and compared using the CMAP tool, and functional annotation from sequenced genomes is provided via an EnsEMBL-based genome browser. This framework facilitates navigation of the multiple biological domains involved in genetics and genomics research in a transparent manner within a single portal. We demonstrate the value of InterStoreDB as a tool for Brassica research. InterStoreDB is available from: http://www.interstoredb.org.

[1]  Gudmundur A. Thorisson,et al.  Genotype–phenotype databases: challenges and solutions for the post-genomic era , 2009, Nature Reviews Genetics.

[2]  J. Poulain,et al.  The genome of the mesopolyploid crop species Brassica rapa , 2011, Nature Genetics.

[3]  Christopher G. Love,et al.  A Brassica Exon Array for Whole-Transcript Gene Expression Profiling , 2010, PloS one.

[4]  Wei Zhao,et al.  Panzea: an update on new content and features , 2007, Nucleic Acids Res..

[5]  Lincoln Stein,et al.  CMap 1.01: a comparative mapping application for the Internet , 2009, Bioinform..

[6]  M. Freeling,et al.  How to usefully compare homologous plant genes and chromosomes as DNA sequences. , 2008, The Plant journal : for cell and molecular biology.

[7]  Christopher G. Love,et al.  Regulatory Hotspots Are Associated with Plant Gene Expression under Varying Soil Phosphorus Supply in Brassica rapa1[W][OA] , 2011, Plant Physiology.

[8]  David L. Wheeler,et al.  GenBank , 2015, Nucleic Acids Res..

[9]  L. Stein,et al.  Gramene: Development and Integration of Trait and Gene Ontologies for Rice , 2002, Comparative and functional genomics.

[10]  G. King,et al.  Integration of linkage maps for the Amphidiploid Brassica napus and comparative mapping with Arabidopsis and Brassica rapa , 2011, BMC Genomics.

[11]  David M. Grant,et al.  The Legume Information System (LIS): an integrated information resource for comparative legume biology , 2004, Nucleic Acids Res..

[12]  Robert M. Buels,et al.  The Chado Natural Diversity module: a new generic database schema for large-scale phenotyping and genotyping data , 2011, Database J. Biol. Databases Curation.

[13]  Andrew J. Flavell,et al.  GERMINATE. A Generic Database for Integrating Genotypic and Phenotypic Information for Plant Genetic Resource Collections1[w] , 2005, Plant Physiology.

[14]  Jun Yu Li,et al.  A comparative linkage map of oilseed rape and its use for QTL analysis of seed oil and erucic acid content , 2006, Theoretical and Applied Genetics.

[15]  Winston A Hide,et al.  Big data: The future of biocuration , 2008, Nature.

[16]  David E Matthews,et al.  Plant and crop databases. , 2009, Methods in molecular biology.

[17]  Lisa C. Harper,et al.  MaizeGDB: curation and outreach go hand-in-hand , 2011, Database J. Biol. Databases Curation.

[18]  L. Kunst,et al.  Very-long-chain fatty acid biosynthesis is controlled through the expression and specificity of the condensing enzyme. , 1997, The Plant journal : for cell and molecular biology.

[19]  Daniel Rios,et al.  Ensembl 2011 , 2010, Nucleic Acids Res..

[20]  Graham J.W. King,et al.  CropStoreDB: a practical approach to managing crop data; from traits to sequences , 2010 .

[21]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[22]  Xiaowu Wang,et al.  BRAD, the genetics and genomics database for Brassica plants , 2011, BMC Plant Biology.

[23]  Maureen J Donlin,et al.  Using the Generic Genome Browser (GBrowse) , 2007, Current protocols in bioinformatics.

[24]  P. Hurban,et al.  A newly-developed community microarray resource for transcriptome profiling in Brassica species enables the confirmation of Brassica-specific expressed sequences , 2009, BMC Plant Biology.

[25]  Chris Mungall,et al.  A Chado case study: an ontology-based modular schema for representing genome-associated biological information , 2007, ISMB/ECCB.

[26]  G. King,et al.  A functional genomics resource for Brassica napus: development of an EMS mutagenized population and discovery of FAE1 point mutations by TILLING. , 2008, The New phytologist.

[27]  M. A. Stevens,et al.  Genetics and breeding , 1986 .

[28]  G. King,et al.  Novel Insights into Seed Fatty Acid Synthesis and Modification Pathways from Genetic Diversity and Quantitative Trait Loci Analysis of the Brassica C Genome1[OA] , 2007, Plant Physiology.

[29]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[30]  N. Morrison,et al.  Multifunctional crop trait ontology for breeders' data: field book, annotation, data discovery and semantic enrichment of the literature , 2010, AoB PLANTS.

[31]  Gerard R. Lazo,et al.  GrainGenes, the genome database for small-grain crops , 2003, Nucleic Acids Res..

[32]  Edward S. Buckler,et al.  Gramene database in 2010: updates and extensions , 2010, Nucleic Acids Res..

[33]  S. Jackson,et al.  Next-generation sequencing technologies and their implications for crop genetics and breeding. , 2009, Trends in biotechnology.

[34]  Lei Shi,et al.  Open Access Research Article Assessment of Fae1 Polymorphisms in Three Brassica Species Using Ecotilling and Their Association with Differences in Seed Erucic Acid Contents , 2022 .

[35]  Miguel A. Andrade-Navarro,et al.  Evaluation of annotation strategies using an entire genome sequence , 2003, Bioinform..

[36]  J. Bard,et al.  Ontologies in biology: design, applications and future challenges , 2004, Nature Reviews Genetics.

[37]  D. Ware,et al.  The Gramene Genetic Diversity Module: a resource for genotype-phenotype association analysis in grass species , 2010 .

[38]  C. Craplet Genetics and breeding. , 1953 .

[39]  R. Last,et al.  Shotguns and SNPs: how fast and cheap sequencing is revolutionizing plant biology. , 2010, The Plant journal : for cell and molecular biology.