OPTIMAS-DW, MetaCrop and VANTED: A Case Study for Data Integration, Curation and Visualisation in Life Sciences

Since the data volume in life sciences has been growing exponentially in recent years, it is indispensable to develop databases and tools for efficient data integration, curation and visualisation. Focusing on data handling in crop plant research, this paper presents an approach, which combines (i) a data warehouse (OPTIMAS-DW) for integrating experimental data, (ii) an information system (MetaCrop) for manually curated biochemical pathways, and (iii) a visualisation software (VANTED) for integrated data visualisation. The functionality and usability of the concept will be illustrated by a use case.

[1]  Sarala M. Wimalaratne,et al.  The Systems Biology Graphical Notation , 2009, Nature Biotechnology.

[2]  Hiroaki Kitano,et al.  The systems biology markup language (SBML): a medium for representation and exchange of biochemical network models , 2003, Bioinform..

[3]  Kenji Akiyama,et al.  UniVIO: A Multiple Omics Database with Hormonome and Transcriptome Data from Rice , 2013, Plant & cell physiology.

[4]  Matthias Klapperstück,et al.  VANTED v2: a framework for systems biology applications , 2012, BMC Systems Biology.

[5]  Uwe Scholz,et al.  MetaCrop 2.0: managing and exploring information about crop plant metabolism , 2011, Nucleic Acids Res..

[6]  Susumu Goto,et al.  KEGG for integration and interpretation of large-scale molecular data sets , 2011, Nucleic Acids Res..

[7]  Uwe Scholz,et al.  OPTIMAS-DW: A comprehensive transcriptomics, metabolomics, ionomics, proteomics and phenomics data resource for maize , 2012, BMC Plant Biology.

[8]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[9]  Gene Ontology Consortium The Gene Ontology (GO) database and informatics resource , 2003 .

[10]  P. Shannon,et al.  Cytoscape: a software environment for integrated models of biomolecular interaction networks. , 2003, Genome research.

[11]  Uwe Scholz,et al.  Systems Analysis of a Maize Leaf Developmental Gradient Redefines the Current C4 Model and Provides Candidates for Regulation[W][OA] , 2011, Plant Cell.

[12]  B Marshall,et al.  Gene Ontology Consortium: The Gene Ontology (GO) database and informatics resource , 2004, Nucleic Acids Res..

[13]  L. Stein,et al.  Plant Ontology (PO): a Controlled Vocabulary of Plant Structures and Growth Stages , 2005, Comparative and functional genomics.

[14]  Juan Miguel García-Gómez,et al.  BIOINFORMATICS APPLICATIONS NOTE Sequence analysis Manipulation of FASTQ data with Galaxy , 2005 .

[15]  Astrid Junker,et al.  Creating interactive, web-based and data-enriched maps with the Systems Biology Graphical Notation , 2012, Nature Protocols.