Considerations of graph-based concepts to manage of computational biology models and associated simulations

Over the past years various databases in Life Sciences have been developed, among them databases to handle computational models of biological systems. Exchange formats that represent these models are typically XMLbased; they encode models as networks. Models are published together with supplementary materials such as annotations, simulation experiment descriptions, or result sets. In consequence, not only model files need to be managed, but also the associated simulation setups, and highly linked meta-information. We discuss here the use of graph databases for model storage as they well reflect this interrelated nature. They can also enhance the integrated management of computational models and associated meta-information, as they handle connections between models, simulation descriptions and result data files, as well as external knowledge. This property enhances version control, retrieval and ranking, thereby resulting in improved model reuse and result reproducibility.

[1]  Matthew R. Pocock,et al.  Annotation of SBML models through rule-based semantic integration , 2009, J. Biomed. Semant..

[2]  Claudio Gutierrez,et al.  Survey of graph database models , 2008, CSUR.

[3]  Michel Dumontier,et al.  Controlled vocabularies and semantics in systems biology , 2011, Molecular systems biology.

[4]  Peter J. Hunter,et al.  An Overview of CellML 1.1, a Biological Model Description Language , 2003, Simul..

[5]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[6]  Christopher J. Rawlings,et al.  Lost in translation: data integration tools meet the Semantic Web (experiences from the Ondex project) , 2011, ICDE 2012.

[7]  Jacky L. Snoep,et al.  Reproducible computational biology experiments with SED-ML - The Simulation Experiment Description Markup Language , 2011, BMC Systems Biology.

[8]  Michel Dumontier,et al.  Integrating systems biology models and biomedical ontologies , 2011, BMC Systems Biology.

[9]  Andreas Heuer,et al.  Das Sombi-Framework zum Ermitteln geeigneter Suchfunktionen für biologische Modelldatenbasen , 2011, Datenbank-Spektrum.

[10]  Phillip Lord,et al.  Annotation of SBML Models Through Rule-Based Semantic Integration , 2009 .

[11]  Yixin Chen,et al.  A comparison of a graph database and a relational database: a data provenance perspective , 2010, ACM SE '10.

[12]  Michael L. Hines,et al.  NeuroML: A Language for Describing Data Driven Models of Neurons and Networks with a High Degree of Biological Detail , 2010, PLoS Comput. Biol..

[13]  Melanie I. Stefan,et al.  BioModels Database: An enhanced, curated and annotated resource for published quantitative kinetic models , 2010, BMC Systems Biology.

[14]  Nicolas Le Novère,et al.  Ranked retrieval of Computational Biology models , 2010, BMC Bioinformatics.

[15]  Olivier Bodenreider,et al.  Bio-ontologies: current trends and future directions , 2006, Briefings Bioinform..

[16]  Olaf Wolkenhauer,et al.  Reproducibility of Model-Based Results in Systems Biology , 2013 .

[17]  Howard Gobioff,et al.  The Google file system , 2003, SOSP '03.

[18]  Ora Lassila,et al.  W3c resource description framework (rdf) model and syntax specification , 1998 .

[19]  Michael Hucka,et al.  The Systems Biology Markup Language (SBML): Language Specification for Level 3 Version 1 Core (Release 1 Candidate) , 2010 .