Data Integration and Semantic Enrichment of Systems Biology Models and Simulations

The rise of Systems Biology approaches, in conjunction with the availability of numerous powerful and user-friendly modeling environments, brought computational models out of the dusty closets of theoreticians to the forefront of research in biology. Those models are becoming larger, more complex and more realistic. As any other type of data in life sciences, models have to be stored, exchanged and re-used. This was made possible by the development of a series of standards, that, when used in conjunction, can cover the whole life-cycle of a model, including the specification of its structure and syntax, the simulations to be run, and the description of its behaviour and resulting numerical output. We will review those standards, well-accepted or still under development, including the Minimal requirements (MIRIAM, MIASE), the description formats (SBML, SED-ML, SBRML) and the associated ontologies (SBO, KiSAO, TEDDY). We will show how their use by the community, through a rich toolkit of complementary software, can permit to leverage on everyone's efforts, to integrate models and simulations with other types of biological knowledge, and eventually lead to the fulfillment of one of Systems Biology's tenets of collaboration between biology, mathematics and computing science.

[1]  Nicolas Le Novère,et al.  SED-ML - An XML Format for the Implementation of the MIASE Guidelines , 2008, CMSB.

[2]  Nigel H. Goddard,et al.  Towards NeuroML: model description methods for collaborative modelling in neuroscience. , 2001, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[3]  Roy T. Fielding,et al.  Uniform Resource Identifiers (URI): Generic Syntax , 1998, RFC.

[4]  Hugh D. Spence,et al.  Minimum information requested in the annotation of biochemical models (MIRIAM) , 2005, Nature Biotechnology.

[5]  Upinder S. Bhalla,et al.  The Database of Quantitative Cellular Signaling: management and analysis of chemical kinetic models of signaling networks , 2003, Bioinform..

[6]  Ian Stark,et al.  The Continuous pi-Calculus: A Process Algebra for Biochemical Modelling , 2008, CMSB.

[7]  Michael Hucka,et al.  Systems Biology Markup Language (SBML) Level 2: Structures and Facilities for Model Definitions , 2007, WWW 2007.

[8]  Hiroaki Kitano,et al.  The systems biology markup language (SBML): a medium for representation and exchange of biochemical network models , 2003, Bioinform..

[9]  Jacky L. Snoep,et al.  Web-based kinetic modelling using JWS Online , 2004, Bioinform..

[10]  M. Ashburner,et al.  The OBO Foundry: coordinated evolution of ontologies to support biomedical data integration , 2007, Nature Biotechnology.

[11]  Catherine M Lloyd,et al.  CellML: its future, present and past. , 2004, Progress in biophysics and molecular biology.

[12]  Jacky L. Snoep,et al.  BioModels Database: a free, centralized database of curated, published, quantitative kinetic models of biochemical and cellular systems , 2005, Nucleic Acids Res..

[13]  Roy T. Fielding,et al.  Uniform Resource Identifier (URI): Generic Syntax , 2005, RFC.

[14]  Nigel W. Hardy,et al.  Promoting coherent minimum reporting guidelines for biological and biomedical investigations: the MIBBI project , 2008, Nature Biotechnology.

[15]  L. Stein,et al.  OWL Web Ontology Language - Reference , 2004 .