Improving the reuse of computational models through version control

MOTIVATION Only models that are accessible to researchers can be reused. As computational models evolve over time, a number of different but related versions of a model exist. Consequently, tools are required to manage not only well-curated models but also their associated versions. RESULTS In this work, we discuss conceptual requirements for model version control. Focusing on XML formats such as Systems Biology Markup Language and CellML, we present methods for the identification and explanation of differences and for the justification of changes between model versions. In consequence, researchers can reflect on these changes, which in turn have considerable value for the development of new models. The implementation of model version control will therefore foster the exploration of published models and increase their reusability.

[1]  Renate Kania,et al.  SABIO-RK: Integration and Curation of Reaction Kinetics Data , 2006, DILS.

[2]  Daniel S. Hirschberg,et al.  Algorithms for the Longest Common Subsequence Problem , 1977, JACM.

[3]  Jacky L. Snoep,et al.  Reproducible computational biology experiments with SED-ML - The Simulation Experiment Description Markup Language , 2011, BMC Systems Biology.

[4]  Dan Brickley,et al.  Resource Description Framework (RDF) Model and Syntax Specification , 2002 .

[5]  Peter J. Hunter,et al.  Revision history aware repositories of computational models of biological systems , 2011, BMC Bioinformatics.

[6]  Melanie I. Stefan,et al.  BioModels Database: An enhanced, curated and annotated resource for published quantitative kinetic models , 2010, BMC Systems Biology.

[7]  Man Lung Yiu,et al.  Group-by skyline query processing in relational engines , 2009, CIKM.

[8]  Luis Arévalo Rosado,et al.  An XQuery-based version extension of an XML native database , 2009, EDBT/ICDT '09.

[9]  Peter J. Hunter,et al.  Bioinformatics Applications Note Databases and Ontologies the Physiome Model Repository 2 , 2022 .

[10]  Uwe M. Borghoff,et al.  Efficient and reliable merging of XML documents , 2009, CIKM.

[11]  Nicolas Le Novère,et al.  Identifiers.org and MIRIAM Registry: community resources to provide persistent identification , 2011, Nucleic Acids Res..

[12]  JOHN J. TYSON Modeling the cell division cycle : cdc 2 and cyclin interactions ( maturation promoting factor / metaphase arrest , .

[13]  Sarala M. Wimalaratne,et al.  The Systems Biology Graphical Notation , 2009, Nature Biotechnology.

[14]  Michael L. Hines,et al.  NeuroML: A Language for Describing Data Driven Models of Neurons and Networks with a High Degree of Biological Detail , 2010, PLoS Comput. Biol..

[15]  Michael Hucka,et al.  The Systems Biology Markup Language (SBML): Language Specification for Level 3 Version 1 Core , 2010 .

[16]  Michel Dumontier,et al.  Integrating systems biology models and biomedical ontologies , 2011, BMC Systems Biology.

[17]  Nicolas Le Novère,et al.  Ranked retrieval of Computational Biology models , 2010, BMC Bioinformatics.

[18]  Peter J. Hunter,et al.  CellML metadata standards, associated tools and repositories , 2009, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[19]  Jeff Z. Pan,et al.  Resource Description Framework , 2020, Definitions.

[20]  Gary R. Mirams,et al.  High-throughput functional curation of cellular electrophysiology models. , 2011, Progress in biophysics and molecular biology.

[21]  Richard Orton,et al.  Version control of pathway models using XML patches , 2009, BMC Systems Biology.

[22]  Michael Hucka,et al.  The Systems Biology Markup Language (SBML): Language Specification for Level 3 Version 1 Core , 2010, J. Integr. Bioinform..

[23]  David P. Nickerson,et al.  An overview of the CellML API and its implementation , 2010, BMC Bioinformatics.

[24]  Peter J. Hunter,et al.  An Overview of CellML 1.1, a Biological Model Description Language , 2003, Simul..

[25]  Olaf Wolkenhauer,et al.  Considerations of graph-based concepts to manage of computational biology models and associated simulations , 2012, GI-Jahrestagung.

[26]  Hiroaki Kitano,et al.  The systems biology markup language (SBML): a medium for representation and exchange of biochemical network models , 2003, Bioinform..

[27]  Markus J. Herrgård,et al.  A consensus yeast metabolic network reconstruction obtained from a community approach to systems biology , 2008, Nature Biotechnology.

[28]  Nicholas T. Carnevale,et al.  ModelDB: A Database to Support Computational Neuroscience , 2004, Journal of Computational Neuroscience.

[29]  Hubert Naacke,et al.  Proceedings of the 2009 EDBT/ICDT Workshops , 2009 .

[30]  Uwe M. Borghoff,et al.  Towards XML version control of office documents , 2005, DocEng '05.

[31]  Jennifer Widom,et al.  Change detection in hierarchically structured information , 1996, SIGMOD '96.

[32]  Hugh D. Spence,et al.  Minimum information requested in the annotation of biochemical models (MIRIAM) , 2005, Nature Biotechnology.

[33]  James Hetherington,et al.  Computational challenges of systems biology , 2004, Computer.

[34]  Edda Klipp,et al.  Annotation and merging of SBML models with semanticSBML , 2010, Bioinform..