Challenges for an enzymatic reaction kinetics database

The scientific literature contains a tremendous amount of kinetic data describing the dynamic behaviour of biochemical reactions over time. These data are needed to build computational models of biochemical reaction networks and to gain a better understanding of the processes in living cells. To extract this knowledge from the literature, biocurators must understand each paper and interpret its data. For modellers and experimentalists alike, this process is very time-consuming because the information is scattered across the publication, is in most cases insufficiently structured and is often described without standard terminology. In recent years, biological databases for different data types have been developed. Their advantages lie in their unified structure, their searchability and the potential for augmented analysis by software, all of which support the modelling process. We have developed the SABIO‐RK database for biochemical reaction kinetics. In the present review, we describe the challenges facing database developers and curators, from the analysis of relevant publications through to the export of database information in a standardized format. Our aim is to draw the experimentalist's attention to a problem that, from a data integration point of view, arises from incompletely and imprecisely written publications, and to describe how to lower the barrier for curators and improve this situation. At the same time, we are aware that preparing experimental data for curation takes time. A community has formed that aims to make it much easier to publish data with the proper structure and annotation to ontologies. In this respect, we highlight some useful initiatives and tools.
