Integrating BioPAX pathway knowledge with SBML models.

Online databases store thousands of molecular interactions and pathways, and numerous modelling software tools provide users with an interface to create and simulate mathematical models of such interactions. However, the two most widespread used standards for storing pathway data (biological pathway exchange; BioPAX) and for exchanging mathematical models of pathways (systems biology markup language; SBML) are structurally and semantically different. Conversion between formats (making data present in one format available in another format) based on simple one-to-one mappings may lead to loss or distortion of data, is difficult to automate, and often impractical and/or erroneous. This seriously limits the integration of knowledge data and models. In this paper we introduce an approach for such integration based on a bridging format that we named systems biology pathway exchange (SBPAX) alluding to SBML and BioPAX. It facilitates conversion between data in different formats by a combination of one-to-one mappings to and from SBPAX and operations within the SBPAX data. The concept of SBPAX is to provide a flexible description expanding around essential pathway data - basically the common subset of all formats describing processes, the substances participating in these processes and their locations. SBPAX can act as a repository for molecular interaction data from a variety of sources in different formats, and the information about their relative relationships, thus providing a platform for converting between formats and documenting assumptions used during conversion, gluing (identifying related elements across different formats) and merging (creating a coherent set of data from multiple sources) data.

[1]  Keyuan Jiang,et al.  Ontology-Based Aggregation of Biological Pathway Datasets , 2005, 2005 IEEE Engineering in Medicine and Biology 27th Annual Conference.

[2]  Hiroaki Kitano,et al.  The systems biology markup language (SBML): a medium for representation and exchange of biochemical network models , 2003, Bioinform..

[3]  Michael L. Blinov,et al.  Kinetic Modeling Using BioPAX Ontology , 2007, 2007 IEEE International Conference on Bioinformatics and Biomedicine (BIBM 2007).

[4]  Nicolas Le Novère,et al.  Model storage, exchange and integration , 2006, BMC neuroscience.

[5]  Hugh D. Spence,et al.  Minimum information requested in the annotation of biochemical models (MIRIAM) , 2005, Nature Biotechnology.

[6]  A. Gatti,et al.  Multisite Autophosphorylation of p21-activated Protein Kinase γ-PAK as a Function of Activation* , 1999, The Journal of Biological Chemistry.

[7]  Masao Nagasaki,et al.  Cell System Ontology: Representation for Modeling, Visualizing, and Simulating Biological Pathways , 2007, Silico Biol..

[8]  Mudita Singhal,et al.  COPASI - a COmplex PAthway SImulator , 2006, Bioinform..

[9]  P. Shannon,et al.  Cytoscape: a software environment for integrated models of biomolecular interaction networks. , 2003, Genome research.

[10]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[11]  Joanne S. Luciano,et al.  PAX of mind for pathway researchers. , 2005, Drug discovery today.

[12]  Matthew R. Pocock,et al.  Integration of constraints documented in SBML, SBO, and the SBML Manual facilitates validation of biological models , 2007, J. Integr. Bioinform..

[13]  Emmanuel Barillot,et al.  BiNoM: a Cytoscape plugin for manipulating and analyzing biological networks , 2008, Bioinform..

[14]  W. S. Hlavacek,et al.  A network model of early events in epidermal growth factor receptor signaling that accounts for combinatorial complexity. , 2006, Bio Systems.

[15]  Hiroyuki Ogata,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 1999, Nucleic Acids Res..

[16]  Nicolas Le Novère,et al.  Model storage, exchange and integration , 2006, BMC Neuroscience.

[17]  Edda Klipp,et al.  SBMLmerge, a system for combining biochemical network models. , 2006, Genome informatics. International Conference on Genome Informatics.

[18]  C. Ouzounis,et al.  Expansion of the BioCyc collection of pathway/genome databases to 160 genomes , 2005, Nucleic acids research.

[19]  J. Weinstein,et al.  Molecular interaction maps of bioregulatory networks: a general rubric for systems biology. , 2005, Molecular biology of the cell.

[20]  Catherine M Lloyd,et al.  CellML: its future, present and past. , 2004, Progress in biophysics and molecular biology.

[21]  Hiroaki Kitano,et al.  CellDesigner: a process diagram editor for gene-regulatory and biochemical networks , 2003 .

[22]  Jacky L. Snoep,et al.  BioModels Database: a free, centralized database of curated, published, quantitative kinetic models of biochemical and cellular systems , 2005, Nucleic Acids Res..

[23]  Susumu Goto,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 2000, Nucleic Acids Res..

[24]  J C Schaff,et al.  Virtual Cell modelling and simulation software environment. , 2008, IET systems biology.

[25]  O Ruebenacker,et al.  Complexity and modularity of intracellular networks: a systematic approach for modelling and simulation. , 2008, IET systems biology.

[26]  Emek Demir,et al.  PATIKA: an integrated visual environment for collaborative construction and analysis of cellular pathways , 2002, Bioinform..