Abstract interpretation and types for systems biology

Abstract interpretation is a theory of abstraction that was introduced for the analysis of programs. In particular, it has proved useful for organizing the multiple semantics of a given programming language in a hierarchy corresponding to different levels of detail, and for defining type systems for programming languages and program analyzers in software engineering. In this paper, we investigate the application of these concepts to systems biology formalisms. More specifically, we consider the Systems Biology Markup Language (SBML) and the Biochemical Abstract Machine (BIOCHAM) with its differential, stochastic, discrete and Boolean semantics. We first show how all of these semantics, except the differential one, can be formally related by simple Galois connections. We then define three type systems: one for checking or inferring the functions of proteins in a reaction model, one for checking or inferring the activation and inhibition effects of proteins in a reaction model, and one for checking or inferring the topology of compartments or locations. We show that the framework of abstract interpretation applies elegantly to the formalization of these further abstractions, and to the implementation of type checking and type inference algorithms that run in linear or quadratic time. Furthermore, we establish a theorem stating that, under general conditions, the graph of activation and inhibition effects is independent of the kinetic expressions in the reaction model. Through some examples, we show that the analysis of biochemical models by type inference provides accurate and useful information. Interestingly, such a mathematical formalization of the abstractions commonly used in systems biology already provides some guidelines for extending biochemical reaction rule languages.
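
To make the idea of inferring activation and inhibition effects from a reaction model more concrete, here is a minimal Python sketch. It is not the BIOCHAM implementation, and the inference rule it uses is an assumption made for illustration: a species consumed by a reaction is taken to activate every species whose net amount increases in that reaction and to inhibit every species whose net amount decreases. The names `Reaction`, `infer_influences` and `model`, and the two example reactions, are hypothetical. Kinetic expressions are deliberately ignored, so the result depends only on the structure of the rules.

```python
from typing import Dict, List, Set, Tuple

# A reaction is a pair (reactants, products). Each side maps a species name
# to its stoichiometric coefficient; a catalyst appears on both sides.
Reaction = Tuple[Dict[str, int], Dict[str, int]]
Influence = Tuple[str, str, str]  # (source, "activates" | "inhibits", target)


def infer_influences(reactions: List[Reaction]) -> Set[Influence]:
    """Abstract a list of reactions into a set of signed influences.

    Assumed rule (illustrative only): every species on the left-hand side
    activates the species whose net amount increases and inhibits the
    species whose net amount decreases.
    """
    influences: Set[Influence] = set()
    for reactants, products in reactions:
        species = set(reactants) | set(products)
        for source in reactants:
            for target in species:
                delta = products.get(target, 0) - reactants.get(target, 0)
                if delta > 0:
                    influences.add((source, "activates", target))
                elif delta < 0:
                    influences.add((source, "inhibits", target))
    return influences


# Hypothetical model: a complexation A + B => C and a phosphorylation
# S + K => P + K in which K acts as a catalyst.
model: List[Reaction] = [
    ({"A": 1, "B": 1}, {"C": 1}),
    ({"S": 1, "K": 1}, {"P": 1, "K": 1}),
]

result = infer_influences(model)
for source, sign, target in sorted(result):
    print(source, sign, target)
```

Each reaction is visited once and the work per reaction depends only on the number of species it mentions, so for rules of bounded size this sketch runs in time linear in the number of reactions.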
