Qualitative and Quantitative Formal Modeling of Biological Systems

Cell Biology, the study of the morphological and functional organization of cells, is now an established field in biochemical research. Computer Science can help the research in Cell Biology in several ways. For instance, it can provide biologists with models and formalisms able to describe and analyze complex systems such as cells. In the last few years many formalisms, originally developed by computer scientists to model systems of interacting components, have been applied to Biology. Among these, there are Petri Nets, Hybrid Systems, and the π-calculus. Moreover, formalisms such as P Systems, originally developed to study new computational paradigms inspired by Biology, have recently found application to the description of biological phenomena. Finally, some new formalisms have been proposed to describe biomolecular and membrane interactions. The first advantage of using formal models to describe biological systems is that they avoid ambiguities. In fact, ambiguity is often a problem of the notations used by biologists. Moreover, the formal modeling of biological systems allows the development of simulators, which can be used to understand how the described system behaves in normal conditions, and how it reacts to changes in the environment and to alterations of some of its components. Furthermore, formal models allow the verification of properties of the described systems, by means of tools (such as model checkers) which are well established and widely used in other application fields of Computer Science, but unknown to biologists. In this thesis we develop a formalism for the description of biological systems, called Calculus of Looping Sequences (CLS), based on term rewriting and including some typical features of process calculi for concurrency. What we want to achieve is a formalism that allows describing proteins, DNA fragments, membranes and other macromolecules, without ignoring the physical structure of these elements, and by keeping the syntax and the semantics of the formalism as simple as possible. CLS terms are constructed from an alphabet of basic symbols (representing simple molecules) and include operators for the creation of sequences (representing proteins and DNA fragments), of closed sequences which may contain something (representing membranes), and of multisets of all these elements (representing juxtaposition). A CLS term describes the structure of the system under study, and its evolution is given by the application of rewrite rules describing the events that may occur in the system, and how the system changes after the occurrence of one of these events. We equip CLS with an operational semantics describing the possible evolutions of the system by means of application of given rewrite rules, and we show that other formalisms for the description of membranes can be encoded into CLS in a sound and complete way. We propose bisimilarity as a tool to verify properties of the described systems. Bisimilarity is widely accepted as the finest extensional behavioral equivalence one may want to impose on systems. It may be used to verify a property of a system by assessing the bisimilarity of the considered system with a system one knows to enjoy that property. To define bisimilarity of systems, these must have semantics based on labeled transition relations capturing potential external interactions between systems and their environment. A labeled transition semantics for CLS is derived from rewrite rules by using as labels contexts that would allow rules to be applied. We define bisimulation relations upon this semantics, and we show them to be congruences with respect to the operators on terms. In order to model quantitative aspects of biological systems, such as the the frequency of a biological event, we develop a stochastic extension of CLS, called Stochastic CLS. Rates are associated with rewrite rules in order to model the speeds of the described activities. Therefore, transitions derived in Stochastic CLS are driven by exponential distributions, whose rates are obtained from the rates of the applied rewrite rules and characterize the stochastic behavior of the transitions. The choice of the next rule to be applied and of the time of its application is based on the classical Gillespie’s algorithm for simulation of chemical reactions. Stochastic CLS can be used as a formal foundation for a stochastic simulator, but also to build models to be given as an input to model checking tools. In fact, the transition system obtained by the semantics of Stochastic CLS can be easily transformed into a Continuous Time Markov Chain (CTMC). If the set of states of the CTMC is finite (namely, if the set of reachable CLS terms is finite) a standard probabilistic model checker (such as PRISM) can be used to verify properties of the described system. Finally, we propose a translation of Kohn Molecular Interaction Maps (MIMs), a compact graphical notation for biomolecular systems, into Stochastic CLS. By means of our translation, a simulator of systems described with Stochastic CLS can be used to simulate also systems described by using MIMs.

[1]  D. Gillespie Exact Stochastic Simulation of Coupled Chemical Reactions , 1977 .

[2]  Sheldon M. Ross,et al.  Stochastic Processes , 2018, Gauge Integral Structures for Stochastic Calculus and Quantum Electrodynamics.

[3]  Jane Hillston,et al.  A compositional approach to performance modelling , 1996 .

[4]  David Harel,et al.  Statecharts: A Visual Formalism for Complex Systems , 1987, Sci. Comput. Program..

[5]  Nachum Dershowitz,et al.  Termination of Rewriting , 1987, J. Symb. Comput..

[6]  Robin Milner,et al.  Communication and concurrency , 1989, PHI Series in computer science.

[7]  Przemyslaw Prusinkiewicz,et al.  The Algorithmic Beauty of Plants , 1990, The Virtual Laboratory.

[8]  Jan van Leeuwen,et al.  Handbook of Theoretical Computer Science, Vol. B: Formal Models and Semantics , 1994 .

[9]  Wolfgang Thomas,et al.  Handbook of Theoretical Computer Science, Volume B: Formal Models and Semantics , 1990 .

[10]  Pedro Mendes,et al.  GEPASI: a software package for modelling the dynamics, steady states and control of biochemical and other systems , 1993, Comput. Appl. Biosci..

[11]  Corrado Priami,et al.  Stochastic pi-Calculus , 1995, Comput. J..

[12]  Davide Sangiorgi,et al.  Bisimulation for Higher-Order Process Calculi , 1994, Inf. Comput..

[13]  J. Keasling,et al.  Mathematical Model of the lac Operon: Inducer Exclusion, Catabolite Repression, and Diauxic Growth on Glucose and Lactose , 1997, Biotechnology progress.

[14]  P Borella,et al.  Sorbitol dehydrogenase from bovine lens: purification and properties. , 1997, Archives of biochemistry and biophysics.

[15]  Luca Cardelli,et al.  Mobile Ambients , 1998, FoSSaCS.

[16]  Andrew D. Gordon,et al.  A Concurrent Object Calculus: Reduction and Typing , 1998, HLCL.

[17]  Robin Milner,et al.  Communicating and mobile systems - the Pi-calculus , 1999 .

[18]  Martín Abadi,et al.  Object Types against Races , 1999, CONCUR.

[19]  David Harel,et al.  LSCs: Breathing Life into Message Sequence Charts , 1999, Formal Methods Syst. Des..

[20]  Gheorghe Paun,et al.  Computing with Membranes , 2000, J. Comput. Syst. Sci..

[21]  H Matsuno,et al.  Hybrid Petri net representation of gene regulatory network. , 1999, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[22]  S. Tapscott,et al.  A basis for a visual language for describing, archiving and analyzing functional models of complex biological systems , 2001, Genome Biology.

[23]  Robin Milner,et al.  Deriving Bisimulation Congruences for Reactive Systems , 2000, CONCUR.

[24]  Davide Sangiorgi,et al.  Communicating and Mobile Systems: the π-calculus, , 2000 .

[25]  J. Dumont,et al.  The visual display of regulatory information and networks. , 2000, Trends in cell biology.

[26]  David Harel,et al.  The immune system as a reactive system: modeling T cell activation with statecharts , 2001, Proceedings IEEE Symposia on Human-Centric Computing Languages and Environments (Cat. No.01TH8587).

[27]  K. Kohn Molecular interaction maps as information organizers and simulation guides. , 2001, Chaos.

[28]  Corrado Priami,et al.  Application of a stochastic name-passing calculus to representation and simulation of molecular processes , 2001, Inf. Process. Lett..

[29]  Calin Belta,et al.  Hybrid Modeling and Simulation of Biomolecular Networks , 2001, HSCC.

[30]  Hiroaki Kitano,et al.  Foundations of systems biology , 2001 .

[31]  Aviv Regev,et al.  Representation and Simulation of Biochemical Processes Using the pi-Calculus Process Algebra , 2000, Pacific Symposium on Biocomputing.

[32]  Peter Sewell,et al.  From rewrite rules to bisimulation congruences , 2002, Theor. Comput. Sci..

[33]  Gheorghe Paun,et al.  Membrane Computing , 2002, Natural Computing Series.

[34]  Gheorghe Paun,et al.  A guide to membrane computing , 2002, Theor. Comput. Sci..

[35]  H. Kitano Systems Biology: A Brief Overview , 2002, Science.

[36]  E. Shapiro,et al.  Cellular abstractions: Cells as computation , 2002, Nature.

[37]  Amir Pnueli,et al.  Formal Modeling of C. elegans Development: A Scenario-Based Approach , 2003, CMSB.

[38]  D. Harel,et al.  Toward rigorous comprehension of biological complexity: modeling, execution, and visualization of thymic T-cell maturation. , 2003, Genome research.

[39]  Hiroaki Kitano,et al.  A graphical notation for biochemical networks , 2003 .

[40]  David Harel A Grand Challenge: Full Reactive Modeling of a Multi-cellular Animal , 2003, HSCC.

[41]  Hiroaki Kitano,et al.  CellDesigner: a process diagram editor for gene-regulatory and biochemical networks , 2003 .

[42]  Vincent Danos,et al.  Modeling and querying biomolecular interaction networks , 2004, Theor. Comput. Sci..

[43]  Corrado Priami,et al.  Modelling biochemical pathways through enhanced pi-calculus , 2004, Theor. Comput. Sci..

[44]  Luca Cardelli,et al.  BioAmbients: an abstraction for biological compartments , 2004, Theor. Comput. Sci..

[45]  Luca Cardelli,et al.  Brane Calculi , 2004, CMSB.

[46]  Corrado Priami,et al.  Beta Binders for Biological Interactions , 2004, CMSB.

[47]  Aviv Regev,et al.  The π-calculus as an Abstraction for Biomolecular Systems , 2004 .

[48]  Gordon D. Plotkin,et al.  A structural approach to operational semantics , 2004, J. Log. Algebraic Methods Program..

[49]  Vincent Danos,et al.  Projective Brane Calculus , 2004, CMSB.

[50]  John N Weinstein,et al.  Molecular Interaction Maps--A Diagrammatic Graphical Language for Bioregulatory Networks , 2004, Science's STKE.

[51]  Luca Cardelli,et al.  Brane Calculi Interactions of Biological Membranes , 2004 .

[52]  Cosimo Laneve,et al.  Formal molecular biology , 2004, Theor. Comput. Sci..

[53]  Katherine C. Chen,et al.  Integrative analysis of cell cycle control in budding yeast. , 2004, Molecular biology of the cell.

[54]  Marta Z. Kwiatkowska,et al.  Probabilistic symbolic model checking with PRISM: a hybrid approach , 2004, International Journal on Software Tools for Technology Transfer.

[55]  Francesco Zappa Nardelli,et al.  Behavioural Theory for Mobile Ambients , 2004, IFIP TCS.

[56]  Paolo Milazzo,et al.  An Alternative to Gillespie's Algorithm for Simulating Chemical Reactions , 2005 .

[57]  Corrado Priami,et al.  Modeling Kohn Interaction Maps with Beta-Binders: An Example , 2005, Trans. Comp. Sys. Biology.

[58]  William S. Hlavacek,et al.  Graphical rule-based representation of signal-transduction networks , 2005, SAC '05.

[59]  Paolo Milazzo,et al.  A Probabilistic Model for Molecular Systems , 2005, Fundam. Informaticae.

[60]  H. Kitano,et al.  A comprehensive pathway map of epidermal growth factor receptor signaling , 2005, Molecular systems biology.

[61]  L. Cardelli,et al.  Interactions of Biological Membranes , 2005 .

[62]  Francesco Zappa Nardelli,et al.  Behavioural Theory for Mobile Ambients , 2004, IFIP TCS.

[63]  Darren J. Wilkinson Stochastic Modelling for Systems Biology , 2006 .

[64]  Mirit I Aladjem,et al.  Circuit diagrams for biological networks , 2006, Molecular systems biology.

[65]  Luca Cardelli,et al.  An universality result for a (mem)brane calculus based on mate/drip operations , 2006, Int. J. Found. Comput. Sci..

[66]  Luca Cardelli,et al.  A Compositional Approach to the Stochastic Dynamics of Gene Networks , 2006, Trans. Comp. Sys. Biology.

[67]  Roberto Barbuti,et al.  Stochastic CLS for the Modeling and Simulation of Biological Systems , 2006 .

[68]  Paolo Milazzo,et al.  Bisimulation Congruences in the Calculus of Looping Sequences , 2006, ICTAC.

[69]  Céline Kuttler,et al.  Simulating Bacterial Transcription and Translation in a Stochastic pi Calculus , 2006, Trans. Comp. Sys. Biology.

[70]  J. Weinstein,et al.  Molecular interaction maps of bioregulatory networks: a general rubric for systems biology. , 2005, Molecular biology of the cell.

[71]  Paolo Milazzo,et al.  A Calculus of Looping Sequences for Modelling Microbiological Systems , 2006, Fundam. Informaticae.

[72]  Paola Lecca,et al.  Cell Cycle Control in Eukaryotes: A BioSpi model , 2007, Electron. Notes Theor. Comput. Sci..

[73]  Paolo Milazzo,et al.  Extending the Calculus of Looping Sequences to Model Protein Interaction at the Domain Level , 2007, ISBRA.

[74]  Paolo Milazzo,et al.  The Calculus of Looping Sequences for Modeling Biological Membranes , 2007, Workshop on Membrane Computing.

[75]  Cosimo Laneve,et al.  A Simple Calculus for Proteins and Cells , 2007, Electron. Notes Theor. Comput. Sci..

[76]  Paolo Milazzo,et al.  Bisimulations in calculi modelling membranes , 2008, Formal Aspects of Computing.

[77]  N. A C H U M D E R S H O W I T Z Termination of Rewriting' , 2022 .