Stochastic Analysis of Synthetic Genetic Circuits

Over the past few decades, synthetic biology has generated great interest to biologists and engineers alike. Synthetic biology combines the research of biology with the engineering principles of standards, abstraction, and automated construction with the ultimate goal of being able to design and build useful biological systems. To realize this goal, researchers are actively working on better ways to model and analyze synthetic genetic circuits, groupings of genes that influence the expression of each other through the use of proteins. When designing and analyzing genetic circuits, researchers are often interested in building circuits that exhibit a particular behavior. Usually, this involves simulating their models to produce some time series data and analyzing this data to discern whether or not the circuit behaves appropriately. This method becomes less attractive as circuits grow in complexity because it becomes very time consuming to generate a sufficient amount of runs for analysis. In addition, trying to select representative runs out of a large data set is tedious and error-prone thereby motivating methods of automating this analysis. This has led to the need for design space exploration techniques that allow synthetic biologists to easily explore the effect of varying parameters and efficiently consider alternative designs of their systems. This dissertation attempts to address this need by proposing new analysis and verification techniques for synthetic genetic circuits. In particular, it applies formal methods such as model checking techniques to models of genetic circuits in order to ensure that they behave correctly and are as robust as possible for a variety of different inputs and/or parameter settings. However, model checking stochastic systems is not as simple as model checking deterministic systems where it is always known what the next state of the system will be at any given step. Stochastic systems can exhibit a variety of different behaviors that are chosen randomly with different probabilities at each time step. Therefore, model checking a stochastic system involves calculating the probability that the system will exhibit a desired behavior. Although it is often more difficult to work with the probabilities that stochastic systems introduce, stochastic systems and the models that represent them are becoming commonplace in many disciplines including electronic circuit design where as parts are being made smaller and smaller, they are becoming less reliable. In addition to stochastic model checking, this dissertation proposes a new incremental stochastic simulation algorithm (iSSA) based on Gillespie’s stochastic simulation algorithm (SSA) that is capable of presenting a researcher with a simulation trace of the typical behavior of the system. Before the development of this algorithm, discerning this information was extremely error-prone as it involved performing many simulations and attempting to wade through the massive amounts of data. This algorithm greatly aids researchers in designing genetic circuits as it efficiently shows the researcher the most likely behavior of the circuit. Both the iSSA and stochastic model checking can be used in concert to give a researcher the likelihood that the system will exhibit its most typical behavior. Once the typical behavior is known, properties for nontypical behaviors can be constructed and their likelihoods can also be computed. This methodology is applied to several genetic circuits leading to new understanding of the effects of various parameters on the behavior of these circuits.

[1]  Chris J. Myers,et al.  Effecient Stochastic Simulation to Analyze Targeted Properties of Biological Systems , 2010 .

[2]  Javier Esparza,et al.  Model checking probabilistic pushdown automata , 2004, Proceedings of the 19th Annual IEEE Symposium on Logic in Computer Science, 2004..

[3]  J. Liao,et al.  Metabolic engineering for advanced biofuels production from Escherichia coli. , 2008, Current opinion in biotechnology.

[4]  Denis Thieffry,et al.  Dynamical behaviour of biological regulatory networks—II. Immunity control in bacteriophage lambda , 1995 .

[5]  Muffy Calder,et al.  Some Investigations Concerning the CTMC and the ODE Model Derived From Bio-PEPA , 2009, FBTC@ICALP.

[6]  Thomas A. Henzinger,et al.  Hybrid numerical solution of the chemical master equation , 2010, CMSB '10.

[7]  Marta Z. Kwiatkowska,et al.  Probabilistic model checking of complex biological pathways , 2008, Theor. Comput. Sci..

[8]  D. Gillespie Approximate accelerated stochastic simulation of chemically reacting systems , 2001 .

[9]  David Hsu,et al.  Probabilistic approximations of ODEs based bio-pathway dynamics , 2011, Theor. Comput. Sci..

[10]  David R. Gilbert,et al.  Analysis of Signalling Pathways Using Continuous Time Markov Chains , 2006, Trans. Comp. Sys. Biology.

[11]  Hiroaki Kitano,et al.  The systems biology markup language (SBML): a medium for representation and exchange of biochemical network models , 2003, Bioinform..

[12]  Hiroyuki Kuwahara,et al.  Model Abstraction and Temporal Behavior Analysis of Genetic Regulatory Networks , 2007 .

[13]  P. Waage,et al.  Studies concerning affinity , 1986 .

[14]  Bengt Jonsson,et al.  A logic for reasoning about time and reliability , 1990, Formal Aspects of Computing.

[15]  Michael S. Samoilov,et al.  Automated Abstraction Methodology for Genetic Regulatory Networks , 2006, Trans. Comp. Sys. Biology.

[16]  D. Endy Foundations for engineering biology , 2005, Nature.

[17]  D. Gillespie Exact Stochastic Simulation of Coupled Chemical Reactions , 1977 .

[18]  S A Kauffman,et al.  Control of sequential compartment formation in Drosophila. , 1978, Science.

[19]  Arantxa Etxeverria The Origins of Order , 1993 .

[20]  Chris J. Myers,et al.  The Design of a Genetic Muller C-Element , 2007, 13th IEEE International Symposium on Asynchronous Circuits and Systems (ASYNC'07).

[21]  Hiroyuki Kuwahara,et al.  An efficient and exact stochastic simulation method to analyze rare events in biochemical systems. , 2008, The Journal of chemical physics.

[22]  Linda R Petzold,et al.  The slow-scale stochastic simulation algorithm. , 2005, The Journal of chemical physics.

[23]  Marta Z. Kwiatkowska,et al.  Game-based Abstraction for Markov Decision Processes , 2006, Third International Conference on the Quantitative Evaluation of Systems - (QEST'06).

[24]  M. Bennett,et al.  A fast, robust, and tunable synthetic gene oscillator , 2008, Nature.

[25]  D. Gillespie Markov Processes: An Introduction for Physical Scientists , 1991 .

[26]  William J. Stewart,et al.  Introduction to the numerical solution of Markov Chains , 1994 .

[27]  Timothy S. Ham,et al.  Production of the antimalarial drug precursor artemisinic acid in engineered yeast , 2006, Nature.

[28]  Håkan L. S. Younes,et al.  Numerical vs. statistical probabilistic model checking , 2006, International Journal on Software Tools for Technology Transfer.

[29]  R. Thomas,et al.  Boolean formalization of genetic control circuits. , 1973, Journal of theoretical biology.

[30]  Andrew Hinton,et al.  PRISM: A Tool for Automatic Verification of Probabilistic Systems , 2006, TACAS.

[31]  Christopher A. Voigt,et al.  Environmentally controlled invasion of cancer cells by engineered bacteria. , 2006, Journal of molecular biology.

[32]  F. Crick Central Dogma of Molecular Biology , 1970, Nature.

[33]  Chris J. Myers,et al.  Engineering Genetic Circuits , 2009 .

[34]  D. Gillespie The chemical Langevin equation , 2000 .

[35]  Linda R. Petzold,et al.  Improved leap-size selection for accelerated stochastic simulation , 2003 .

[36]  Victor de Lorenzo,et al.  Genetically modified organisms for the environment: stories of success and failure and what we have learned from them. , 2005, International microbiology : the official journal of the Spanish Society for Microbiology.

[37]  Thomas A. Henzinger,et al.  Sliding Window Abstraction for Infinite Markov Chains , 2009, CAV.

[38]  Hidde de Jong,et al.  Modeling and Simulation of Genetic Regulatory Systems: A Literature Review , 2002, J. Comput. Biol..

[39]  Robert K. Brayton,et al.  Model-checking continuous-time Markov chains , 2000, TOCL.

[40]  S. Kauffman Metabolic stability and epigenesis in randomly constructed genetic nets. , 1969, Journal of theoretical biology.

[41]  Michal Linial,et al.  Using Bayesian Networks to Analyze Expression Data , 2000, J. Comput. Biol..

[42]  F. A. Seiler,et al.  Numerical Recipes in C: The Art of Scientific Computing , 1989 .

[43]  Stuart A. Kauffman,et al.  The origins of order , 1993 .

[44]  Christel Baier,et al.  Partial Order Reduction for Markov Decision Processes: A Survey , 2005, FMCO.

[45]  S. Leibler,et al.  Mechanisms of noise-resistance in genetic oscillators , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[46]  K. Burrage,et al.  A Krylov-based finite state projection algorithm for solving the chemical master equation arising in the discrete modelling of biological systems , 2006 .

[47]  Kousha Etessami,et al.  Verifying Probabilistic Procedural Programs , 2004, FSTTCS.

[48]  Linda R Petzold,et al.  Refining the weighted stochastic simulation algorithm. , 2009, The Journal of chemical physics.

[49]  Bengt Jonsson,et al.  A framework for reasoning about time and reliability , 1989, [1989] Proceedings. Real-Time Systems Symposium.

[50]  Alberto Policriti,et al.  Model building and model checking for biochemical processes , 2007, Cell Biochemistry and Biophysics.

[51]  Linda R Petzold,et al.  Efficient step size selection for the tau-leaping simulation method. , 2006, The Journal of chemical physics.

[52]  D. Gillespie A General Method for Numerically Simulating the Stochastic Time Evolution of Coupled Chemical Reactions , 1976 .

[53]  Antonín Kucera Methods for Quantitative Analysis of Probabilistic Pushdown Automata , 2005, INFINITY.

[54]  Henrik Ejersbo Jensen,et al.  Reachability Analysis of Probabilistic Systems by Successive Refinements , 2001, PAPM-PROBMIV.

[55]  Joost-Pieter Katoen,et al.  Three-Valued Abstraction for Continuous-Time Markov Chains , 2007, CAV.

[56]  Chris J. Myers,et al.  iSSA: An incremental stochastic simulation algorithm for genetic circuits , 2010, Proceedings of 2010 IEEE International Symposium on Circuits and Systems.

[57]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[58]  A. Shwartz,et al.  Handbook of Markov decision processes : methods and applications , 2002 .

[59]  Aidan P Thompson,et al.  A constant-time kinetic Monte Carlo algorithm for simulation of large biochemical reaction networks. , 2008, The Journal of chemical physics.

[60]  Kishor S. Trivedi,et al.  An Aggregation Technique for the Transient Analysis of Stiff Markov Chains , 1986, IEEE Transactions on Computers.

[61]  J. Collins,et al.  Construction of a genetic toggle switch in Escherichia coli , 2000, Nature.

[62]  Werner Sandmann,et al.  Efficient calculation of rare event probabilities in Markovian queueing networks , 2011, VALUETOOLS.

[63]  Michael A. Gibson,et al.  Efficient Exact Stochastic Simulation of Chemical Systems with Many Species and Many Channels , 2000 .

[64]  Lijun Zhang,et al.  Time-bounded model checking of infinite-state continuous-time Markov chains , 2008, 2008 8th International Conference on Application of Concurrency to System Design.

[65]  Parosh Aziz Abdulla,et al.  Reasoning about Probabilistic Lossy Channel Systems , 2000, CONCUR.

[66]  Kousha Etessami,et al.  Algorithmic Verification of Recursive Probabilistic State Machines , 2005, TACAS.

[67]  Journal of Chemical Physics , 1932, Nature.

[68]  Marta Z. Kwiatkowska,et al.  Stochastic Model Checking , 2007, SFM.

[69]  Parosh Aziz Abdulla,et al.  Verification of Probabilistic Systems with Faulty Communication , 2003, FoSSaCS.

[70]  René Thomas Regulatory networks seen as asynchronous automata: A logical description , 1991 .

[71]  M. Elowitz,et al.  A synthetic oscillatory network of transcriptional regulators , 2000, Nature.

[72]  D. Endy,et al.  Refinement and standardization of synthetic biological parts and devices , 2008, Nature Biotechnology.

[73]  Adam Arkin,et al.  Setting the standard in synthetic biology , 2008, Nature Biotechnology.

[74]  Alexander Moshe Rabinovich,et al.  Quantitative Analysis of Probabilistic Lossy Channel Systems , 2003, ICALP.

[75]  Zoran Ognjanovic Discrete Linear-time Probabilistic Logics: Completeness, Decidability and Complexity , 2006, J. Log. Comput..

[76]  L. A. Segel,et al.  The Quasi-Steady-State Assumption: A Case Study in Perturbation , 1989, SIAM Rev..

[77]  Parosh Aziz Abdulla,et al.  Verifying infinite Markov chains with a finite attractor or the global coarseness property , 2005, 20th Annual IEEE Symposium on Logic in Computer Science (LICS' 05).

[78]  Moshe Y. Vardi Probabilistic Linear-Time Model Checking: An Overview of the Automata-Theoretic Approach , 1999, ARTS.

[79]  Luca de Alfaro,et al.  Magnifying-Lens Abstraction for Markov Decision Processes , 2007, CAV.

[80]  Anne Remke,et al.  Model checking structured infinite Markov chains , 2008 .

[81]  M. Khammash,et al.  The finite state projection algorithm for the solution of the chemical master equation. , 2006, The Journal of chemical physics.

[82]  Chris J. Myers,et al.  Tablet—next generation sequence assembly visualization , 2009, Bioinform..

[83]  Thomas A. Henzinger,et al.  Approximation of event probabilities in noisy cellular processes , 2009, Theor. Comput. Sci..