Efficient Parallel Statistical Model Checking of Biochemical Networks

We consider the problem of verifying stochastic models of biochemical networks against behavioral properties expressed in temporal logic terms. Exact probabilistic verification approaches such as, for example, CSL/PCTL model checking, are undermined by a huge computational demand which rule them out for most real case studies. Less demanding approaches, such as statistical model checking, estimate the likelihood that a property is satisfied by sampling executions out of the stochastic model. We propose a methodology for efficiently estimating the likelihood that a LTL property P holds of a stochastic model of a biochemical network. As with other statistical verification techniques, the methodology we propose uses a stochastic simulation algorithm for generating execution samples, however there are three key aspects that improve the efficiency: first, the sample generation is driven by on-the-fly verification of P which results in optimal overall simulation time. Second, the confidence interval estimation for the probability of P to hold is based on an efficient variant of the Wilson method which ensures a faster convergence. Third, the whole methodology is designed according to a parallel fashion and a prototype software tool has been implemented that performs the sampling/verification process in parallel over an HPC architecture.

[1]  Paolo Ballarini,et al.  Studying Irreversible Transitions in a Model of Cell Cycle Regulation , 2009, PASM@EPEW.

[2]  Lubos Brim,et al.  BioDiVinE: A Framework for Parallel Analysis of Biological Models , 2009, COMPMOD.

[3]  Thomas Hérault,et al.  APMC 3.0: Approximate Verification of Discrete and Continuous Time Markov Chains , 2006, Third International Conference on the Quantitative Evaluation of Systems - (QEST'06).

[4]  Joost-Pieter Katoen,et al.  A Markov reward model checker , 2005, Second International Conference on the Quantitative Evaluation of Systems (QEST'05).

[5]  David Gilbert,et al.  A Monte Carlo model checker for probabilistic LTL with numerical constraints , 2008 .

[6]  Christel Baier,et al.  Model-Checking Algorithms for Continuous-Time Markov Chains , 2002, IEEE Trans. Software Eng..

[7]  E. B. Wilson Probable Inference, the Law of Succession, and Statistical Inference , 1927 .

[8]  M. Crosby,et al.  Cell Cycle: Principles of Control , 2007, The Yale Journal of Biology and Medicine.

[9]  Tianhai Tian,et al.  Parallel implementation of stochastic simulation for large-scale cellular processes , 2005, Eighth International Conference on High-Performance Computing in Asia-Pacific Region (HPCASIA'05).

[10]  David O. Morgan,et al.  Principles of CDK regulation , 1995, Nature.

[11]  Marta Kwiatkowska,et al.  Probabilistic model checking for systems biology , 2011 .

[12]  Fred Kröger,et al.  Temporal Logic of Programs , 1987, EATCS Monographs on Theoretical Computer Science.

[13]  François Fages,et al.  Machine Learning Biochemical Networks from Temporal Logic Properties , 2006, Trans. Comp. Sys. Biology.

[14]  François Fages,et al.  On the Analysis of Numerical Data Time Series in Temporal Logic , 2007, CMSB.

[15]  Hong Li,et al.  Algorithms and Software for Stochastic Simulation of Biochemical Reacting Systems , 2008, Biotechnology progress.

[16]  Fred Krögr Temporal Logic Of Programs , 1987 .

[17]  Axel Legay,et al.  Statistical Model Checking in BioLab: Applications to the Automated Analysis of T-Cell Receptor Signaling Pathway , 2008, CMSB.

[18]  L. Brown,et al.  Interval Estimation for a Binomial Proportion , 2001 .

[19]  D. Gillespie Exact Stochastic Simulation of Coupled Chemical Reactions , 1977 .

[20]  T. Tony Cai,et al.  Confidence Intervals for a binomial proportion and asymptotic expansions , 2002 .

[21]  Hiroaki Kitano,et al.  Foundations of systems biology , 2001 .

[22]  Robert K. Brayton,et al.  Model-checking continuous-time Markov chains , 2000, TOCL.

[23]  William J. Stewart,et al.  Introduction to the numerical solution of Markov Chains , 1994 .

[24]  Corrado Priami,et al.  The BlenX Language: A Tutorial , 2008, SFM.

[25]  Rosita Guido,et al.  Taming the complexity of biological pathways through parallel computing , 2008, Briefings Bioinform..

[26]  Dan ie l T. Gil lespie A rigorous derivation of the chemical master equation , 1992 .

[27]  A. Bobbio,et al.  Kronecker representation of stochastic Petri nets with discrete PH distributions , 1998, Proceedings. IEEE International Computer Performance and Dependability Symposium. IPDS'98 (Cat. No.98TB100248).

[28]  Gianfranco Ciardo,et al.  Efficient Reachability Set Generation and Storage Using Decision Diagrams , 1999, ICATPN.

[29]  Håkan L. S. Younes,et al.  Numerical vs. statistical probabilistic model checking , 2006, International Journal on Software Tools for Technology Transfer.

[30]  Bengt Jonsson,et al.  A framework for reasoning about time and reliability , 1989, [1989] Proceedings. Real-Time Systems Symposium.

[31]  Walter W. Piegorsch,et al.  Sample sizes for improved binomial confidence intervals , 2004, Comput. Stat. Data Anal..

[32]  Marta Z. Kwiatkowska,et al.  PRISM: probabilistic model checking for performance and reliability analysis , 2009, PERV.

[33]  I. Cameron,et al.  Cell cycle controls , 1974 .