Model validation and discovery for complex stochastic system

In this thesis, we study two fundamental problems that arise in the modeling of stochastic systems: (i) Validation of stochastic models against behavioral specifications such as temporal logics, and (ii) Discovery of kinetic parameters of stochastic biochemical models from behavioral specifications. We present a new Bayesian algorithm for Statistical Model Checking of stochastic systems based on a sequential version of Jeffreys' Bayes Factor test. We argue that the Bayesian approach is more suited for application domains like systems biology modeling, where distributions on nuisance parameters and priors may be known. We prove that our Bayesian Statistical Model Checking algorithm terminates for a large subclass of prior probabilities. We also characterize the Type I/II errors associated with our algorithm. We experimentally demonstrate that this algorithm is suitable for the analysis of complex biochemical models like those written in the BioNetGen language. We then argue that i.i.d. sampling based Statistical Model Checking algorithms are not an effective way to study rare behaviors of stochastic models and present another Bayesian Statistical Model Checking algorithm that can incorporate non-i.i.d. sampling strategies. We also present algorithms for synthesis of chemical kinetic parameters of stochastic biochemical models from high level behavioral specifications. We consider the setting where a modeler knows facts that must hold on the stochastic model but is not confident about some of the kinetic parameters in her model. We suggest algorithms for discovering these kinetic parameters from facts stated in appropriate formal probabilistic specification languages. Our algorithms are based on our theoretical results characterizing the probability of a specification being true on a stochastic biochemical model. We have applied this algorithm to discover kinetic parameters for biochemical models with as many as six unknown parameters.

[1]  Aditya V. Thakur,et al.  The Yogi Project : Software Property Checking via Static Analysis and Testing , 2009 .

[2]  Avner Landver,et al.  The ForSpec Temporal Logic: A New Temporal Property-Specification Language , 2002, TACAS.

[3]  Christel Baier,et al.  Model-Checking Algorithms for Continuous-Time Markov Chains , 2002, IEEE Trans. Software Eng..

[4]  M. Davis,et al.  Kinetic discrimination in T-cell activation. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[5]  D. Gillespie A General Method for Numerically Simulating the Stochastic Time Evolution of Coupled Chemical Reactions , 1976 .

[6]  J. Andel Sequential Analysis , 2022, The SAGE Encyclopedia of Research Design.

[7]  James R Faeder,et al.  Rule-based modeling of biochemical systems with BioNetGen. , 2009, Methods in molecular biology.

[8]  Edmund M. Clarke,et al.  Counterexample-guided abstraction refinement , 2003, 10th International Symposium on Temporal Representation and Reasoning, 2003 and Fourth International Conference on Temporal Logic. Proceedings..

[9]  Felisa J. Vázquez-Abad,et al.  Accelerated simulation for pricing Asian options , 1998, 1998 Winter Simulation Conference. Proceedings (Cat. No.98CH36274).

[10]  J. Hull Options, Futures, and Other Derivatives , 1989 .

[11]  Petar M. Djuric,et al.  Gaussian particle filtering , 2003, IEEE Trans. Signal Process..

[12]  T. Narumi,et al.  Protein Explorer: A Petaflops Special-Purpose Computer System for Molecular Dynamics Simulations , 2003, ACM/IEEE SC 2003 Conference (SC'03).

[13]  Mihalis Yannakakis,et al.  The complexity of probabilistic verification , 1995, JACM.

[14]  Håkan L. S. Younes,et al.  Numerical vs. Statistical Probabilistic Model Checking: An Empirical Study , 2004, TACAS.

[15]  Amir Pnueli,et al.  The temporal logic of programs , 1977, 18th Annual Symposium on Foundations of Computer Science (sfcs 1977).

[16]  Robert W. Floyd,et al.  Assigning Meanings to Programs , 1993 .

[17]  Alberto Bemporad,et al.  Hybrid Systems: Computation and Control, 10th International Workshop, HSCC 2007, Pisa, Italy, April 3-5, 2007, Proceedings , 2007, HSCC.

[18]  James R Faeder,et al.  Stochastic effects and bistability in T cell receptor signaling. , 2008, Journal of theoretical biology.

[19]  Håkan L. S. Younes Probabilistic Verification for "Black-Box" Systems , 2005, CAV.

[20]  Doron Drusinsky,et al.  Monitoring Temporal Logic Specifications Combined with Time Series Constraints , 2003, J. Univers. Comput. Sci..

[21]  F. Black,et al.  The Pricing of Options and Corporate Liabilities , 1973, Journal of Political Economy.

[22]  J. Doob Markoff chains—denumerable case , 1945 .

[23]  Yang Cao,et al.  Sensitivity analysis of discrete stochastic systems. , 2005, Biophysical journal.

[24]  Håkan L. S. Younes,et al.  Verification and planning for stochastic processes with asynchronous events , 2004 .

[25]  Robert S. Boyer,et al.  The Correctness Problem in Computer Science , 1982 .

[26]  Edsger W. Dijkstra,et al.  A Discipline of Programming , 1976 .

[27]  Sumit Kumar Jha,et al.  Temporal-logics as query languages for dynamic Bayesian networks: application to D. melanogaster embryo development , 2006 .

[28]  Oded Maler,et al.  Systematic Simulation Using Sensitivity Analysis , 2007, HSCC.

[29]  Mahesh Viswanathan,et al.  On the Expressiveness and Complexity of Randomization in Finite State Monitors , 2008, 2008 23rd Annual IEEE Symposium on Logic in Computer Science.

[30]  R. Ramamoorthi,et al.  Remarks on consistency of posterior distributions , 2008, 0805.3248.

[31]  J. Wolfowitz,et al.  Optimum Character of the Sequential Probability Ratio Test , 1948 .

[32]  S. Goodman Toward Evidence-Based Medical Statistics. 1: The P Value Fallacy , 1999, Annals of Internal Medicine.

[33]  Stephan Merz,et al.  Model Checking , 2000 .

[34]  A. Prasad Sistla,et al.  Monitoring the Full Range of omega-Regular Properties of Stochastic Systems , 2008, VMCAI.

[35]  Arnold Everett Amstutz,et al.  Computer Simulation of Competitive Market Response , 1970 .

[36]  Nando de Freitas,et al.  The Unscented Particle Filter , 2000, NIPS.

[37]  Ian M. Mitchell,et al.  Level Set Methods for Computation in Hybrid Systems , 2000, HSCC.

[38]  François Fages,et al.  Symbolic Model Checking of Biochemical Networks , 2003, CMSB.

[39]  Nicolas Brunel,et al.  Estimating parameters and hidden variables in non-linear state-space models based on ODEs for biological networks inference , 2007, Bioinform..

[40]  Daniel T Gillespie,et al.  Stochastic simulation of chemical kinetics. , 2007, Annual review of physical chemistry.

[41]  I. V. Girsanov On Transforming a Certain Class of Stochastic Processes by Absolutely Continuous Substitution of Measures , 1960 .

[42]  Olivier Coudert,et al.  A Performance Study of BDD-Based Model Checking , 1998, FMCAD.

[43]  Håkan L. S. Younes,et al.  Probabilistic Verification of Discrete Event Systems Using Acceptance Sampling , 2002, CAV.

[44]  Giancarlo Ferrari-Trecate,et al.  Reconstruction of Switching Thresholds in Piecewise-Affine Models of Genetic Regulatory Networks , 2006, HSCC.

[45]  Todd Millstein,et al.  Automatic predicate abstraction of C programs , 2001, PLDI '01.

[46]  François Fages,et al.  BIOCHAM: an environment for modeling biological systems and formalizing experimental knowledge , 2006, Bioinform..

[47]  Christel Baier,et al.  Symbolic Model Checking for Probabilistic Processes , 1997, ICALP.

[48]  Christopher J. Langmead,et al.  Generalized Queries and Bayesian Statistical Model Checking in Dynamic Bayesian Networks: Application to Personalized Medicine , 2009 .

[49]  Michael C. Fu,et al.  Optimal Exercise Policies and Simulation-Based Valuation for American-Asian Options , 2003, Oper. Res..

[50]  Marta Z. Kwiatkowska,et al.  Probabilistic model checking of complex biological pathways , 2008, Theor. Comput. Sci..

[51]  Edmund M. Clarke,et al.  Design and Synthesis of Synchronization Skeletons Using Branching Time Temporal Logic , 2008, 25 Years of Model Checking.

[52]  Jeremy Staum,et al.  Simulation in financial engineering , 2001, Proceeding of the 2001 Winter Simulation Conference (Cat. No.01CH37304).

[53]  Sumit Kumar Jha,et al.  A Counterexample-Guided Approach to Parameter Synthesis for Linear Hybrid Automata , 2008, HSCC.

[54]  Radu Grosu,et al.  Monte Carlo Model Checking , 2005, TACAS.

[55]  Gilles Clermont,et al.  Parameter Synthesis in Nonlinear Dynamical Systems: Application to Systems Biology , 2009, RECOMB.

[56]  François Fages,et al.  From Syntax to Semantics in Systems Biology Towards Automated Reasoning Tools , 2006, Trans. Comp. Sys. Biology.

[57]  Muruhan Rathinam,et al.  Stiffness in stochastic chemically reacting systems: The implicit tau-leaping method , 2003 .

[58]  T. McKeithan,et al.  Kinetic proofreading in T-cell receptor signal transduction. , 1995, Proceedings of the National Academy of Sciences of the United States of America.

[59]  T. Başar,et al.  A New Approach to Linear Filtering and Prediction Problems , 2001 .

[60]  David B. Dunson,et al.  Bayesian Data Analysis , 2010 .

[61]  Bernd Finkbeiner,et al.  Checking Finite Traces using Alternating Automata , 2001, Electron. Notes Theor. Comput. Sci..

[62]  William S. Hlavacek,et al.  BioNetGen: software for rule-based modeling of signal transduction based on the interactions of molecular domains , 2004, Bioinform..

[63]  L. Wasserman,et al.  Computing Bayes Factors by Combining Simulation and Asymptotic Approximations , 1997 .

[64]  Axel Legay,et al.  Statistical Model Checking in BioLab: Applications to the Automated Analysis of T-Cell Receptor Signaling Pathway , 2008, CMSB.

[65]  Sumit Kumar Jha,et al.  Symbolic Approaches for Finding Control Strategies in Boolean Networks , 2007, APBC.

[66]  D. Rubin,et al.  Bayesian Data Analysis.@@@Bayes and Empirical Bayes Methods for Data Analysis. , 1997 .

[67]  D. Gillespie Exact Stochastic Simulation of Coupled Chemical Reactions , 1977 .

[68]  Hscc Hybrid systems : computation and control : 6th International Workshop, HSCC 2003, Prague, Czech Republic, April 3-5, 2003 : proceedings , 2003 .

[69]  Christel Baier,et al.  LiQuor: A tool for Qualitative and Quantitative Linear Time analysis of Reactive Systems , 2006, Third International Conference on the Quantitative Evaluation of Systems - (QEST'06).

[70]  Vincent Danos,et al.  Modeling and querying biomolecular interaction networks , 2004, Theor. Comput. Sci..

[71]  Håkan L. S. Younes Ymer: A Statistical Model Checker , 2005, CAV.

[72]  Susmit Jha,et al.  Randomization Based Probabilistic Approach to Detect Trojan Circuits , 2008, 2008 11th IEEE High Assurance Systems Engineering Symposium.

[73]  Sumit Kumar Jha,et al.  Predicting Protein Folding Kinetics Via Temporal Logic Model Checking , 2007, WABI.

[74]  Linda R Petzold,et al.  Adaptive explicit-implicit tau-leaping method with automatic tau selection. , 2007, The Journal of chemical physics.

[75]  Sanjit A. Seshia,et al.  Symbolic Reachability Analysis of Lazy Linear Hybrid Automata , 2007, FORMATS.

[76]  Marta Z. Kwiatkowska,et al.  Probabilistic model checking in practice: case studies with PRISM , 2005, PERV.

[77]  Mahesh Viswanathan,et al.  Statistical Model Checking of Black-Box Probabilistic Systems , 2004, CAV.

[78]  Bryan C. Daniels,et al.  Sloppiness, robustness, and evolvability in systems biology. , 2008, Current opinion in biotechnology.

[79]  Frank Ciesinski,et al.  On Probabilistic Computation Tree Logic , 2004, Validation of Stochastic Systems.

[80]  Luca de Alfaro,et al.  Symbolic Model Checking of Probabilistic Processes Using MTBDDs and the Kronecker Representation , 2000, TACAS.

[81]  Grigore Rosu,et al.  Monitoring Algorithms for Metric Temporal Logic Specifications , 2004, RV@ETAPS.

[82]  S. Heston A Closed-Form Solution for Options with Stochastic Volatility with Applications to Bond and Currency Options , 1993 .

[83]  A. Prasad Sistla,et al.  Monitoring Temporal Properties of Stochastic Systems , 2008, VMCAI.

[84]  Thomas A. Henzinger,et al.  The Algorithmic Analysis of Hybrid Systems , 1995, Theor. Comput. Sci..

[85]  Mahesh Viswanathan,et al.  On Statistical Model Checking of Stochastic Systems , 2005, CAV.

[86]  Jeffrey K. Uhlmann,et al.  New extension of the Kalman filter to nonlinear systems , 1997, Defense, Security, and Sensing.

[87]  Andreas Karlsson,et al.  Elementary Survey Sampling , 2007, Technometrics.

[88]  Ashish Tiwari,et al.  Automated Symbolic Reachability Analysis; with Application to Delta-Notch Signaling Automata , 2003, HSCC.

[89]  Thomas Hérault,et al.  Approximate Probabilistic Model Checking , 2004, VMCAI.

[90]  Ron Koymans,et al.  Specifying real-time properties with metric temporal logic , 1990, Real-Time Systems.

[91]  Sumit Kumar Jha,et al.  Synthesis and infeasibility analysis for stochastic models of biochemical systems using statistical model checking and abstraction refinement , 2011, Theor. Comput. Sci..

[92]  Sriram K. Rajamani,et al.  The SLAM project: debugging system software via static analysis , 2002, POPL '02.

[93]  David F. Anderson,et al.  Error analysis of tau-leap simulation methods , 2009, 0909.4790.

[94]  Ernst-Rüdiger Olderog,et al.  Verification of Sequential and Concurrent Programs , 1997, Graduate Texts in Computer Science.

[95]  William S. Hlavacek,et al.  Graphical rule-based representation of signal-transduction networks , 2005, SAC '05.

[96]  Thomas A. Henzinger,et al.  Beyond HYTECH: Hybrid Systems Analysis Using Interval Numerical Methods , 2000, HSCC.

[97]  Håkan L. S. Younes,et al.  Statistical probabilistic model checking with a focus on time-bounded properties , 2006, Inf. Comput..

[98]  Marta Kwiatkowska,et al.  PRISM 2.0: a tool for probabilistic model checking , 2004 .

[99]  T. Naylor Computer Simulation of Competitive Market Response , 1972 .

[100]  Olaf Stursberg,et al.  Efficient Representation and Computation of Reachable Sets for Hybrid Systems , 2003, HSCC.

[101]  P. Glasserman,et al.  Estimating security price derivatives using simulation , 1996 .

[102]  H. McAdams,et al.  Circuit simulation of genetic networks. , 1995, Science.

[103]  Daniel T Gillespie The Monte Carlo Method of Evaluating Integrals , 1975 .

[104]  Frank S. de Boer,et al.  Verification of Sequential and Concurrent Programs , 1997, Texts and Monographs in Computer Science.

[105]  Ahmed Bouajjani,et al.  Symbolic Techniques for Parametric Reasoning about Counter and Clock Systems , 2000, CAV.

[106]  Espen Gaarder Haug,et al.  Why We Have Never Used the Black-Scholes-Merton Option Pricing Formula , 2008 .

[107]  S. Shreve Stochastic Calculus for Finance II: Continuous-Time Models , 2010 .

[108]  Håkan L. S. Younes,et al.  Numerical vs. statistical probabilistic model checking , 2006, International Journal on Software Tools for Technology Transfer.

[109]  François Fages,et al.  On a Continuous Degree of Satisfaction of Temporal Logic Formulae with Applications to Systems Biology , 2008, CMSB.

[110]  Nassim Nicholas Taleb,et al.  The Black Swan: The Impact of the Highly Improbable , 2007 .

[111]  Rajeev Alur,et al.  Symbolic analysis for improving simulation coverage of Simulink/Stateflow models , 2008, EMSOFT '08.

[112]  Joost-Pieter Katoen,et al.  Approximate Parameter Synthesis for Probabilistic Time-Bounded Reachability , 2008, 2008 Real-Time Systems Symposium.

[113]  W. Hoeffding Probability Inequalities for sums of Bounded Random Variables , 1963 .

[114]  François Fages Temporal Logic Constraints in the Biochemical Abstract Machine BIOCHAM , 2005, LOPSTR.

[115]  Calin Belta,et al.  Robustness analysis and tuning of synthetic gene networks , 2007, Bioinform..

[116]  Marta Z. Kwiatkowska,et al.  PRISM 2.0: a tool for probabilistic model checking , 2004, First International Conference on the Quantitative Evaluation of Systems, 2004. QEST 2004. Proceedings..

[117]  Gwendolyn G. Hays Computer-Aided Design: Simulation of Digital Design Logic , 1969, IEEE Transactions on Computers.

[118]  Leslie Lamport,et al.  Proving Liveness Properties of Concurrent Programs , 1982, TOPL.

[119]  François Fages,et al.  A general computational method for robustness analysis with applications to synthetic gene networks , 2009, Bioinform..

[120]  Axel Legay,et al.  A Bayesian Approach to Model Checking Biological Systems , 2009, CMSB.