Learning probabilistic models for model checking: an evolutionary approach and an empirical study

Many automated system analysis techniques (e.g., model checking, model-based testing) rely on first obtaining a model of the system under analysis. System modeling is often done manually, which is often considered as a hindrance to adopt model-based system analysis and development techniques. To overcome this problem, researchers have proposed to automatically “learn” models based on sample system executions and shown that the learned models can be useful sometimes. There are however many questions to be answered. For instance, how much shall we generalize from the observed samples and how fast would learning converge? Or, would the analysis result based on the learned model be more accurate than the estimation we could have obtained by sampling many system executions within the same amount of time? Moreover, how well does learning scale to real-world applications? If the answer is negative, what are the potential methods to improve the efficiency of learning? In this work, we first investigate existing algorithms for learning probabilistic models for model checking and propose an evolution-based approach for better controlling the degree of generalization. Then, we present existing approaches to learn abstract models to improve the efficiency of learning for scalability reasons. Lastly, we conduct an empirical study in order to answer the above questions. Our findings include that the effectiveness of learning may sometimes be limited and it is worth investigating how abstraction should be done properly in order to learn abstract models.

[1]  Kim G. Larsen,et al.  Learning deterministic probabilistic automata from a model checking perspective , 2016, Machine Learning.

[2]  Dana Ron,et al.  On the learnability and usage of acyclic probabilistic finite automata , 1995, COLT '95.

[3]  Kim G. Larsen,et al.  Learning Markov Decision Processes for Model Checking , 2012, QFM.

[4]  Dana Angluin,et al.  Learning Regular Sets from Queries and Counterexamples , 1987, Inf. Comput..

[5]  Axel Legay,et al.  Faster Statistical Model Checking by Means of Abstraction and Learning , 2014, RV.

[6]  Pierre Dupont,et al.  Stochastic Grammatical Inference with Multinomial Tests , 2002, ICGI.

[7]  Christian Rohr,et al.  Simulative Model Checking of Steady State and Time-Unbounded Temporal Operators , 2013, Trans. Petri Nets Other Model. Concurr..

[8]  Leslie G. Valiant,et al.  A theory of the learnable , 1984, CACM.

[9]  Y. Guédon Estimating Hidden Semi-Markov Chains From Discrete Sequences , 2003 .

[10]  Håkan L. S. Younes,et al.  Statistical Verification of Probabilistic Properties with Unbounded Until , 2010, SBMF.

[11]  Éric Tanter,et al.  Supporting dynamic crosscutting with partial behavioral reflection: a case study , 2004, XXIV International Conference of the Chilean Computer Science Society.

[12]  Grigore Rosu,et al.  Synthesizing Monitors for Safety Properties , 2002, TACAS.

[13]  Dana Ron,et al.  The power of amnesia: Learning probabilistic automata with variable memory length , 1996, Machine Learning.

[14]  Håkan L. S. Younes,et al.  Probabilistic Verification of Discrete Event Systems Using Acceptance Sampling , 2002, CAV.

[15]  José Oncina,et al.  Learning deterministic regular grammars from stochastic samples in polynomial time , 1999, RAIRO Theor. Informatics Appl..

[16]  S. V. N. Vishwanathan,et al.  Learnability of Probabilistic Automata via Oracles , 2005, ALT.

[17]  Marc Sebban,et al.  Improvement of the State Merging Rule on Noisy Data in Probabilistic Grammatical Inference , 2003, ECML.

[18]  Michael K. Reiter,et al.  Crowds: anonymity for Web transactions , 1998, TSEC.

[19]  Axel Legay,et al.  Statistical Model Checking: An Overview , 2010, RV.

[20]  Xiaoyu Song,et al.  Integrating Evolutionary Computation with Abstraction Refinement for Model Checking , 2010, IEEE Transactions on Computers.

[21]  Jun Pang,et al.  ASSA-PBN: An Approximate Steady-State Analyser of Probabilistic Boolean Networks , 2015, ATVA.

[22]  Håkan L. S. Younes,et al.  Numerical vs. statistical probabilistic model checking , 2006, International Journal on Software Tools for Technology Transfer.

[23]  Alon Itai,et al.  Symmetry breaking in distributed networks , 1990, Inf. Comput..

[24]  Amaury Habrard,et al.  Learning Rational Stochastic Languages , 2006, COLT.

[25]  Mahesh Viswanathan,et al.  Learning continuous time Markov chains from sample executions , 2004, First International Conference on the Quantitative Evaluation of Systems, 2004. QEST 2004. Proceedings..

[26]  Sandeep K. Shukla,et al.  Evaluating the reliability of NAND multiplexing with PRISM , 2005, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems.

[27]  Christel Baier,et al.  Principles of model checking , 2008 .

[28]  Jingyi Wang,et al.  Toward 'verifying' a Water Treatment System , 2017, ArXiv.

[29]  Mahesh Viswanathan,et al.  Statistical Model Checking of Black-Box Probabilistic Systems , 2004, CAV.

[30]  R. Khan,et al.  Sequential Tests of Statistical Hypotheses. , 1972 .

[31]  Håkan L. S. Younes,et al.  Statistical probabilistic model checking with a focus on time-bounded properties , 2006, Inf. Comput..

[32]  Hassen Saïdi,et al.  Construction of Abstract State Graphs with PVS , 1997, CAV.

[33]  Moshe Y. Vardi,et al.  Experimental Evaluation of Classical Automata Constructions , 2005, LPAR.

[34]  Kim G. Larsen,et al.  Learning Probabilistic Automata for Model Checking , 2011, 2011 Eighth International Conference on Quantitative Evaluation of SysTems.

[35]  Cees Witteveen,et al.  A Likelihood-Ratio Test for Identifying Probabilistic Deterministic Real-Time Automata from Positive Data , 2010, ICGI.

[36]  Krishnendu Chatterjee,et al.  Verification of Markov Decision Processes Using Learning Algorithms , 2014, ATVA.

[37]  Ted Herman,et al.  Probabilistic Self-Stabilization , 1990, Information Processing Letters.

[38]  Marijn J. H. Heule,et al.  Exact DFA Identification Using SAT Solvers , 2010, ICGI.

[39]  Andrea Bianco,et al.  Model Checking of Probabalistic and Nondeterministic Systems , 1995, FSTTCS.

[40]  Alexander Clark,et al.  PAC-learnability of Probabilistic Deterministic Finite State Automata , 2004, J. Mach. Learn. Res..

[41]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[42]  Edward R. Dougherty,et al.  From Boolean to probabilistic Boolean networks as models of genetic regulatory networks , 2002, Proc. IEEE.

[43]  Marta Z. Kwiatkowska,et al.  PRISM: Probabilistic Symbolic Model Checker , 2002, Computer Performance Evaluation / TOOLS.

[44]  Edmund M. Clarke,et al.  Statistical Model Checking for Cyber-Physical Systems , 2011, ATVA.

[45]  José Oncina,et al.  Learning Stochastic Regular Grammars by Means of a State Merging Method , 1994, ICGI.

[46]  Martin Leucker,et al.  Monitoring of Real-Time Properties , 2006, FSTTCS.

[47]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[48]  Stephan Merz,et al.  Model Checking , 2000 .

[49]  Marius Bozga,et al.  Improved Learning for Stochastic Timed Models by State-Merging Algorithms , 2017, NFM.

[50]  Kim G. Larsen,et al.  Learning Markov Models for Stationary System Behaviors , 2012, NASA Formal Methods.

[51]  Frits W. Vaandrager,et al.  Proof-Checking a Data Link Protocol , 1994, TYPES.

[52]  Vitaly Shmatikov,et al.  Analysis of probabilistic contract signing , 2002, J. Comput. Secur..

[53]  Qin Lin,et al.  Interpreting Finite Automata for Sequential Data , 2016, NIPS 2016.

[54]  Marta Z. Kwiatkowska,et al.  The PRISM Benchmark Suite , 2012, 2012 Ninth International Conference on Quantitative Evaluation of Systems.

[55]  Håkan L. S. Younes,et al.  Verification and planning for stochastic processes with asynchronous events , 2004 .

[56]  Jun Sun,et al.  Improving Probability Estimation Through Active Probabilistic Model Learning , 2017, ICFEM.