论文信息 - Reachability in Recursive Markov Decision Processes

Reachability in Recursive Markov Decision Processes

We consider a class of infinite-state Markov decision processes generated by stateless pushdown automata. This class corresponds to $1 \frac{1}{2}$-player games over graphs generated by BPA systems or (equivalently) 1-exit recursive state machines. An extended reachability objective is specified by two sets S and T of safe and terminal stack configurations, where the membership to S and T depends just on the top-of-the-stack symbol. The question is whether there is a suitable strategy such that the probability of hitting a terminal configuration by a path leading only through safe configurations is equal to (or different from) a given x ∈{0,1}. We show that the qualitative extended reachability problem is decidable in polynomial time, and that the set of all configurations for which there is a winning strategy is effectively regular. More precisely, this set can be represented by a deterministic finite-state automaton with a fixed number of control states. This result is a generalization of a recent theorem by Etessami & Yannakakis which says that the qualitative termination for 1-exit RMDPs (which exactly correspond to our $1 \frac{1}{2}$-player BPA games) is decidable in polynomial time. Interestingly, the properties of winning strategies for the extended reachability objectives are quite different from the ones for termination, and new observations are needed to obtain the result. As an application, we derive the EXPTIME-completeness of the model-checking problem for $1 \frac{1}{2}$-player BPA games and qualitative PCTL formulae.

Tomás Brázdil | Antonín Kucera | Vojtech Forejt | Václav Brozek

[1] Igor Walukiewicz. Pushdown Processes: Games and Model-Checking , 2001, Inf. Comput..

[2] Bengt Jonsson,et al. A logic for reasoning about time and reliability , 1990, Formal Aspects of Computing.

[3] Christel Baier,et al. Model checking for a probabilistic branching time logic with fairness , 1998, Distributed Computing.

[4] Martin L. Puterman,et al. Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .

[5] Javier Esparza,et al. Model checking LTL with regular valuations for pushdown systems , 2001, Inf. Comput..

[6] J. Esparza,et al. Model checking probabilistic pushdown automata , 2004, LICS 2004.

[7] Kousha Etessami,et al. Efficient Qualitative Analysis of Classes of Recursive Markov Decision Processes and Simple Stochastic Games , 2006, STACS.

[8] Andrew Hinton,et al. PRISM: A Tool for Automatic Verification of Probabilistic Systems , 2006, TACAS.

[9] Kousha Etessami,et al. Recursive Markov Decision Processes and Recursive Stochastic Games , 2005, ICALP.

[10] Andrea Bianco,et al. Model Checking of Probabalistic and Nondeterministic Systems , 1995, FSTTCS.

[11] William Feller,et al. An Introduction to Probability Theory and Its Applications , 1967 .

[12] Tomás Brázdil,et al. On the Decidability of Temporal Properties of Probabilistic Pushdown Automata , 2005, STACS.

[13] William Feller,et al. An Introduction to Probability Theory and Its Applications , 1951 .

[14] Javier Esparza,et al. Reachability Analysis of Pushdown Automata: Application to Model-Checking , 1997, CONCUR.