On the Controller Synthesis for Finite-State Markov Decision Processes

We study the problem of effective controller synthesis for finite-state Markov decision processes (MDPs) and the class of properties definable in the logic PCTL extended with long-run average propositions. We show that the existence of such a controller is decidable, and we give an algorithm which computes the controller if it exists. We also address the issue of "controller robustness", i.e., the problem whether there is a controller which still guarantees the satisfaction of a given property when the probabilities in the considered MDP slightly deviate from their original values. From a practical point of view, this is an important aspect since the probabilities are often determined empirically and hence they are inherently imprecise. We show that the existence of robust controllers is also decidable, and that such controllers are effectively computable if they exist.

[1]  Bengt Jonsson,et al.  A logic for reasoning about time and reliability , 1990, Formal Aspects of Computing.

[2]  Laurent El Ghaoui,et al.  Robustness in Markov Decision Problems with Uncertain Transition Matrices , 2003, NIPS.

[3]  Uri Zwick,et al.  The Complexity of Mean Payoff Games , 1995, COCOON.

[4]  Dima Grigoriev,et al.  Complexity of Deciding Tarski Algebra , 1988, J. Symb. Comput..

[5]  Uri Zwick,et al.  The Complexity of Mean Payoff Games on Graphs , 1996, Theor. Comput. Sci..

[6]  Christel Baier,et al.  Controller Synthesis for Probabilistic Systems , 2004, IFIP TCS.

[7]  Luca de Alfaro,et al.  How to Specify and Verify the Long-Run Average Behavior of Probabilistic Systems , 1998, LICS.

[8]  A. ´. in On the Controller Synthesis for Finite-State Markov Decision Processes , 2005 .

[9]  Nancy A. Lynch,et al.  Probabilistic Simulations for Probabilistic Processes , 1994, Nord. J. Comput..

[10]  Wolfgang Thomas,et al.  Infinite Games and Verification (Extended Abstract of a Tutorial) , 2002, CAV.

[11]  Thomas A. Henzinger,et al.  The Element of Surprise in Timed Games , 2003, CONCUR.

[12]  D. Grigor'ev Complexity of deciding Tarski algebra , 1988 .

[13]  A. Tarski A Decision Method for Elementary Algebra and Geometry , 2023 .

[14]  Krishnendu Chatterjee,et al.  Semiperfect-Information Games , 2005, FSTTCS.

[15]  Deepak D'Souza,et al.  Timed Control with Partial Observability , 2003, CAV.

[16]  Christel Baier,et al.  Stochastic game logic , 2007, Fourth International Conference on the Quantitative Evaluation of Systems (QEST 2007).

[17]  Kim G. Larsen,et al.  Bisimulation through Probabilistic Testing , 1991, Inf. Comput..

[18]  Mihalis Yannakakis,et al.  The complexity of probabilistic verification , 1995, JACM.

[19]  Tomás Brázdil,et al.  Stochastic games with branching-time winning objectives , 2006, 21st Annual IEEE Symposium on Logic in Computer Science (LICS'06).

[20]  Martin L. Puterman,et al.  Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .