Robust control of uncertain Markov Decision Processes with temporal logic specifications

We present a method for designing a robust control policy for an uncertain system subject to temporal logic specifications. The system is modeled as a finite Markov Decision Process (MDP) whose transition probabilities are not exactly known but are known to belong to a given uncertainty set. A robust control policy is generated for the MDP that maximizes the worst-case probability of satisfying the specification over all transition probabilities in this uncertainty set. To this end, we use a procedure from probabilistic model checking to combine the system model with an automaton representing the specification. This new MDP is then transformed into an equivalent form that satisfies assumptions for stochastic shortest path dynamic programming. A robust version of dynamic programming solves for a ε-suboptimal robust control policy with time complexity O(log1/ε) times that for the non-robust case.

[1]  Laurent El Ghaoui,et al.  Robust Control of Markov Decision Processes with Uncertain Transition Matrices , 2005, Oper. Res..

[2]  Dimitri P. Bertsekas,et al.  Dynamic Programming and Optimal Control, Two Volume Set , 1995 .

[3]  Robert Givan,et al.  Bounded-parameter Markov decision processes , 2000, Artif. Intell..

[4]  Krishnendu Chatterjee,et al.  Model-Checking omega-Regular Properties of Interval Markov Chains , 2008, FoSSaCS.

[5]  Mihalis Yannakakis,et al.  The complexity of probabilistic verification , 1995, JACM.

[6]  E. Allen Emerson,et al.  Temporal and Modal Logic , 1991, Handbook of Theoretical Computer Science, Volume B: Formal Models and Sematics.

[7]  Calin Belta,et al.  A Fully Automated Framework for Control of Linear Systems from Temporal Logic Specifications , 2008, IEEE Transactions on Automatic Control.

[8]  Andrew Y. Ng,et al.  Solving Uncertain Markov Decision Processes , 2001 .

[9]  John M Gozzolino,et al.  MARKOVIAN DECISION PROCESSES WITH UNCERTAIN TRANSITION PROBABILITIES , 1965 .

[10]  Di Wu,et al.  Reachability analysis of uncertain systems using bounded-parameter Markov decision processes , 2008, Artif. Intell..

[11]  Maria Domenica Di Benedetto,et al.  Approximate Abstractions of Stochastic Hybrid Systems , 2011, IEEE Transactions on Automatic Control.

[12]  Rajeev Alur,et al.  Deterministic generators and games for Ltl fragments , 2004, TOCL.

[13]  Ufuk Topcu,et al.  Receding Horizon Temporal Logic Planning , 2012, IEEE Transactions on Automatic Control.

[14]  Marta Z. Kwiatkowska,et al.  PRISM 4.0: Verification of Probabilistic Real-Time Systems , 2011, CAV.

[15]  Calin Belta,et al.  MDP optimal control under temporal logic constraints , 2011, IEEE Conference on Decision and Control and European Control Conference.

[16]  Olivier Buffet,et al.  Reachability analysis for uncertain SSPs , 2005, 17th IEEE International Conference on Tools with Artificial Intelligence (ICTAI'05).

[17]  George J. Pappas,et al.  Discrete abstractions of hybrid systems , 2000, Proceedings of the IEEE.

[18]  Christel Baier,et al.  Experiments with deterministic omega-automata for formulas of linear temporal logic , 2006, Theor. Comput. Sci..

[19]  L. M. Bujorianu,et al.  Approximate Abstractions of Stochastic Hybrid Systems , 2008 .

[20]  Arch W. Naylor,et al.  Linear Operator Theory in Engineering and Science , 1971 .

[21]  Ufuk Topcu,et al.  Receding horizon control for temporal logic specifications , 2010, HSCC '10.

[22]  Christel Baier,et al.  Principles of model checking , 2008 .

[23]  Calin Belta,et al.  A probabilistic approach for control of a stochastic system from LTL specifications , 2009, Proceedings of the 48h IEEE Conference on Decision and Control (CDC) held jointly with 2009 28th Chinese Control Conference.

[24]  Paulo Tabuada,et al.  Robust discrete synthesis against unspecified disturbances , 2011, HSCC '11.

[25]  Stephen E. Fienberg,et al.  Testing Statistical Hypotheses , 2005 .

[26]  C. Belta,et al.  Control of Markov decision processes from PCTL specifications , 2011, Proceedings of the 2011 American Control Conference.

[27]  George J. Pappas,et al.  Discrete abstraction of stochastic nonlinear systems: A bisimulation function approach , 2010, Proceedings of the 2010 American Control Conference.

[28]  Benjamin Johnson,et al.  Probabilistic Analysis of Correctness of High-Level Robot Behavior with Sensor Error , 2011, Robotics: Science and Systems.

[29]  Zohar Manna,et al.  Formal verification of probabilistic systems , 1997 .

[30]  Christel Baier,et al.  PROBMELA: a modeling language for communicating probabilistic processes , 2004, Proceedings. Second ACM and IEEE International Conference on Formal Methods and Models for Co-Design, 2004. MEMOCODE '04..

[31]  Calin Belta,et al.  LTL Control in Uncertain Environments with Probabilistic Satisfaction Guarantees , 2011, ArXiv.

[32]  Hadas Kress-Gazit,et al.  Where's Waldo? Sensor-Based Temporal Logic Motion Planning , 2007, Proceedings 2007 IEEE International Conference on Robotics and Automation.