Approximate value iteration for a class of deterministic optimal control problems with infinite state and input alphabets

We consider discrete-time deterministic optimal control problems in which termination at some finite, but not predetermined nor a-priori bounded time is mandatory. We characterize the value function as the maximal fixed point of a suitable dynamic programming operator and establish the convergence of both exact and approximate value iteration under very general assumptions, which cover the classical deterministic shortest path problem and its extension to hypergraphs as well as reachability and minimum-time problems for sampled versions of continuous control systems under constraints. In particular, the state and input alphabets are infinite sets or metric spaces, and the plant dynamics may be nonlinear and subject to disturbances and constraints. The optimization is in the minimax (or maximin) sense, the additive, extended real-valued running and terminal costs are undiscounted and may be unbounded and of arbitrary signs, the value function is typically discontinuous, and our results do apply to the maximization of non-negative rewards under hard constraints.

[1]  R. Tyrrell Rockafellar,et al.  Variational Analysis , 1998, Grundlehren der mathematischen Wissenschaften.

[2]  P. Tabuada,et al.  Symbolic approximate time-optimal control , 2010, Syst. Control. Lett..

[3]  Antoine Girard,et al.  Controller synthesis for safety and reachability via approximate bisimulation , 2010, Autom..

[4]  Dimitri P. Bertsekas,et al.  Dynamic Programming and Optimal Control, Two Volume Set , 1995 .

[5]  Gerald Beer,et al.  Topologies on Closed and Closed Convex Sets , 1993 .

[6]  Gunther Reissig,et al.  Computing Abstractions of Nonlinear Systems , 2009, IEEE Transactions on Automatic Control.

[7]  Gunther Reissig,et al.  Abstraction-based solution of optimal stopping problems under uncertainty , 2013, 52nd IEEE Conference on Decision and Control.

[8]  Gunther Reissig,et al.  Classical and Strong Convexity of Sublevel Sets and Application to Attainable Sets of Nonlinear Systems , 2013, SIAM J. Control. Optim..

[9]  Dimitri P. Bertsekas,et al.  Stochastic Shortest Path Problems Under Weak Conditions , 2013 .

[10]  G. Reissig Convexity of reachable sets of nonlinear ordinary differential equations , 2012, ArXiv.

[11]  D. Bertsekas Infinite-Space Shortest Path Problems and Semicontractive Dynamic Programming † , 2014 .

[12]  Alexander Schrijver Paths, flows, matchings , 2003 .

[13]  L. Grüne,et al.  Global Optimal Control of Perturbed Systems , 2007 .

[14]  Dimitri P. Bertsekas,et al.  Robust shortest path planning and semicontractive dynamic programming , 2016, ArXiv.

[15]  Gunther Reissig,et al.  Feedback Refinement Relations for the Synthesis of Symbolic Controllers , 2015, IEEE Transactions on Automatic Control.

[16]  Dimitri P. Bertsekas,et al.  Value and Policy Iterations in Optimal Control and Adaptive Dynamic Programming , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[17]  Gunther Reissig,et al.  Strong Structural Controllability and Observability of Linear Time-Varying Systems , 2013, IEEE Transactions on Automatic Control.

[18]  Lars Grüne,et al.  Approximately optimal nonlinear stabilization with preservation of the Lyapunov function property , 2007, 2007 46th IEEE Conference on Decision and Control.

[19]  G. Reissig,et al.  Local characterization of strongly convex sets , 2012, 1207.4347.