Risk-sensitive safety specifications for stochastic systems using Conditional Value-at-Risk

This paper proposes a safety analysis method that facilitates a tunable balance between worst-case and risk-neutral perspectives. First, we define a risk-sensitive safe set to specify the degree of safety attained by a stochastic system. This set is defined as a sublevel set of the solution to an optimal control problem that is expressed using the Conditional Value-at-Risk (CVaR) measure. This problem does not satisfy Bellman's Principle, thus our next contribution is to show how risk-sensitive safe sets can be under-approximated by the solution to a CVaR-Markov Decision Process. Our third and fourth contributions are to show that a value iteration algorithm solves the reduced problem and enables tractable policy synthesis for a class of linear systems. Fifth, we develop a realistic numerical example of a stormwater system to show that our approach can be applied to non-linear systems. Finally, we compare the CVaR criterion to the exponential disutility criterion. The latter allocates control effort evenly across the cost distribution to reduce variance, while the CVaR criterion focuses control effort on a given worst-case quantile--where it matters most for safety.

[1]  Alexandre M. Bayen,et al.  A time-dependent Hamilton-Jacobi formulation of reachable sets for continuous dynamic games , 2005, IEEE Transactions on Automatic Control.

[2]  P. Abbeel,et al.  A Learning-Based Approach to Safety for Uncertain Robotic Systems , 2018 .

[3]  John Lygeros,et al.  Discrete time stochastic hybrid dynamical games: Verification & controller synthesis , 2011, IEEE Conference on Decision and Control and European Control Conference.

[4]  Daniel Kuhn,et al.  From Infinite to Finite Programs: Explicit Error Bounds with Applications to Approximate Dynamic Programming , 2018, SIAM J. Optim..

[5]  Alexander Shapiro,et al.  On a time consistency concept in risk averse multistage stochastic programming , 2009, Oper. Res. Lett..

[6]  Alexander Shapiro Interchangeability principle and dynamic equations in risk averse stochastic programming , 2017, Oper. Res. Lett..

[7]  Insoon Yang,et al.  Risk-Aware Motion Planning and Control Using CVaR-Constrained Optimization , 2019, IEEE Robotics and Automation Letters.

[8]  Insoon Yang,et al.  Optimal Control of Conditional Value-at-Risk in Continuous Time , 2015, SIAM J. Control. Optim..

[9]  Georg Ch. Pflug,et al.  Time-inconsistent multistage stochastic programs: Martingale bounds , 2016, Eur. J. Oper. Res..

[10]  Shie Mannor,et al.  Risk-Sensitive and Robust Decision-Making: a CVaR Optimization Approach , 2015, NIPS.

[11]  John Lygeros,et al.  Probabilistic reachability and safety for controlled discrete time stochastic hybrid systems , 2008, Autom..

[12]  John Lygeros,et al.  Hamilton–Jacobi Formulation for Reach–Avoid Differential Games , 2009, IEEE Transactions on Automatic Control.

[13]  Claire J. Tomlin,et al.  Annual Review of Control , Robotics , and Autonomous Systems Hamilton – Jacobi Reachability : Some Recent Theoretical Advances and Applications in Unmanned Airspace Management , 2019 .

[14]  P. Whittle Risk-Sensitive Optimal Control , 1990 .

[15]  Andrzej Ruszczynski,et al.  Risk-averse dynamic programming for Markov decision processes , 2010, Math. Program..

[16]  John Lygeros,et al.  A stochastic games framework for verification and control of discrete time stochastic hybrid systems , 2013, Autom..

[17]  Marco Pavone,et al.  A Framework for Time-Consistent, Risk-Sensitive Model Predictive Control: Theory and Algorithms , 2019, IEEE Transactions on Automatic Control.

[18]  Takayuki Osogami,et al.  Robustness and risk-sensitivity in Markov decision processes , 2012, NIPS.

[19]  Mo Chen,et al.  Decomposition of Reachable Sets and Tubes for a Class of Nonlinear Systems , 2016, IEEE Transactions on Automatic Control.

[20]  D. D. Sworder,et al.  Minimax Control of Discrete Time Stochastic Systems , 1964 .

[21]  R. Rockafellar,et al.  Conditional Value-at-Risk for General Loss Distributions , 2001 .

[22]  Dimitri P. Bertsekas,et al.  Dynamic Programming and Optimal Control, Two Volume Set , 1995 .

[23]  William B. Haskell,et al.  A Convex Analytic Approach to Risk-Aware Markov Decision Processes , 2015, SIAM J. Control. Optim..

[24]  Kevin M. Smith,et al.  Reachability Analysis as a Design Tool for Stormwater Systems , 2018, 2018 IEEE Conference on Technologies for Sustainability (SusTech).

[25]  David Heath,et al.  Coherent multiperiod risk adjusted values and Bellman’s principle , 2007, Ann. Oper. Res..

[26]  Stephen P. Boyd,et al.  Convex Optimization , 2004, Algorithms and Theory of Computation Handbook.

[27]  Jakob Kisiala,et al.  Conditional Value-at-Risk: Theory and Applications , 2015, 1511.00140.

[28]  Vivek S. Borkar,et al.  Risk-constrained Markov decision processes , 2010, 49th IEEE Conference on Decision and Control (CDC).

[29]  Dudley,et al.  Real Analysis and Probability: Measurability: Borel Isomorphism and Analytic Sets , 2002 .

[30]  Kevin Barraclough,et al.  I and i , 2001, BMJ : British Medical Journal.

[31]  Insoon Yang,et al.  Safety-Aware Optimal Control of Stochastic Systems Using Conditional Value-at-Risk , 2018, 2018 Annual American Control Conference (ACC).

[32]  Alexander Shapiro,et al.  Lectures on Stochastic Programming: Modeling and Theory , 2009 .

[33]  Insoon Yang,et al.  A dynamic game approach to distributionally robust safety specifications for stochastic systems , 2017, Autom..

[34]  Igor Mezic,et al.  Linear predictors for nonlinear dynamical systems: Koopman operator meets model predictive control , 2016, Autom..

[35]  Stephen P. Boyd,et al.  Graph Implementations for Nonsmooth Convex Programs , 2008, Recent Advances in Learning and Control.

[36]  Biagio Ricceri,et al.  Minimax theory and applications , 1998 .

[37]  Georg Ch. Pflug,et al.  Time-Consistent Decisions and Temporal Decomposition of Coherent Risk Functionals , 2016, Math. Oper. Res..

[38]  V. Barbu,et al.  Convexity and optimization in banach spaces , 1972 .

[39]  R. E. Mortensen,et al.  A stochastic computer model for heating and cooling loads , 1988 .

[40]  Kevin M. Smith,et al.  A Risk-Sensitive Finite-Time Reachability Approach for Safety of Stochastic Dynamic Systems , 2019, 2019 American Control Conference (ACC).

[41]  John Lygeros,et al.  Verification of discrete time stochastic hybrid systems: A stochastic reach-avoid decision problem , 2010, Autom..