Learning-Based Distributionally Robust Model Predictive Control of Markovian Switching Systems with Guaranteed Stability and Recursive Feasibility

We present a data-driven model predictive control scheme for chance-constrained Markovian switching systems with unknown switching probabilities. Using samples of the underlying Markov chain, ambiguity sets of transition probabilities are estimated which include the true conditional probability distributions with high probability. These sets are updated online and used to formulate a time-varying, risk-averse optimal control problem. We prove recursive feasibility of the resulting MPC scheme and show that the original chance constraints remain satisfied at every time step. Furthermore, we show that under sufficient decrease of the confidence levels, the resulting MPC scheme renders the closed-loop system mean-square stable with respect to the true-but-unknown distributions, while remaining less conservative than a fully robust approach.

[1]  Marco Pavone,et al.  A Framework for Time-Consistent, Risk-Sensitive Model Predictive Control: Theory and Algorithms , 2019, IEEE Transactions on Automatic Control.

[2]  G. Pflug,et al.  Multistage Stochastic Optimization , 2014 .

[3]  Milan Korda,et al.  Strongly feasible stochastic model predictive control , 2011, IEEE Conference on Decision and Control and European Control Conference.

[4]  Jean-Philippe Vial,et al.  Robust Optimization , 2021, ICORES.

[5]  Alberto Bemporad,et al.  Stochastic model predictive control for constrained discrete-time Markovian switching systems , 2014, Autom..

[6]  Marko Bacic,et al.  Model predictive control , 2003 .

[7]  Daniel Kuhn,et al.  Data-driven distributionally robust optimization using the Wasserstein metric: performance guarantees and tractable reformulations , 2015, Mathematical Programming.

[8]  Vikram Krishnamurthy,et al.  Partially Observed Markov Decision Processes: From Filtering to Controlled Sensing , 2016, 1604.08127.

[9]  Panagiotis Patrinos,et al.  Learning-Based Risk-Averse Model Predictive Control for Adaptive Cruise Control with Stochastic Driver Models , 2020, IFAC-PapersOnLine.

[10]  R. P. Marques,et al.  Discrete-Time Markov Jump Linear Systems , 2004, IEEE Transactions on Automatic Control.

[11]  Alberto Bemporad,et al.  Risk-averse model predictive control , 2017, Autom..

[12]  Alexander Shapiro,et al.  Lectures on Stochastic Programming: Modeling and Theory , 2009 .

[13]  Lukas Hewing,et al.  Learning-Based Model Predictive Control: Toward Safe Learning in Control , 2020, Annu. Rev. Control. Robotics Auton. Syst..

[14]  Sanjay Mehrotra,et al.  Distributionally Robust Optimization: A Review , 2019, ArXiv.

[15]  Zhengtao Ding,et al.  Reinforcement learning and adaptive optimization of a class of Markov jump systems with completely unknown dynamic information , 2019, Neural Comput. Appl..

[16]  André da Motta Salles Barreto,et al.  Online TD(A) for discrete-time Markov jump linear systems , 2018, 2018 IEEE Conference on Decision and Control (CDC).

[17]  Daniel Kuhn,et al.  Distributionally Robust Control of Constrained Stochastic Systems , 2016, IEEE Transactions on Automatic Control.

[18]  Andrzej Ruszczynski,et al.  Risk-averse dynamic programming for Markov decision processes , 2010, Math. Program..

[19]  Alberto Bemporad,et al.  Scenario-based model predictive control of stochastic constrained linear systems , 2009, Proceedings of the 48h IEEE Conference on Decision and Control (CDC) held jointly with 2009 28th Chinese Control Conference.

[20]  Kerem Ugurlu,et al.  Robust optimal control using conditional risk mappings in infinite horizon , 2018, J. Comput. Appl. Math..

[21]  Dimitri P. Bertsekas,et al.  Dynamic Programming and Optimal Control, Two Volume Set , 1995 .

[22]  Insoon Yang,et al.  Wasserstein Distributionally Robust Stochastic Control: A Data-Driven Approach , 2018, IEEE Transactions on Automatic Control.

[23]  Arkadi Nemirovski,et al.  On safe tractable approximations of chance constraints , 2012, Eur. J. Oper. Res..

[24]  Marco Pavone,et al.  A framework for time-consistent, risk-averse model predictive control: Theory and algorithms , 2014, 2014 American Control Conference.

[25]  Pantelis Sopasakis,et al.  Safe Learning-Based Control of Stochastic Jump Linear Systems: a Distributionally Robust Approach , 2019, 2019 IEEE 58th Conference on Decision and Control (CDC).