Off-policy reinforcement learning for robust control of discrete-time uncertain linear systems

In this paper, an off-policy reinforcement learning method is developed for the robust stabilizing controller design of discrete-time uncertain linear systems. The proposed robust control design consists of two steps. First, the robust control problem is transformed to an optimal control problem. Second, the off-policy RL method is used to design the optimal control policy which guarantees the robust stability of the original system with uncertainty. The condition for the equivalence between the robust control problem and the optimal control problem is discussed. The off-policy does not require any knowledge of the system knowledge and efficiently utilize the data collected from on-line to improve the performance of approximate optimal control policy in each iteration successively. Finally, a simulation example is carried out to verify the effectiveness of the presented algorithm for the robust control problem of discrete-time linear system with uncertainty.

[1]  Jun Yang,et al.  Exponential synchronization for stochastic neural networks with multi-delayed and Markovian switching via adaptive feedback control , 2015, Commun. Nonlinear Sci. Numer. Simul..

[2]  S. F. R. F. Stengel 3 Model-Based Adaptive Critic Designs , 2004 .

[3]  Bor-Sen Chen,et al.  Robust linear controller design: Time domain approach , 1987 .

[4]  Tingwen Huang,et al.  Off-Policy Reinforcement Learning for $ H_\infty $ Control Design , 2013, IEEE Transactions on Cybernetics.

[5]  J. Cruz,et al.  RELATIONSHIP BETWEEN SENSITIVITY AND STABILITY OF MULTIVARIABLE FEEDBACK SYSTEMS. , 1981 .

[6]  Victor M. Becerra,et al.  Optimal control , 2008, Scholarpedia.

[7]  R. D. Brandt,et al.  Robust control of nonlinear systems: compensating for uncertainty , 1992 .

[8]  Paul J. Werbos,et al.  Approximate dynamic programming for real-time control and neural modeling , 1992 .

[9]  Zhong-Ping Jiang,et al.  Computational adaptive optimal control for continuous-time linear systems with completely unknown dynamics , 2012, Autom..

[10]  Frank L. Lewis,et al.  Adaptive optimal control for continuous-time linear systems based on policy iteration , 2009, Autom..

[11]  Derong Liu,et al.  Guaranteed cost neural tracking control for a class of uncertain nonlinear systems using adaptive dynamic programming , 2016, Neurocomputing.

[12]  Derong Liu,et al.  Data-based robust optimal control of continuous-time affine nonlinear systems with matched uncertainties , 2016, Inf. Sci..

[13]  George N. Saridis,et al.  An Approximation Theory of Optimal Control for Trainable Manipulators , 1979, IEEE Transactions on Systems, Man, and Cybernetics.

[14]  Frank L. Lewis,et al.  $ {H}_{ {\infty }}$ Tracking Control of Completely Unknown Continuous-Time Systems via Off-Policy Reinforcement Learning , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[15]  Indra Narayan Kar,et al.  Stabilization of Uncertain Discrete-Time Linear System With Limited Communication , 2017, IEEE Transactions on Automatic Control.

[16]  Derong Liu,et al.  Reinforcement-Learning-Based Robust Controller Design for Continuous-Time Uncertain Nonlinear Systems Subject to Input Constraints , 2015, IEEE Transactions on Cybernetics.

[17]  Derong Liu,et al.  Neural-Network-Based Online HJB Solution for Optimal Robust Guaranteed Cost Control of Continuous-Time Uncertain Nonlinear Systems , 2014, IEEE Transactions on Cybernetics.

[18]  Frank L. Lewis,et al.  Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach , 2005, Autom..

[19]  D. Kleinman On an iterative technique for Riccati equation computations , 1968 .

[20]  Frank L. Lewis,et al.  2009 Special Issue: Neural network approach to continuous-time direct adaptive optimal control for partially unknown nonlinear systems , 2009 .

[21]  Derong Liu,et al.  Neural-Network-Based Distributed Adaptive Robust Control for a Class of Nonlinear Multiagent Systems With Time Delays and External Noises , 2016, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[22]  Jun Zhou,et al.  Asymptotical synchronization for delayed stochastic neural networks with uncertainty via adaptive control , 2016 .

[23]  Frank L. Lewis,et al.  H∞ control of linear discrete-time systems: Off-policy reinforcement learning , 2017, Autom..

[24]  Derong Liu,et al.  Neural-network-based robust optimal control design for a class of uncertain nonlinear systems via adaptive dynamic programming , 2014, Inf. Sci..

[25]  Frank L. Lewis,et al.  Optimal model-free output synchronization of heterogeneous systems using off-policy reinforcement learning , 2016, Autom..

[26]  Yuhua Xu,et al.  Adaptive synchronization for stochastic T-S fuzzy neural networks with time-delay and Markovian jumping parameters , 2013, Neurocomputing.

[27]  Feng-Yi Lin Robust Control Design: An Optimal Control Approach , 2007 .

[28]  Qinglai Wei,et al.  Optimal control of unknown nonaffine nonlinear discrete-time systems based on adaptive dynamic programming , 2012, Autom..