Quantization-Based Adaptive Actor-Critic Tracking Control With Tracking Error Constraints

In this paper, the problem of adaptive actor-critic (AC) tracking control is investigated for a class of continuous-time nonlinear systems with unknown nonlinearities and quantized inputs. Different from the existing results based on reinforcement learning, the tracking error constraints are considered and new critic functions are constructed to improve the performance further. To ensure that the tracking errors keep within the predefined time-varying boundaries, a tracking error transformation technique is used to constitute an augmented error system. Specific critic functions, rather than the long-term cost function, are introduced to supervise the tracking performance and tune the weights of the AC neural networks (NNs). A novel adaptive controller with a special structure is designed to reduce the effect of the NN reconstruction errors, input quantization, and disturbances. Based on the Lyapunov stability theory, the boundedness of the closed-loop signals and the desired tracking performance can be guaranteed. Finally, simulations on two connected inverted pendulums are given to illustrate the effectiveness of the proposed method.

[1]  Shuzhi Sam Ge,et al.  Adaptive tracking control of uncertain MIMO nonlinear systems with input constraints , 2011, Autom..

[2]  Mingyu Wang,et al.  Approximation-Based Adaptive Tracking Control for MIMO Nonlinear Systems With Input Saturation , 2015, IEEE Transactions on Cybernetics.

[3]  Shaocheng Tong,et al.  Reinforcement Learning Design-Based Adaptive Tracking Control With Less Learning Parameters for Nonlinear Discrete-Time MIMO Systems , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[4]  Warren E. Dixon,et al.  Approximate optimal trajectory tracking for continuous-time nonlinear systems , 2013, Autom..

[5]  Robert Babuska,et al.  A Survey of Actor-Critic Reinforcement Learning: Standard and Natural Policy Gradients , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[6]  Warren B. Powell,et al.  Approximate Dynamic Programming - Solving the Curses of Dimensionality , 2007 .

[7]  Tomohisa Hayakawa,et al.  Adaptive quantized control for nonlinear uncertain systems , 2006 .

[8]  Frank L. Lewis,et al.  Reinforcement adaptive learning neural-net-based friction compensation control for high speed and precision , 2000, IEEE Trans. Control. Syst. Technol..

[9]  Shaocheng Tong,et al.  Adaptive Fuzzy Output Feedback Control of MIMO Nonlinear Systems With Unknown Dead-Zone Inputs , 2013, IEEE Transactions on Fuzzy Systems.

[10]  Frank L. Lewis,et al.  Adaptive Optimal Control of Unknown Constrained-Input Systems Using Policy Iteration and Neural Networks , 2013, IEEE Transactions on Neural Networks and Learning Systems.

[11]  Guang-Hong Yang,et al.  Adaptive Backstepping Stabilization of Nonlinear Uncertain Systems With Quantized Input Signal , 2014, IEEE Transactions on Automatic Control.

[12]  Frank L. Lewis,et al.  Online actor critic algorithm to solve the continuous-time infinite horizon optimal control problem , 2009, 2009 International Joint Conference on Neural Networks.

[13]  Charalampos P. Bechlioulis,et al.  Robust Adaptive Control of Feedback Linearizable MIMO Nonlinear Systems With Prescribed Performance , 2008, IEEE Transactions on Automatic Control.

[14]  Richard S. Sutton,et al.  Neuronlike adaptive elements that can solve difficult learning control problems , 1983, IEEE Transactions on Systems, Man, and Cybernetics.

[15]  Xin Zhang,et al.  Data-Driven Robust Approximate Optimal Tracking Control for Unknown General Nonlinear Systems Using Adaptive Dynamic Programming Method , 2011, IEEE Transactions on Neural Networks.

[16]  Qinmin Yang,et al.  Reinforcement Learning Controller Design for Affine Nonlinear Discrete-Time Systems using Online Approximators , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[17]  N. Elia,et al.  Quantized feedback stabilization of non-linear affine systems , 2004 .

[18]  Shuzhi Sam Ge,et al.  Adaptive dynamic surface control of nonlinear systems with unknown dead zone in pure feedback form , 2008, Autom..

[19]  Frank L. Lewis,et al.  Optimal control of nonlinear discrete time-varying systems using a new neural network approximation structure , 2015, Neurocomputing.

[20]  Zhong-Ping Jiang,et al.  Adaptive Dynamic Programming and Adaptive Optimal Output Regulation of Linear Systems , 2016, IEEE Transactions on Automatic Control.

[21]  Shuzhi Sam Ge,et al.  Adaptive neural control of MIMO nonlinear state time-varying delay systems with unknown dead-zones and gain signs , 2007, Autom..

[22]  Shaocheng Tong,et al.  Fuzzy adaptive quantized output feedback tracking control for switched nonlinear systems with input quantization , 2016, Fuzzy Sets Syst..

[23]  Cong Wang,et al.  Dynamic Learning From Adaptive Neural Network Control of a Class of Nonaffine Nonlinear Systems , 2014, IEEE Transactions on Neural Networks and Learning Systems.

[24]  Frank L. Lewis,et al.  Optimal tracking control of nonlinear partially-unknown constrained-input systems using integral reinforcement learning , 2014, Autom..

[25]  Kao-Shing Hwang,et al.  Reinforcement learning to adaptive control of nonlinear systems , 2003, IEEE Trans. Syst. Man Cybern. Part B.

[26]  Frank L. Lewis,et al.  A novel actor-critic-identifier architecture for approximate optimal control of uncertain nonlinear systems , 2013, Autom..

[27]  Lihua Xie,et al.  The sector bound approach to quantized feedback control , 2005, IEEE Transactions on Automatic Control.

[28]  Shuzhi Sam Ge,et al.  Robust Adaptive Control for a Class of MIMO Nonlinear Systems by State and Output Feedback , 2014, IEEE Transactions on Automatic Control.

[29]  Nicola Elia,et al.  Stabilization of linear systems with limited information , 2001, IEEE Trans. Autom. Control..

[30]  Frank L. Lewis,et al.  Reinforcement Q-learning for optimal tracking control of linear discrete-time systems with unknown dynamics , 2014, Autom..

[31]  Qiuye Sun,et al.  Adaptive critic design-based robust neural network control for nonlinear distributed parameter systems with unknown dynamics , 2015, Neurocomputing.

[32]  C. L. Philip Chen,et al.  Fuzzy Adaptive Quantized Control for a Class of Stochastic Nonlinear Uncertain Systems , 2016, IEEE Transactions on Cybernetics.