Adaptive Critic Design for Pure-Feedback Discrete-Time MIMO Systems Preceded by Unknown Backlashlike Hysteresis

This paper addresses adaptive critic design (ACD) for a class of uncertain multi-input multi-output (MIMO) nonlinear discrete-time systems preceded by unknown backlashlike hysteresis. The systems considered are in block-triangular pure-feedback form, with nonaffine functions and couplings between states and inputs, which makes ACD-based optimal control difficult. To address this, the mean value theorem is employed to transform the original systems into input–output models. Based on a reinforcement learning algorithm, the optimal control strategy is established with an actor-critic structure, so that the stability of the systems is ensured and the performance index is minimized. In contrast to previous results, the main contributions are: 1) an ACD framework is built for the first time for such MIMO systems with unknown hysteresis and 2) an adaptive auxiliary signal is developed to compensate for the influence of hysteresis. Finally, a numerical study demonstrates the effectiveness of the proposed method.
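
To make the input nonlinearity concrete, the following minimal sketch simulates a backlashlike hysteresis of the kind that precedes the plant. The abstract does not state the hysteresis model, so the sketch assumes the widely used continuous-time form dw/dt = alpha*|dv/dt|*(c*v - w) + b1*dv/dt, discretized here with an explicit Euler step. The function names and the parameter values alpha, c, and b1 are illustrative assumptions and are not taken from the paper.

```python
import math


def backlash_like_step(w, v_prev, v_next, alpha=1.0, c=3.1635, b1=0.345):
    """One explicit-Euler step of an assumed backlashlike hysteresis model.

    Continuous-time form (assumed, not quoted from the paper):
        dw/dt = alpha * |dv/dt| * (c * v - w) + b1 * dv/dt
    w is the hysteresis output, v the actuator input.
    """
    dv = v_next - v_prev
    return w + alpha * abs(dv) * (c * v_prev - w) + b1 * dv


def simulate(inputs, w0=0.0, **params):
    """Propagate the hysteresis output along a sequence of input samples."""
    outputs = [w0]
    for v_prev, v_next in zip(inputs, inputs[1:]):
        outputs.append(backlash_like_step(outputs[-1], v_prev, v_next, **params))
    return outputs


if __name__ == "__main__":
    # Two periods of a sinusoidal input, 200 samples per period (illustrative).
    v = [2.0 * math.sin(2.0 * math.pi * k / 200) for k in range(400)]
    w = simulate(v)
    # v is approximately zero at k = 200 (rising) and k = 300 (falling);
    # the outputs differ, which is the hysteresis loop.
    print("rising branch :", w[200])
    print("falling branch:", w[300])
```

The output depends on the direction of the input as well as on its value, which is why a controller that sees only the commanded input cannot cancel the effect directly and a compensating signal such as the adaptive auxiliary signal mentioned above is needed.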
