Fault-Tolerant Controller Design for a Class of Nonlinear MIMO Discrete-Time Systems via Online Reinforcement Learning Algorithm

This paper concentrates on the reinforcement learning (RL)-based fault-tolerant control (FTC) problem for a class of multiple-input-multiple-output (MIMO) nonlinear discrete-time systems. Both incipient faults and abrupt faults are taken into account. Based on the approximation ability of neural networks (NNs), an RL algorithm is incorporated into the FTC strategy, in which an action network is developed to generate the optimal control signal and a critic network is used to approximate the novel cost function, respectively. Compared with the existing results, a novel fault tolerant controller is proposed based on an RL method to reduce a long-term performance index after a fault occurs. The meaning of minimizing the performance index after a fault occurs in an MIMO system is that waste will be decreased and energy will be saved. Note that the weights of NNs are adjusted online rather than offline. Then, it is proven that the adaptive parameters, tracking errors, and optimal control signals are uniformly bounded even in the presence of the unknown fault dynamics. Finally, a numerical simulation is provided to show the effectiveness of the proposed FTC approach.

[1]  Abdel Aitouche,et al.  Fuzzy Scheduler Fault-Tolerant Control for Wind Energy Conversion Systems , 2014, IEEE Transactions on Control Systems Technology.

[2]  Steven X. Ding,et al.  Optimal fault detection for linear discrete time-varying systems , 2010, Autom..

[3]  Wei Wang,et al.  Adaptive actuator failure compensation control of uncertain nonlinear systems with guaranteed transient performance , 2010, Autom..

[4]  Huaguang Zhang,et al.  A Novel Infinite-Time Optimal Tracking Control Scheme for a Class of Discrete-Time Nonlinear Systems via the Greedy HDP Iteration Algorithm , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[5]  Hao Xu,et al.  Stochastic Optimal Controller Design for Uncertain Nonlinear Networked Control System via Neuro Dynamic Programming , 2013, IEEE Transactions on Neural Networks and Learning Systems.

[6]  Marios M. Polycarpou,et al.  Distributed Fault Accommodation for a Class of Interconnected Nonlinear Systems With Partial Communication , 2011, IEEE Transactions on Automatic Control.

[7]  Heng Wang,et al.  Fault Detection for a Class of Uncertain State-Feedback Control Systems , 2010, IEEE Transactions on Control Systems Technology.

[8]  Huaguang Zhang,et al.  Neural-Network-Based Near-Optimal Control for a Class of Discrete-Time Affine Nonlinear Systems With Control Constraints , 2009, IEEE Transactions on Neural Networks.

[9]  Huaguang Zhang,et al.  Adaptive NN fault-tolerant control for discrete-time systems in triangular forms with actuator fault , 2015, Neurocomputing.

[10]  Shaocheng Tong,et al.  Observer-Based Adaptive Fuzzy Tracking Control of MIMO Stochastic Nonlinear Systems With Unknown Control Directions and Unknown Dead Zones , 2015, IEEE Transactions on Fuzzy Systems.

[11]  Bing Chen,et al.  Adaptive Neural Tracking Control for a Class of Nonstrict-Feedback Stochastic Nonlinear Systems With Unknown Backlash-Like Hysteresis , 2014, IEEE Transactions on Neural Networks and Learning Systems.

[12]  Frank L. Lewis,et al.  Online policy iteration based algorithms to solve the continuous-time infinite horizon optimal control problem , 2009, 2009 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning.

[13]  Huaguang Zhang,et al.  Fault tolerant synchronization for a class of complex interconnected neural networks with delay , 2014 .

[14]  Zhongke Shi,et al.  Reinforcement Learning Output Feedback NN Control Using Deterministic Learning Technique , 2014, IEEE Transactions on Neural Networks and Learning Systems.

[15]  Qinmin Yang,et al.  Reinforcement Learning Controller Design for Affine Nonlinear Discrete-Time Systems using Online Approximators , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[16]  Xiaobo Li,et al.  Characterization of ℋ-ℋ- index for linear time-varying systems , 2013, Autom..

[17]  Kyriakos G. Vamvoudakis,et al.  Optimal adaptive control for unknown systems using output feedback by reinforcement learning methods , 2010, IEEE ICCA 2010.

[18]  F. Lewis,et al.  Reinforcement Learning and Feedback Control: Using Natural Decision Methods to Design Optimal Adaptive Controllers , 2012, IEEE Control Systems.

[19]  Dimitri P. Bertsekas,et al.  Dynamic Programming and Optimal Control, Two Volume Set , 1995 .

[20]  Darwin G. Caldwell,et al.  Online discovery of AUV control policies to overcome thruster failures , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[21]  Yongming Li,et al.  Observer-Based Adaptive Decentralized Fuzzy Fault-Tolerant Control of Nonlinear Large-Scale Systems With Actuator Failures , 2014, IEEE Transactions on Fuzzy Systems.

[22]  Bin Jiang,et al.  Fault-Tolerant Control for T–S Fuzzy Systems With Application to Near-Space Hypersonic Vehicle With Actuator Faults , 2012, IEEE Transactions on Fuzzy Systems.

[23]  Yi Zhang,et al.  A self-learning call admission control scheme for CDMA cellular networks , 2005, IEEE Transactions on Neural Networks.

[24]  Sarangapani Jagannathan,et al.  A Model-Based Fault-Detection and Prediction Scheme for Nonlinear Multivariable Discrete-Time Systems With Asymptotic Stability Guarantees , 2010, IEEE Transactions on Neural Networks.

[25]  Guang-Hong Yang,et al.  Adaptive Fault-Tolerant Tracking Control Against Actuator Faults With Application to Flight Control , 2006, IEEE Transactions on Control Systems Technology.

[26]  Derong Liu,et al.  Discrete-time online learning control for a class of unknown nonaffine nonlinear systems using reinforcement learning , 2014, Neural Networks.

[27]  Yongming Li,et al.  Observer-Based Adaptive Fuzzy Backstepping Dynamic Surface Control for a Class of MIMO Nonlinear Systems , 2011, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[28]  Huijun Gao,et al.  Fault-tolerant control of Markovian jump stochastic systems via the augmented sliding mode observer approach , 2014, Autom..

[29]  Jiang Bin,et al.  Backstepping-based active fault-tolerant control for a class of uncertain SISO nonlinear systems , 2012 .

[30]  Ruiyun Qi,et al.  A discrete-time parameter estimation based adaptive actuator failure compensation control scheme , 2011 .

[31]  Jin Zhang,et al.  Adaptive neural network control for a class of MIMO nonlinear systems with disturbances in discrete-time , 2004, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[32]  A. Zolghadri,et al.  Advanced model-based Fault Detection and Diagnosis for civil aircraft structural design optimization , 2013, 2013 Conference on Control and Fault-Tolerant Systems (SysTol).

[33]  Dan Zhang,et al.  Fault-Tolerant Control for Discrete-Time Switched Linear Systems with Time-Varying Delay and Actuator Saturation , 2011, Journal of Optimization Theory and Applications.

[34]  Shaocheng Tong,et al.  Fuzzy Adaptive Output Feedback Control of MIMO Nonlinear Systems With Partial Tracking Errors Constrained , 2015, IEEE Transactions on Fuzzy Systems.

[35]  M. Krstić,et al.  Stochastic nonlinear stabilization—I: a backstepping design , 1997 .

[36]  Gary G. Yen,et al.  Improving the performance of globalized dual heuristic programming for fault tolerant control through an online learning supervisor , 2005, IEEE Transactions on Automation Science and Engineering.

[37]  Denis Efimov,et al.  Optimization of fault detection performance for a class of nonlinear systems , 2012 .

[38]  Shaocheng Tong,et al.  Reinforcement Learning Design-Based Adaptive Tracking Control With Less Learning Parameters for Nonlinear Discrete-Time MIMO Systems , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[39]  Shaocheng Tong,et al.  Fuzzy Adaptive Actuator Failure Compensation Control of Uncertain Stochastic Nonlinear Systems With Unmodeled Dynamics , 2014, IEEE Transactions on Fuzzy Systems.

[40]  Sarangapani Jagannathan,et al.  A Model-Based Fault Detection and Prognostics Scheme for Takagi–Sugeno Fuzzy Systems , 2014, IEEE Transactions on Fuzzy Systems.