Neural-Network-Based Output-Feedback Control Under Round-Robin Scheduling Protocols

The neural-network (NN)-based output-feedback control is considered for a class of stochastic nonlinear systems under round-Robin (RR) scheduling protocols. For the purpose of effectively mitigating data congestions and saving energies, the RR protocols are implemented and the resulting nonlinear systems become the so-called protocol-induced periodic ones. Taking such a periodic characteristic into account, an NN-based observer is first proposed to reconstruct the system states where a novel adaptive tuning law on NN weights is adopted to cater to the requirement of performance analysis. In addition, with the established boundedness of the periodic systems in the mean-square sense, the desired observer gain is obtained by solving a set of matrix inequalities. Then, an actor–critic NN scheme with a time-varying step length in adaptive law is developed to handle the considered control problem with terminal constraints over finite-horizon. Some sufficient conditions are derived to guarantee the boundedness of estimation errors of critic and actor NN weights. In view of these conditions, some key parameters in adaptive tuning laws are easily determined via elementary algebraic operations. Furthermore, the stability in the mean-square sense is investigated for the discussed issue in infinite horizon. Finally, a simulation example is utilized to illustrate the applicability of the proposed control scheme.

[1]  Shuzhi Sam Ge,et al.  Optimal Critic Learning for Robot Control in Time-Varying Environments , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[2]  Lei Zou,et al.  Set-membership filtering for time-varying systems with mixed time-delays under Round-Robin and Weighted Try-Once-Discard protocols , 2016, Autom..

[3]  Qing-Long Han,et al.  An Overview and Deep Investigation on Sampled-Data-Based Event-Triggered Control and Filtering for Networked Systems , 2017, IEEE Transactions on Industrial Informatics.

[4]  Zhong-Ping Jiang,et al.  Value iteration and adaptive dynamic programming for data-driven adaptive optimal control design , 2016, Autom..

[5]  Andrea J. Goldsmith,et al.  LQG Control for MIMO Systems Over Multiple Erasure Channels With Perfect Acknowledgment , 2012, IEEE Transactions on Automatic Control.

[6]  Sarangapani Jagannathan,et al.  Online Optimal Control of Affine Nonlinear Discrete-Time Systems With Unknown Internal Dynamics by Using Time-Based Policy Update , 2012, IEEE Transactions on Neural Networks and Learning Systems.

[7]  Bernard Widrow,et al.  Punish/Reward: Learning with a Critic in Adaptive Threshold Systems , 1973, IEEE Trans. Syst. Man Cybern..

[8]  Fuad E. Alsaadi,et al.  Security‐guaranteed filtering for discrete‐time stochastic delayed systems with randomly occurring sensor saturations and deception attacks , 2017 .

[9]  Frank Allgöwer,et al.  Constrained Stabilization of Periodic Discrete-Time Systems Via Periodic Lyapunov Functions , 2013, PSYCO.

[10]  Haibo He,et al.  Gr-GDHP: A New Architecture for Globalized Dual Heuristic Dynamic Programming , 2017, IEEE Transactions on Cybernetics.

[11]  S. Jagannathan,et al.  Near optimal output feedback control of nonlinear discrete-time systems based on reinforcement neural network learning , 2014, IEEE/CAA Journal of Automatica Sinica.

[12]  Fuwen Yang,et al.  Robust $H_{\infty}$ Control for Networked Systems With Random Packet Losses , 2007, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[13]  Frank Allgöwer,et al.  Stability of periodically time-varying systems: Periodic Lyapunov functions , 2012, Autom..

[14]  Huijun Gao,et al.  On H-infinity Estimation of Randomly Occurring Faults for A Class of Nonlinear Time-Varying Systems With Fading Channels , 2016, IEEE Transactions on Automatic Control.

[15]  Graziano Chesi,et al.  Stabilization and Entropy Reduction via SDP-Based Design of Fixed-Order Output Feedback Controllers and Tuning Parameters , 2017, IEEE Transactions on Automatic Control.

[16]  Huaguang Zhang,et al.  Neural-Network-Based Near-Optimal Control for a Class of Discrete-Time Affine Nonlinear Systems With Control Constraints , 2009, IEEE Transactions on Neural Networks.

[17]  Frank L. Lewis,et al.  Discrete-Time Nonlinear HJB Solution Using Approximate Dynamic Programming: Convergence Proof , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[18]  Frank L. Lewis,et al.  A novel actor-critic-identifier architecture for approximate optimal control of uncertain nonlinear systems , 2013, Autom..

[19]  Hong Qiao,et al.  Event-Triggered State Estimation for Discrete-Time Multidelayed Neural Networks With Stochastic Parameters and Incomplete Measurements , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[20]  Daniel W. C. Ho,et al.  Observer-Based Event-Triggering Consensus Control for Multiagent Systems With Lossy Sensors and Cyber-Attacks , 2017, IEEE Transactions on Cybernetics.

[21]  Atle Seierstad Stochastic Control in Discrete and Continuous Time , 2008 .

[22]  Derui Ding,et al.  Event-triggered consensus control for discrete-time stochastic multi-agent systems: The input-to-state stability in probability , 2015, Autom..

[23]  Frank L. Lewis,et al.  Optimal control of nonlinear discrete time-varying systems using a new neural network approximation structure , 2015, Neurocomputing.

[24]  Hao Xu,et al.  Neural Network-Based Finite Horizon Stochastic Optimal Control Design for Nonlinear Networked Control Systems , 2013, IEEE Transactions on Neural Networks and Learning Systems.

[25]  Qing-Long Han,et al.  Consensus of Multiagent Systems Subject to Partially Accessible and Overlapping Markovian Network Topologies , 2017, IEEE Transactions on Cybernetics.

[26]  S. S. Ge,et al.  Adaptive NN control for a class of discrete-time non-linear systems , 2003 .

[27]  Xinghuo Yu,et al.  Survey on Recent Advances in Networked Control Systems , 2016, IEEE Transactions on Industrial Informatics.

[28]  Fuad E. Alsaadi,et al.  A sampled-data approach to distributed H∞ resilient state estimation for a class of nonlinear time-delay systems over sensor networks , 2017, J. Frankl. Inst..

[29]  Emilia Fridman,et al.  A Round-Robin Type Protocol for Distributed Estimation with H∞ Consensus , 2014, Syst. Control. Lett..

[30]  Hao Xu,et al.  Finite-Horizon Near-Optimal Output Feedback Neural Network Control of Quantized Nonlinear Discrete-Time Systems With Input Constraint , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[31]  Zidong Wang,et al.  Variance-constrained H∞ control for a class of nonlinear stochastic discrete time-varying systems: The event-triggered design , 2016, Autom..

[32]  Yu Jiang,et al.  Robust Adaptive Dynamic Programming and Feedback Stabilization of Nonlinear Systems , 2014, IEEE Transactions on Neural Networks and Learning Systems.

[33]  Frank L. Lewis,et al.  Online actor critic algorithm to solve the continuous-time infinite horizon optimal control problem , 2009, 2009 International Joint Conference on Neural Networks.

[34]  Derong Liu,et al.  Reinforcement-Learning-Based Robust Controller Design for Continuous-Time Uncertain Nonlinear Systems Subject to Input Constraints , 2015, IEEE Transactions on Cybernetics.

[35]  Frank L. Lewis,et al.  Actor–Critic-Based Optimal Tracking for Partially Unknown Nonlinear Discrete-Time Systems , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[36]  Frank L. Lewis,et al.  Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach , 2005, Autom..

[37]  Tingwen Huang,et al.  Quantized/Saturated Control for Sampled-Data Systems Under Noisy Sampling Intervals: A Confluent Vandermonde Matrix Approach , 2017, IEEE Transactions on Automatic Control.

[38]  Yongming Li,et al.  Adaptive Neural Network Output Feedback Control for Stochastic Nonlinear Systems With Unknown Dead-Zone and Unmodeled Dynamics , 2014, IEEE Transactions on Cybernetics.