A Stable Distributed Neural Controller for Physically Coupled Networked Discrete-Time Systems via Online Reinforcement Learning

The large scale, time-varying dynamics, and heterogeneity of physically coupled networked infrastructures, such as power grids and transportation systems, complicate the design, implementation, and expansion of their controllers. To tackle these challenges, we propose an online distributed reinforcement learning control algorithm in which each subsystem (agent) is equipped with a one-layer neural network, allowing it to adapt to variations in the networked infrastructure. Each controller consists of a critic network and an action network, which approximate the strategy utility function and the desired control law, respectively. To avoid a large number of trials and to improve stability, the training of the action network introduces a supervised learning mechanism into the reduction of the long-term cost. The stability of the closed-loop system under the learning algorithm is analyzed, and upper bounds on the tracking error and the neural network weights are estimated. Simulation results illustrate the effectiveness of the proposed controller and indicate that stability is maintained under communication delay and disturbances as well.
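The per-agent structure described above can be illustrated with a minimal single-agent sketch, not the paper's exact equations: a one-layer critic trained by temporal-difference learning on the long-term cost, and a one-layer action network whose update blends a supervised error toward a known desired control law with a crude critic-driven correction. The class name `AgentController`, the `tanh` feature map, the blend weight `beta`, and the supervising law `u_sup` are all illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

class AgentController:
    """Toy one-agent sketch of a critic/action pair, each a single-layer NN."""

    def __init__(self, n_state, n_action, lr_c=0.05, lr_a=0.05,
                 gamma=0.95, beta=0.5):
        self.Wc = rng.normal(scale=0.1, size=n_state)              # critic weights
        self.Wa = rng.normal(scale=0.1, size=(n_action, n_state))  # action weights
        self.lr_c, self.lr_a = lr_c, lr_a
        self.gamma = gamma   # discount factor of the long-term cost
        self.beta = beta     # weight of the supervised term in the action update

    def critic(self, x):
        # Single-layer critic: approximate long-term cost (strategy utility) J(x).
        return self.Wc @ np.tanh(x)

    def act(self, x):
        # Single-layer action network: approximate control law u(x).
        return self.Wa @ np.tanh(x)

    def update(self, x, x_next, stage_cost, u_sup):
        phi = np.tanh(x)
        # Temporal-difference error of the long-term cost estimate.
        td = stage_cost + self.gamma * self.critic(x_next) - self.critic(x)
        self.Wc += self.lr_c * td * phi  # critic update along the TD error
        # Action update: supervised pull toward the desired law u_sup, blended
        # with a simplified critic-driven correction (the paper derives the
        # exact form; this term is only a placeholder surrogate).
        u = self.act(x)
        e_sup = u_sup - u
        self.Wa += self.lr_a * np.outer(
            self.beta * e_sup - (1 - self.beta) * td * u, phi)
        return td
```

Online operation would then interleave `act` and `update` at every sampling step of the discrete-time subsystem, with `u_sup` supplied by whatever stabilizing nominal law supervises the action network.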
