论文信息 - Hierarchical Reinforcement Learning for Optimal Control of Linear Multi-Agent Systems: the Homogeneous Case

Hierarchical Reinforcement Learning for Optimal Control of Linear Multi-Agent Systems: the Homogeneous Case

Individual agents in a multi-agent system (MAS) may have decoupled open-loop dynamics, but a cooperative control objective usually results in coupled closed-loop dynamics thereby making the control design computationally expensive. The computation time becomes even higher when a learning strategy such as reinforcement learning (RL) needs to be applied to deal with the situation when the agents dynamics are not known. To resolve this problem, this paper proposes a hierarchical RL scheme for a linear quadratic regulator (LQR) design in a continuous-time linear MAS. The idea is to exploit the structural properties of two graphs embedded in the $Q$ and $R$ weighting matrices in the LQR objective to define an orthogonal transformation that can convert the original LQR design to multiple decoupled smaller-sized LQR designs. We show that if the MAS is homogeneous then this decomposition retains closed-loop optimality. Conditions for decomposability, an algorithm for constructing the transformation matrix, a hierarchical RL algorithm, and robustness analysis when the design is applied to non-homogeneous MAS are presented. Simulations show that the proposed approach can guarantee significant speed-up in learning without any loss in the cumulative value of the LQR cost.

[1] Silvia Ferrari,et al. Distributed optimal control for multi-agent trajectory optimization , 2014, Autom..

[2] F. Lewis,et al. Reinforcement Learning and Feedback Control: Using Natural Decision Methods to Design Optimal Adaptive Controllers , 2012, IEEE Control Systems.

[3] Frank L. Lewis,et al. Cooperative Optimal Control for Multi-Agent Systems on Directed Graph Topologies , 2014, IEEE Transactions on Automatic Control.

[4] Charles R. Johnson,et al. Matrix analysis , 1985, Statistical Inference for Engineers and Data Scientists.

[5] Tatsuo Narikiyo,et al. Hierarchical Decentralized Robust Optimal Design for Homogeneous Linear Multi-Agent Systems , 2016, ArXiv.

[6] Frank L. Lewis,et al. Cooperative Control of Multi-Agent Systems: Optimal and Adaptive Design Approaches , 2013 .

[7] Francesco Borrelli,et al. Distributed LQR Design for Identical Dynamically Decoupled Systems , 2008, IEEE Transactions on Automatic Control.

[8] He Bai,et al. Model-Free Optimal Control of Linear Multi-Agent Systems via Decomposition and Hierarchical Approximation , 2020, ArXiv.

[9] Tomonori Sadamoto,et al. Fast Online Reinforcement Learning Control using State-Space Dimensionality Reduction , 2019, ArXiv.

[10] Zhong-Ping Jiang,et al. Computational adaptive optimal control for continuous-time linear systems with completely unknown dynamics , 2012, Autom..

[11] Frank L. Lewis,et al. Adaptive optimal control for continuous-time linear systems based on policy iteration , 2009, Autom..

[12] Aranya Chakrabortty,et al. On Model-Free Reinforcement Learning of Reduced-Order Optimal Control for Singularly Perturbed Systems , 2018, 2018 IEEE Conference on Decision and Control (CDC).

[13] Aranya Chakrabortty,et al. Optimal Control of Large-Scale Networks using Clustering Based Projections , 2016, ArXiv.

[14] Aranya Chakrabortty,et al. Model-Free Reinforcement Learning of Minimal-Cost Variance Control , 2020, IEEE Control Systems Letters.