Linear Quadratic Mean Field Teams: Optimal and Approximately Optimal Decentralized Solutions

We consider team optimal control of decentralized systems with linear dynamics, quadratic costs, and arbitrary disturbance that consist of multiple sub-populations with exchangeable agents (i.e., exchanging two agents within the same sub-population does not affect the dynamics or the cost). Such a system is equivalent to one where the dynamics and costs are coupled across agents through the mean-field (or empirical mean) of the states and actions (even when the primitive random variables are non-exchangeable). Two information structures are investigated. In the first, all agents observe their local state and the mean-field of all sub-populations, in the second, all agents observe their local state but the mean-field of only a subset of the sub-populations. Both information structures are non-classical and not partially nested. Nonetheless, it is shown that linear control strategies are optimal for the first and approximately optimal for the second, the approximation error is inversely proportional to the size of the sub-populations whose mean-fields are not observed. The corresponding gains are determined by the solution of K+1 decoupled standard Riccati equations, where K is the number of sub-populations. The dimensions of the Riccati equations do not depend on the size of the sub-populations, thus the solution complexity is independent of the number of agents. Generalizations to major-minor agents, tracking cost, weighted mean-field, and infinite horizon are provided. The results are illustrated using an example of demand response in smart grids.

[1]  Nuno C. Martins,et al.  Information structures in optimal decentralized control , 2012, 2012 IEEE 51st IEEE Conference on Decision and Control (CDC).

[2]  Minyi Huang,et al.  Large-Population LQG Games Involving a Major Player: The Nash Certainty Equivalence Principle , 2009, SIAM J. Control. Optim..

[3]  H. Witsenhausen Separation of estimation and control for discrete time systems , 1971 .

[4]  Neil Immerman,et al.  The Complexity of Decentralized Control of Markov Decision Processes , 2000, UAI.

[5]  Sanjay Lall,et al.  A Characterization of Convex Problems in Decentralized Control$^ast$ , 2005, IEEE Transactions on Automatic Control.

[6]  P. Whittle,et al.  The optimal linear solution of a symmetric team control problem , 1974 .

[7]  Tamer Basar,et al.  Linear Quadratic Risk-Sensitive and Robust Mean Field Games , 2017, IEEE Transactions on Automatic Control.

[8]  Tao Li,et al.  Asymptotically Optimal Decentralized Control for Large Population Stochastic Multiagent Systems , 2008, IEEE Transactions on Automatic Control.

[9]  J. Ma,et al.  Forward-Backward Stochastic Differential Equations and their Applications , 2007 .

[10]  Panganamala Ramana Kumar,et al.  The ISO problem: Decentralized stochastic control via bidding schemes , 2015, 2015 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[11]  Xun Li,et al.  Discrete time mean-field stochastic linear-quadratic optimal control problems , 2013, Autom..

[12]  R. Radner,et al.  Team Decision Problems , 1962 .

[13]  Zhen Wu,et al.  Mean Field Linear-Quadratic-Gaussian (LQG) Games: Major and Minor Players , 2014, 1403.3999.

[14]  P. Caines,et al.  Social optima in mean field LQG control: Centralized and decentralized strategies , 2009 .

[15]  Nuno C. Martins,et al.  Remote State Estimation With Communication Costs for First-Order LTI Systems , 2011, IEEE Transactions on Automatic Control.

[16]  Yasuhiko Takahara MULTI-LEVEL APPROACH TO DYNAMIC OPTIMIZATION, , 1964 .

[17]  H. Witsenhausen A Counterexample in Stochastic Optimum Control , 1968 .

[18]  Minyi Huang,et al.  Large-Population Cost-Coupled LQG Problems With Nonuniform Agents: Individual-Mass Behavior and Decentralized $\varepsilon$-Nash Equilibria , 2007, IEEE Transactions on Automatic Control.

[19]  Peter E. Caines,et al.  Mean Field Estimation for Partially Observed LQG Systems with Major and Minor Agents , 2014 .

[20]  Aditya Mahajan,et al.  Team Optimal Control of Coupled Major-Minor Subsystems with Mean-Field Sharing , 2020 .

[21]  Yi Ouyang,et al.  Signaling for decentralized routing in a queueing network , 2014, Annals of Operations Research.

[22]  P. Caines Linear Stochastic Systems , 1988 .

[23]  Sanjay Lall,et al.  Optimal Control of Two-Player Systems With Output Feedback , 2013, IEEE Transactions on Automatic Control.

[24]  P. Lions,et al.  Jeux à champ moyen. II – Horizon fini et contrôle optimal , 2006 .

[25]  Leonid Mirkin,et al.  Distributed Control with Low-Rank Coordination , 2014, IEEE Transactions on Control of Network Systems.

[26]  Seyed Mohammad Asghari,et al.  Decentralized control problems with substitutable actions , 2015, 2015 54th IEEE Conference on Decision and Control (CDC).

[27]  Y. Ho,et al.  Team decision theory and information structures in optimal control problems--Part II , 1972 .

[28]  Peter E. Caines,et al.  Mean Field Games , 2015 .

[29]  Aditya Mahajan,et al.  Team optimal control of coupled subsystems with mean-field sharing , 2014, 53rd IEEE Conference on Decision and Control.

[30]  Diogo A. Gomes,et al.  Mean Field Games Models—A Brief Survey , 2013, Dynamic Games and Applications.

[31]  Petros G. Voulgaris,et al.  A convex characterization of distributed control problems in spatially invariant systems with communication constraints , 2005, Syst. Control. Lett..

[32]  Aditya Mahajan,et al.  Team-optimal solution of finite number of mean-field coupled LQG subsystems , 2015, 2015 54th IEEE Conference on Decision and Control (CDC).

[33]  G. Cohen On an algorithm of decentralized optimal control , 1977 .

[34]  P. Lions,et al.  Jeux à champ moyen. I – Le cas stationnaire , 2006 .

[35]  Jiongmin Yong,et al.  Linear-Quadratic Optimal Control Problems for Mean-Field Stochastic Differential Equations , 2013, SIAM J. Control. Optim..

[36]  Thomas R. Kane,et al.  THEORY AND APPLICATIONS , 1984 .

[37]  P. Caines,et al.  Individual and mass behaviour in large population stochastic wireless power control problems: centralized and Nash equilibrium solutions , 2003, 42nd IEEE International Conference on Decision and Control (IEEE Cat. No.03CH37475).

[38]  Babak Hassibi,et al.  Indefinite-Quadratic Estimation And Control , 1987 .