Distributed control of multi-agent systems with random parameters and a major agent

Distributed control of the multi-agent systems involving a major agent and a large number of minor agents is investigated in this paper. There exist Markov jump parameters in the dynamic equation and random parameters in the index functions. The major agent has salient impact on others. Each minor agent merely has tiny influence, while the average effect of all the minor agents is not negligible, which plays a significant role in the evolution and performance index of each agent. Besides the state of the major agent, each minor agent can only access to the information of its state and parameters. Based on the mean field (MF) theory, a set of distributed control laws is designed. By the probability limit theory, the uniform stability of the closed-loop system and the upper bound of the corresponding index values are obtained. Via a numerical example, the consistency of the MF estimation and the influence of the initial state values and parameters on the index values are demonstrated.

[1]  Minyi Huang,et al.  Nash Certainty Equivalence in Large Population Stochastic Dynamic Games: Connections with the Physics of Interacting Particle Systems , 2006, Proceedings of the 45th IEEE Conference on Decision and Control.

[2]  Finn E. Kydland,et al.  Rules Rather than Discretion: The Inconsistency of Optimal Plans , 1977, Journal of Political Economy.

[3]  M. Dufwenberg Game theory. , 2011, Wiley interdisciplinary reviews. Cognitive science.

[4]  Eitan Altman,et al.  Mean field asymptotics of Markov Decision Evolutionary Games and teams , 2009, 2009 International Conference on Game Theory for Networks.

[5]  P. Caines,et al.  Individual and mass behaviour in large population stochastic wireless power control problems: centralized and Nash equilibrium solutions , 2003, 42nd IEEE International Conference on Decision and Control (IEEE Cat. No.03CH37475).

[6]  P. Dutta A Folk Theorem for Stochastic Games , 1995 .

[7]  H. Teicher,et al.  Probability theory: Independence, interchangeability, martingales , 1978 .

[8]  Tao Li,et al.  Decentralized tracking-type games for multi-agent systems with coupled ARX models: Asymptotic Nash equilibria , 2008, Autom..

[9]  C. L. Benkard,et al.  Markov Perfect Industry Dynamics with Many Firms , 2005 .

[10]  Craig W. Reynolds Flocks, herds, and schools: a distributed behavioral model , 1998 .

[11]  Zahia Guessoum,et al.  Using adaptive multi-agent systems to simulate economic models , 2004, Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems, 2004. AAMAS 2004..

[12]  Ron Sun,et al.  Simulating Organizational Decision-Making Using a Cognitively Realistic Agent Model , 2004, J. Artif. Soc. Soc. Simul..

[13]  Tao Li,et al.  Asymptotically Optimal Decentralized Control for Large Population Stochastic Multiagent Systems , 2008, IEEE Transactions on Automatic Control.

[14]  Benjamin Van Roy,et al.  Oblivious Equilibrium: A Mean Field Approximation for Large-Scale Dynamic Games , 2005, NIPS.

[15]  Bruno Gaujal,et al.  Mean Field for Markov Decision Processes: From Discrete to Continuous Optimization , 2010, IEEE Transactions on Automatic Control.

[16]  D. Fudenberg,et al.  When Are Nonanonymous Players Negligible , 1998 .

[17]  T. Kurtz Strong approximation theorems for density dependent Markov chains , 1978 .

[18]  Benjamin Van Roy,et al.  Markov Perfect Industry Dynamics with Many Firms , 2005 .

[19]  Kai Lai Chung,et al.  A Course in Probability Theory , 1949 .

[20]  Gabriel Y. Weintraub,et al.  Oblivious Equilibrium for Concentrated Industries , 2013 .

[21]  Minyi Huang,et al.  Large-Population Cost-Coupled LQG Problems With Nonuniform Agents: Individual-Mass Behavior and Decentralized $\varepsilon$-Nash Equilibria , 2007, IEEE Transactions on Automatic Control.

[22]  Val E. Lambson Self-enforcing collusion in large dynamic markets , 1984 .

[23]  Vicsek,et al.  Novel type of phase transition in a system of self-driven particles. , 1995, Physical review letters.

[24]  M. Mariton,et al.  Jump Linear Systems in Automatic Control , 1992 .

[25]  Bruno Gaujal,et al.  A mean field approach for optimization in discrete time , 2011, Discret. Event Dyn. Syst..

[26]  S. Meyn,et al.  Synchronization of coupled oscillators is a game , 2010, ACC 2010.

[27]  R. P. Marques,et al.  Discrete-Time Markov Jump Linear Systems , 2004, IEEE Transactions on Automatic Control.

[28]  M. Benaïm,et al.  Deterministic Approximation of Stochastic Evolution in Games , 2003 .

[29]  P. Lions,et al.  Jeux à champ moyen. I – Le cas stationnaire , 2006 .

[30]  H. Chizeck,et al.  Controllability, stabilizability, and continuous-time Markovian jump linear quadratic control , 1990 .

[31]  Finn E. Kydland Equilibrium solutions in dynamic dominant-player models , 1977 .

[32]  T. Başar,et al.  Dynamic Noncooperative Game Theory , 1982 .

[33]  D. Levine,et al.  When Are Agents Negligible , 1995 .

[34]  P. Lions,et al.  Mean field games , 2007 .

[35]  W. Wonham Random differential equations in control theory , 1970 .

[36]  Charles R. Johnson,et al.  Matrix analysis , 1985, Statistical Inference for Engineers and Data Scientists.

[37]  Ji-Feng Zhang,et al.  Mean Field Games for Large-Population Multiagent Systems with Markov Jump Parameters , 2012, SIAM J. Control. Optim..

[38]  Minyi Huang,et al.  Large-Population LQG Games Involving a Major Player: The Nash Certainty Equivalence Principle , 2009, SIAM J. Control. Optim..

[39]  D. Sworder Feedback control of a class of linear systems with jump parameters , 1969 .

[40]  John P. Lewis,et al.  The Future of Disaster Response: Humans Working with Multiagent Teams using DEFACTO , 2005, AAAI Spring Symposium: AI Technologies for Homeland Security.