Nash Strategies and Adaptation for Decentralized Games Involving Weakly-coupled Agents

We consider dynamic games in large population conditions where the agents evolve according to non-uniform dynamics and are weakly coupled via their individual dynamics and costs. A state aggregation technique is employed to obtain a set of decentralized control laws for the individuals which ensures closed-loop stability and possesses an é-Nash equilibrium property. We then propose a scheme for Nash strategy adaptation when the agents have unknown parameters. The issue of transient performance improvement is addressed by introducing dither signals.

[1]  P. Caines,et al.  Large-population cost-coupled LQG problems: generalizations to non-uniform individuals , 2004, 2004 43rd IEEE Conference on Decision and Control (CDC) (IEEE Cat. No.04CH37601).

[2]  Kevin M. Passino,et al.  Stable social foraging swarms in a noisy environment , 2004, IEEE Transactions on Automatic Control.

[3]  G. Papavassilopoulos On the linear-quadratic-Gaussian Nash game with one-step delay observation sharing pattern , 1982 .

[4]  George J. Pappas,et al.  Stable flocking of mobile agents, part I: fixed topology , 2003, 42nd IEEE International Conference on Decision and Control (IEEE Cat. No.03CH37475).

[5]  Dirk Helbing,et al.  Simulating dynamical features of escape panic , 2000, Nature.

[6]  G. Erickson Differential game models of advertising competition , 1995 .

[7]  Z. Gajic,et al.  Recursive solution of linear-quadratic Nash games for weakly interconnected systems , 1986, 1986 American Control Conference.

[8]  P. Caines,et al.  Nash Equilibria for Large-Population Linear Stochastic Systems of Weakly Coupled Agents , 2005 .

[9]  Michael L. Honig,et al.  Performance of coded DS-CDMA with pilot-assisted channel estimation and linear interference suppression , 2002, IEEE Trans. Commun..

[10]  P. Caines,et al.  Individual and mass behaviour in large population stochastic wireless power control problems: centralized and Nash equilibrium solutions , 2003, 42nd IEEE International Conference on Decision and Control (IEEE Cat. No.03CH37475).

[11]  P. Kokotovic,et al.  A singular perturbation approach to modeling and control of Markov chains , 1981 .

[12]  Peter E. Caines,et al.  Uplink power adjustment in wireless communication systems: a stochastic control analysis , 2004, IEEE Transactions on Automatic Control.

[13]  A. Mas-Colell,et al.  Microeconomic Theory , 1995 .

[14]  A. Bensoussan Perturbation Methods in Optimal Control , 1988 .

[15]  Yu-Chi Ho,et al.  Granular optimization: An approach to function optimization , 2000, Proceedings of the 39th IEEE Conference on Decision and Control (Cat. No.00CH37187).

[16]  T. Başar,et al.  Iterative computation of noncooperative equilibria in nonzero-sum differential games with weakly coupled players , 1990 .

[17]  Val E. Lambson Self-enforcing collusion in large dynamic markets , 1984 .

[18]  Han-Fu Chen,et al.  Identification and Stochastic Adaptive Control , 1991 .

[19]  Peter E. Caines,et al.  On Persistent Excitation for Linear Systems with Stochastic Coefficients , 2001, SIAM J. Control. Optim..