Passive dynamics in mean field control

Mean-field models are a popular tool in a variety of fields. They provide an understanding of the impact of interactions among a large number of particles or people or other “self-interested agents”, and are an increasingly popular tool in distributed control. This paper considers a particular randomized distributed control architecture introduced in our own recent work. In numerical results it was found that the associated meanfield model had attractive properties for purposes of control. In particular, when viewed as an input-output system, its linearization was found to be minimum phase. In this paper we take a closer look at the control model. The results are summarized as follows: (i) The Markov Decision Process framework of Todorov is extended to continuous time models, in which the “control cost” is based on relative entropy. This is the basis of the construction of a family of Markovian generators, parameterized by a scalar ζ E R. (ii) A decentralized control architecture is proposed in which each agent evolves as a controlled Markov process. A central authority broadcasts a common control signal {ζt} to each agent. The central authority chooses {ζt} based on an aggregate scalar output of the Markovian agents. This is the basis of the mean field model. (iii) Provided the control-free system (with ζ 0) is a reversible Markov process, the following identity holds for the transfer function G obtained from the linearization, Re (G(jω)) = PSDY(ω) ≥ 0 ω E R , where the right hand side denotes the power spectral density for the output of any one of the individual Markov processes (with ζ 0).

[1]  V. Borkar,et al.  Asymptotics of the invariant measure in mean field models with jumps , 2012 .

[2]  Ana Busic,et al.  Ancillary service to the grid from deferrable loads: The case for intelligent pool pumps in Florida , 2013, 52nd IEEE Conference on Decision and Control.

[3]  Amir Dembo,et al.  Large Deviations Techniques and Applications , 1998 .

[4]  Sean P. Meyn,et al.  Universal and Composite Hypothesis Testing via Mismatched Divergence , 2009, IEEE Transactions on Information Theory.

[5]  Bruno Gaujal,et al.  Mean Field for Markov Decision Processes: From Discrete to Continuous Optimization , 2010, IEEE Transactions on Automatic Control.

[6]  Vivek S. Borkar,et al.  Asymptotics of the invariant measure in mean field models with jumps , 2011, 2011 49th Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[7]  R. E. Kalman,et al.  When Is a Linear Control System Optimal , 1964 .

[8]  Michael Athans,et al.  Gain and phase margin for multiloop LQG regulators , 1976, 1976 IEEE Conference on Decision and Control including the 15th Symposium on Adaptive Processes.

[9]  Minyi Huang,et al.  Large-Population Cost-Coupled LQG Problems With Nonuniform Agents: Individual-Mass Behavior and Decentralized $\varepsilon$-Nash Equilibria , 2007, IEEE Transactions on Automatic Control.

[10]  S. Meyn,et al.  Large Deviations Asymptotics and the Spectral Theory of Multiplicatively Regular Markov Processes , 2005, math/0509310.

[11]  S. Meyn,et al.  Spectral theory and limit theorems for geometrically ergodic Markov processes , 2002, math/0209200.

[12]  Ana Busic,et al.  Ancillary Service to the Grid Using Intelligent Deferrable Loads , 2014, IEEE Transactions on Automatic Control.

[13]  S. Kullback,et al.  Certain Inequalities in Information Theory and the Cramer-Rao Inequality , 1954 .

[14]  Wolfgang J. Runggaldier,et al.  Connections between stochastic control and dynamic games , 1996, Math. Control. Signals Syst..

[15]  Rebecca Willett,et al.  Online Markov Decision Processes With Kullback–Leibler Control Cost , 2014, IEEE Transactions on Automatic Control.

[16]  Hamidou Tembine,et al.  Electrical Vehicles in the Smart Grid: A Mean Field Game Analysis , 2011, IEEE Journal on Selected Areas in Communications.

[17]  A. Rantzer On the Kalman-Yakubovich-Popov lemma , 1996 .