Collective Stochastic Discrete Choice Problems: A Min-LQG Game Formulation

We consider a class of dynamic collective choice models with social interactions, whereby a large number of non-uniform agents have to individually settle on one of multiple discrete alternative choices, with the relevance of their would-be choices continuously impacted by noise and the unfolding group behavior. This class of problems is modeled here as a so-called Min-LQG game, i.e., a linear quadratic Gaussian dynamic and non-cooperative game, with an additional combinatorial aspect in that it includes a final choice-related minimization in its terminal cost. The presence of this minimization term is key to enforcing some specific discrete choice by each individual agent. The theory of mean field games is invoked to generate a class of decentralized agent feedback control strategies which are then shown to converge to an exact Nash equilibrium of the game as the number of players increases to infinity. A key building block in our approach is an explicit solution to the problem of computing the best response of a generic agent to some arbitrarily posited smooth mean field trajectory. Ultimately, an agent is shown to face a continuously revised discrete choice problem, where greedy choices dictated by current conditions must be constantly balanced against the risk of the future process noise upsetting the wisdom of such decisions.Even though an agent's ultimately chosen alternative is random and dictated by its entire noise history and initial state, the limiting infinite population macroscopic behavior can still be predicted. It is shown that any Nash equilibrium of the game is defined by an a priori computable probability matrix characterizing the manner in which the agent population ultimately splits among the available alternatives.

[1]  William A. Brock,et al.  Discrete Choice with Social Interactions , 2001 .

[2]  I. Couzin,et al.  Collective behavior in cancer cell populations , 2009, BioEssays : news and reviews in molecular, cellular and developmental biology.

[3]  Rainer Hegselmann,et al.  Opinion dynamics and bounded confidence: models, analysis and simulation , 2002, J. Artif. Soc. Soc. Simul..

[4]  D. McFadden Conditional logit analysis of qualitative choice behavior , 1972 .

[5]  Minyi Huang,et al.  Large-Population LQG Games Involving a Major Player: The Nash Certainty Equivalence Principle , 2009, SIAM J. Control. Optim..

[6]  Roland P. Malhamé,et al.  A Dynamic Game Model of Collective Choice: Stochastic Dynamics and Closed Loop Solutions , 2016, ArXiv.

[7]  Minyi Huang,et al.  Large-Population Cost-Coupled LQG Problems With Nonuniform Agents: Individual-Mass Behavior and Decentralized $\varepsilon$-Nash Equilibria , 2007, IEEE Transactions on Automatic Control.

[8]  W. Fleming,et al.  Controlled Markov processes and viscosity solutions , 1992 .

[9]  P. Lions,et al.  Jeux à champ moyen. I – Le cas stationnaire , 2006 .

[10]  T M Li Ge Te Interacting Particle Systems , 2013 .

[11]  P. Caines,et al.  Individual and mass behaviour in large population stochastic wireless power control problems: centralized and Nash equilibrium solutions , 2003, 42nd IEEE International Conference on Decision and Control (IEEE Cat. No.03CH37475).

[12]  P. Lions,et al.  Jeux à champ moyen. II – Horizon fini et contrôle optimal , 2006 .

[13]  T. Liggett Interacting Particle Systems , 1985 .

[14]  F. Koppelman,et al.  Incorporating Variance and Covariance Heterogeneity in the Generalized Nested Logit Model: An Application to Modeling Long Distance Travel Choice Behavior , 2005 .

[15]  Roland P. Malhamé,et al.  A Dynamic Collective Choice Model with an Advertiser , 2016, 2016 IEEE 55th Conference on Decision and Control (CDC).

[16]  Roland P. Malhamé,et al.  Consensus and disagreement in collective homing problems: A mean field games formulation , 2014, 53rd IEEE Conference on Decision and Control.

[17]  Ryo Nakajima,et al.  Measuring Peer Effects on Youth Smoking Behavior , 2004 .

[18]  P. Lions,et al.  Mean field games , 2007 .

[19]  R. Durrett Probability: Theory and Examples , 1993 .

[20]  Manuela Veloso,et al.  Multi-Robot Dynamic Role Assignment and Coordination Through Shared Potential Fields , 2002 .

[21]  C. Villani Optimal Transport: Old and New , 2008 .

[22]  Jie Lin,et al.  Coordination of groups of mobile autonomous agents using nearest neighbor rules , 2003, IEEE Trans. Autom. Control..

[23]  Asuman E. Ozdaglar,et al.  Opinion Fluctuations and Disagreement in Social Networks , 2010, Math. Oper. Res..

[24]  L. Perko Differential Equations and Dynamical Systems , 1991 .

[25]  Gerhard Freiling,et al.  A survey of nonsymmetric Riccati equations , 2002 .

[26]  John N. Tsitsiklis,et al.  Parallel and distributed computation , 1989 .

[27]  C. G. Broyden A Class of Methods for Solving Nonlinear Simultaneous Equations , 1965 .

[28]  Lawrence A. Bergman,et al.  Numerical Solution of the Fokker–Planck Equation by Finite Difference and Finite Element Methods—A Comparative Study , 2013 .

[29]  Eric van Damme,et al.  Non-Cooperative Games , 2000 .

[30]  K. Lehrer Social consensus and rational agnoiology , 1975, Synthese.

[31]  Spring Berman,et al.  Dynamic redistribution of a swarm of robots among multiple sites , 2007, 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[32]  John Rust Structural estimation of markov decision processes , 1986 .

[33]  R. Cooke Real and Complex Analysis , 2011 .

[34]  G. Erickson Differential game models of advertising competition , 1995 .

[35]  X. Zhou,et al.  Stochastic Controls: Hamiltonian Systems and HJB Equations , 1999 .

[36]  Vaibhav Srivastava,et al.  Multiagent Decision-Making Dynamics Inspired by Honeybees , 2017, IEEE Transactions on Control of Network Systems.

[37]  Alain Bensoussan,et al.  Mean Field Stackelberg Games: Aggregation of Delayed Instructions , 2015, SIAM J. Control. Optim..

[38]  Chandra R. Bhat,et al.  A MIXED SPATIALLY CORRELATED LOGIT MODEL: FORMULATION AND APPLICATION TO RESIDENTIAL CHOICE MODELING , 2004 .

[39]  Ioannis Karatzas,et al.  Brownian Motion and Stochastic Calculus , 1987 .

[40]  P. Visscher,et al.  House-hunting by honey bee swarms: collective decisions and individual behaviors , 1999, Insectes Sociaux.

[41]  Noah E. Friedkin,et al.  Social influence and opinions , 1990 .

[42]  F. Clarke Functional Analysis, Calculus of Variations and Optimal Control , 2013 .

[43]  Spring Berman,et al.  Biologically inspired redistribution of a swarm of robots among multiple sites , 2008, Swarm Intelligence.

[44]  B. Grofman,et al.  A Unified Theory of Voting: Directional and Proximity Spatial Models , 1999 .

[45]  Peter E. Caines,et al.  Large population stochastic dynamic games: closed-loop McKean-Vlasov systems and the Nash certainty equivalence principle , 2006, Commun. Inf. Syst..

[46]  Philip E. Converse,et al.  A Dynamic Simultaneous Equation Model of Electoral Choice , 1979, American Political Science Review.

[47]  Roland P. Malhamé,et al.  A dynamic game model of collective choice in multi-agent systems , 2015, 2015 54th IEEE Conference on Decision and Control (CDC).

[48]  T. Seeley,et al.  Collective decision-making in honey bees: how colonies choose among nectar sources , 1991, Behavioral Ecology and Sociobiology.

[49]  P. Caines,et al.  Social optima in mean field LQG control: Centralized and decentralized strategies , 2009 .

[50]  B. Anderson,et al.  Optimal control: linear quadratic methods , 1990 .

[51]  Karin Rothschild,et al.  A Course In Functional Analysis , 2016 .

[52]  S. Peng A general stochastic maximum principle for optimal control problems , 1990 .

[53]  René Carmona,et al.  Probabilistic Analysis of Mean-field Games , 2013 .

[54]  Sandro Zampieri,et al.  Randomized consensus algorithms over large scale networks , 2007 .