Distribution-Valued Solution Concepts

Under its conventional positive interpretation, game theory makes predictions about the mixed strategy profile of the players in a noncooperative game by specifying a "solution set" of such profiles, e.g., the set of Nash equilibria of that game. Profiles outside of that set are implicitly assigned probability zero, and relative probabilities of profiles in that set are not provided. In contrast, Bayesian analysis does not make predictions about the state of a system by specifying a set of possible states of that system. Rather it provides a probability density over all those states, conditioned on all relevant information we have concerning the system. So when the "state of a system" is the strategy profile of the players of a game, and our information comprises the game specification, a Bayesian analysis would result in a posterior density over the set of all profiles, conditioned on the game specification. Evidently the very form of a prediction provided by a Bayesian analysis is different from the form provided by solution sets. In this paper, we show how to construct a Bayesian posterior density over profiles and discuss its practical advantages over solution sets. As an example, by combining this posterior density with a loss function of the scientist making the prediction, standard decision theory fixes the unique Bayes-optimal prediction of the profile conditioned on the game specification. So it provides a universal refinement. As another example, Bayesian regulators of the players involved in a game would use such a posterior density to make Bayes-optimal choices of a mechanism to control player behavior (and thereby fully adhere to Savage's axioms). In particular, a regulator can do this in situations where conventional mechanism design cannot provide advice. We illustrate all of this numerically with Cournot duopoly games.

[1]  E. Jaynes Information Theory and Statistical Mechanics , 1957 .

[2]  Howard Raiffa,et al.  Games And Decisions , 1958 .

[3]  R. Aumann Subjectivity and Correlation in Randomized Strategies , 1974 .

[4]  Flemming Topsøe,et al.  Information-theoretical optimization techniques , 1979, Kybernetika.

[5]  R. Radner Collusive behavior in noncooperative epsilon-equilibria of oligopolies with long but finite lives , 1980 .

[6]  Timothy R. C. Read,et al.  Multinomial goodness-of-fit tests , 1984 .

[7]  Jesús Seade,et al.  Profitable cost increases and the shifting of taxation : equilibrium response of markets in oligopoly , 1985 .

[8]  S. Gull Bayesian Inductive Inference and Maximum Entropy , 1988 .

[9]  J. Berger Statistical Decision Theory and Bayesian Analysis , 1988 .

[10]  Philip A. Haile,et al.  On the Empirical Content of Quantal Response Equilibrium , 2003 .

[11]  J. Geweke,et al.  Bayesian Inference in Econometric Models Using Monte Carlo Integration , 1989 .

[12]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[13]  D. R. Wolf,et al.  Alpha, Evidence, and the Entropic Prior , 1993 .

[14]  J. Paris The Uncertain Reasoner's Companion: A Mathematical Perspective , 1994 .

[15]  D. Stahl,et al.  Experimental evidence on players' models of other players , 1994 .

[16]  Kim C. Border,et al.  Infinite dimensional analysis , 1994 .

[17]  R. Aumann,et al.  Epistemic Conditions for Nash Equilibrium , 1995 .

[18]  R. McKelvey,et al.  Quantal Response Equilibria for Normal Form Games , 1995 .

[19]  Arnold Zellner,et al.  Models, prior information, and Bayesian analysis☆ , 1996 .

[20]  Douglas J. Miller,et al.  Maximum entropy econometrics: robust estimation with limited data , 1996 .

[21]  Colin Camerer Prospect Theory In The Wild: Evidence From The Field , 1998 .

[22]  R. McKelvey,et al.  Quantal Response Equilibria for Extensive Form Games , 1998 .

[23]  D. Fudenberg,et al.  The Theory of Learning in Games , 1998 .

[24]  Robert J. Aumann,et al.  Interactive epistemology II: Probability , 1999, Int. J. Game Theory.

[25]  David G. Stork,et al.  Pattern Classification (2nd ed.) , 1999 .

[26]  J. Pearl Causality: Models, Reasoning and Inference , 2000 .

[27]  C. Starmer Developments in Non-expected Utility Theory: The Hunt for a Descriptive Theory of Choice under Risk , 2000 .

[28]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[29]  Noam Nisan,et al.  Algorithmic Mechanism Design , 2001, Games Econ. Behav..

[30]  Jacob K. Goeree,et al.  Quantal Response Equilibrium and Overbidding in Private-Value Auctions , 2002, J. Econ. Theory.

[31]  Colin Camerer Behavioral Game Theory: Experiments in Strategic Interaction , 2003 .

[32]  Kevin S. Van Horn,et al.  Constructing a logic of plausible inference: a guide to Cox's theorem , 2003, Int. J. Approx. Reason..

[33]  Miguel A. Costa-Gomes,et al.  Cognition and Behavior in Two-Person Guessing Games: An Experimental Study , 2003 .

[34]  Teck-Hua Ho,et al.  Models of Thinking, Learning, and Teaching in Games , 2003 .

[35]  E. Jaynes Probability theory : the logic of science , 2003 .

[36]  Michael P. Wellman,et al.  Nash Q-Learning for General-Sum Stochastic Games , 2003, J. Mach. Learn. Res..

[37]  David H. Wolpert,et al.  Information Theory - The Bridge Connecting Bounded Rational Game Theory and Statistical Physics , 2004, ArXiv.

[38]  Christian P. Robert,et al.  Monte Carlo Statistical Methods , 2005, Springer Texts in Statistics.

[39]  Colin Camerer,et al.  A Cognitive Hierarchy Model of Games , 2004 .

[40]  A. Dawid,et al.  Game theory, maximum entropy, minimum discrepancy and robust Bayesian decision theory , 2004, math/0410076.

[41]  David J. C. MacKay,et al.  Information Theory, Inference, and Learning Algorithms , 2004, IEEE Transactions on Information Theory.

[42]  Eric Horvitz From the Guest Editor...: Special Issue on Graph-Based Representations , 2005, Decis. Anal..

[43]  Robert Kurzban,et al.  Experiments investigating cooperative types in humans: a complement to evolutionary theory and simulations. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[44]  John A List,et al.  A simple test of expected utility theory using professional traders. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[45]  David H. Wolpert,et al.  A Predictive Theory of Games , 2005, ArXiv.

[46]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .

[47]  Victor Aguirregabiria,et al.  Dynamic Discrete Choice Structural Models: A Survey , 2010, SSRN Electronic Journal.

[48]  Yoav Shoham,et al.  If multi-agent learning is the answer, what is the question? , 2007, Artif. Intell..

[49]  Alvin E. Roth,et al.  Multi-agent learning and the descriptive value of simple models , 2007, Artif. Intell..

[50]  Arnold Zellner,et al.  Some aspects of the history of Bayesian information processing , 2007 .

[51]  V. Crawford,et al.  Level-k Auctions: Can a Non-Equilibrium Model of Strategic Thinking Explain the Winner's Curse and Overbidding in Private-Value Auctions? , 2007 .

[52]  Peter Stone,et al.  Multiagent learning is not the answer. It is the question , 2007, Artif. Intell..

[53]  David Heckerman,et al.  A Tutorial on Learning with Bayesian Networks , 1999, Innovations in Bayesian Networks.

[54]  D. Wolpert,et al.  Statistical Prediction of the Outcome of a Noncooperative Game , 2008 .

[55]  D. Wolpert,et al.  A theory of unstructured bargaining using distribution-valued solution concepts , 2010 .

[56]  David H. Wolpert,et al.  PGT: A Statistical Approach to Prediction and Mechanism Design , 2010, SBP.

[57]  Han Hong,et al.  Identification and Estimation of a Discrete Game of Complete Information , 2010 .

[58]  Han Hong,et al.  A Note on Semiparametric Estimation of Finite Mixtures of Discrete Choice Models with Application to Game Theoretic Models , 2011 .

[59]  Evgueni A. Haroutunian,et al.  Information Theory and Statistics , 2011, International Encyclopedia of Statistical Science.

[60]  D. Wolpert,et al.  Theory of Collective Intelligence , 2013 .