论文信息 - Level-0 meta-models for predicting human behavior in games - 字舞流文

Level-0 meta-models for predicting human behavior in games

Behavioral game theory seeks to describe the way actual people (as compared to idealized, ``rational'' agents) act in strategic situations. Our own recent work has identified iterative models (such as quantal cognitive hierarchy) as the state of the art for predicting human play in unrepeated, simultaneous-move games [Wright and Leyton-Brown 2012]. Iterative models predict that agents reason iteratively about their opponents, building up from a specification of nonstrategic behavior called level-0. The modeler is in principle free to choose any description of level-0 behavior that makes sense for the given setting; however, in practice almost all existing work specifies this behavior as a uniform distribution over actions. In most games it is not plausible that even nonstrategic agents would choose an action uniformly at random, nor that other agents would expect them to do so. A more accurate model for level-0 behavior has the potential to dramatically improve predictions of human behavior, since a substantial fraction of agents may play level-0 strategies directly, and furthermore since iterative models ground all higher-level strategies in responses to the level-0 strategy. Our work considers ``meta-models'' of level-0 behavior: models of the way in which level-0 agents construct a probability distribution over actions, given an arbitrary game. We evaluated many such meta-models, each of which makes its prediction based only on general features that can be computed from any normal form game. We evaluated the effects of combining each new level-0 meta-model with various iterative models, and in many cases observed large improvements in the models' predictive accuracies. In the end, we recommend a meta-model that achieved excellent performance across the board: a linear weighting of features that requires the estimation of five weights.

Kevin Leyton-Brown | James R. Wright | J. R. Wright | Kevin Leyton-Brown

[1] David J. Cooper,et al. Evidence on the equivalence of the strategic and extensive form representation of games , 2003, J. Econ. Theory.

[2] I. Simonson,et al. Choice Based on Reasons: The Case of Attraction and Compromise Effects , 1989 .

[3] Kevin Leyton-Brown,et al. Beyond equilibrium: predicting human behaviour in normal form games , 2010, AAAI.

[4] Charles A. Holt,et al. Ten Little Treasures of Game Theory and Ten Intuitive Contradictions , 2001 .

[5] Ian Witten,et al. Data Mining , 2000 .

[6] Kevin Leyton-Brown,et al. Behavioral game theoretic models: a Bayesian framework for parameter analysis , 2012, AAMAS.

[7] R. Aumann,et al. Unraveling in Guessing Games : An Experimental Study , 2007 .

[8] Miguel A. Costa-Gomes,et al. Cognition and Behavior in Normal-Form Games: An Experimental Study , 1998 .

[9] Arad Ayala,et al. The Tennis Coach Problem: A Game-Theoretic and Experimental Study , 2012 .

[10] Dan Ariely,et al. Seeking Subjective Dominance in Multidimensional Space: An Explanation of the Asymmetric Dominance Effect , 1995 .

[11] D. Stahl,et al. Experimental evidence on players' models of other players , 1994 .

[12] Kevin Leyton-Brown,et al. Mechanical TA: Partially Automated High-Stakes Peer Grading , 2015, SIGCSE.

[13] V. Crawford,et al. Level-k Auctions: Can a Non-Equilibrium Model of Strategic Thinking Explain the Winner's Curse and Overbidding in Private-Value Auctions? , 2007 .

[14] D. Stahl,et al. Equilibrium selection and bounded rationality in symmetric normal-form games , 2007 .

[15] 주철환. H.O.T , 1999 .

[16] Stefan P. Penczynski,et al. Out of your mind: Eliciting individual reasoning in one shot games , 2014, Games Econ. Behav..

[17] Kevin Leyton-Brown,et al. Linear solvers for nonlinear games: using pivoting algorithms to find Nash equilibria in n-player games , 2011, SECO.

[18] R. Thaler,et al. Anomalies: Ultimatums, Dictators and Manners , 1995 .

[19] D. Stahl,et al. On Players' Models of Other Players: Theory and Experimental Evidence , 1995 .

[20] Radford M. Neal. Annealed importance sampling , 1998, Stat. Comput..

[21] D. Stahl,et al. Modeling and Testing for Heterogeneity in Observed Strategic Behavior , 2001, Review of Economics and Statistics.

[22] Colin Camerer,et al. A Cognitive Hierarchy Model of Games , 2004 .

[23] Andrew Caplin,et al. The Process of Choice in Guessing Games , 2010 .

[24] Colin Camerer. Behavioral Game Theory: Experiments in Strategic Interaction , 2003 .

[25] Thomas R. Palfrey,et al. Heterogeneous quantal response equilibrium and cognitive hierarchies , 2006, J. Econ. Theory.

[26] Nikolaus Hansen,et al. Completely Derandomized Self-Adaptation in Evolution Strategies , 2001, Evolutionary Computation.

[27] Ariel Rubinstein,et al. Colonel Blotto’s Top Secret Files , 2010 .

[28] T. Scharping. Hide-and-seek: China's elusive population data , 2001 .

[29] A. Rubinstein,et al. The 11-20 Money Request Game: A Level-k Reasoning Study , 2012 .

[30] R. Thaler. The Ultimatum Game , 1988 .

[31] V. Crawford,et al. Fatal Attraction: Focality, Naivete, and Sophistication in Experimental "Hide-and-Seek" Games , 2007 .

[32] Ram Ramamoorthy,et al. Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2012) , 2012, AAMAS 2012.

[33] Kevin Leyton-Brown,et al. Incentivizing Evaluation via Limited Access to Ground Truth: Peer-Prediction Makes Things Worse , 2016, ArXiv.

[34] D. Stahl,et al. Level-n bounded rationality and dominated strategies in normal-form games , 2008 .

[35] Leonard J. Savage,et al. The Theory of Statistical Decision , 1951 .

[36] Ian H. Witten,et al. Data mining: practical machine learning tools and techniques with Java implementations , 2002, SGMD.