Using iterated reasoning to predict opponent strategies

The field of multiagent decision making is extending its tools from classical game theory by embracing reinforcement learning, statistical analysis, and opponent modeling. For example, behavioral economists conclude from experimental results that people act according to levels of reasoning that form a "cognitive hierarchy" of strategies, rather than merely following the hyper-rational Nash equilibrium solution concept. This paper expands this model of the iterative reasoning process by widening the notion of a level within the hierarchy from one single strategy to a distribution over strategies, leading to a more general framework of multiagent decision making. It provides a measure of sophistication for strategies and can serve as a guide for designing good strategies for multiagent games, drawing it's main strength from predicting opponent strategies. We apply these lessons to the recently introduced Lemonade-stand Game, a simple setting that includes both collaborative and competitive elements, where an agent's score is critically dependent on its responsiveness to opponent behavior. The opening moves are significant to the end result and simple heuristics have achieved faster cooperation than intricate learning schemes. Using results from the past two real-world tournaments, we show how the submitted entries fit naturally into our model and explain why the top agents were successful.

[1]  Edmund H. Durfee,et al.  A Rigorous, Operational Formalization of Recursive Modeling , 1995, ICMAS.

[2]  Miguel A. Costa-Gomes,et al.  Cognition and Behavior in Normal-Form Games: An Experimental Study , 1998 .

[3]  Dan Egnor,et al.  Iocaine powder , 2000, J. Int. Comput. Games Assoc..

[4]  Darse Billings,et al.  The First International RoShamBo Programming Competition , 2000, J. Int. Comput. Games Assoc..

[5]  Colin Camerer Behavioral Game Theory: Experiments in Strategic Interaction , 2003 .

[6]  Edmund H. Durfee,et al.  Rational Communication in Multi-Agent Environments , 2001, Autonomous Agents and Multi-Agent Systems.

[7]  Ya'akov Gal,et al.  Reasoning about rationality and beliefs , 2004, Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems, 2004. AAMAS 2004..

[8]  P. J. Gmytrasiewicz,et al.  A Framework for Sequential Planning in Multi-Agent Settings , 2005, AI&M.

[9]  Colin Camerer,et al.  A Cognitive Hierarchy Model of Games , 2004 .

[10]  John Aycock,et al.  Black Market Botnets , 2007 .

[11]  Peter McBurney,et al.  An Analysis of Entries in the First TAC Market Design Competition , 2008, 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology.

[12]  Yoav Shoham,et al.  Essentials of Game Theory: A Concise Multidisciplinary Introduction , 2008, Essentials of Game Theory: A Concise Multidisciplinary Introduction.

[13]  Peter Dayan,et al.  Bayesian Model of Behaviour in Economic Games , 2008, NIPS.

[14]  Kevin Leyton-Brown,et al.  Beyond equilibrium: predicting human behaviour in normal form games , 2010, AAAI.

[15]  Archie C. Chapman,et al.  Automated Planning in Repeated Adversarial Games , 2010, UAI.

[16]  Michael P. Wellman,et al.  Strategy and mechanism lessons from the first ad auctions trading agent competition , 2010, EC '10.