Achieving Sustainable Cooperation in Generalized Prisoner's Dilemma with Observation Errors

A repeated game is a formal model for analyzing cooperation in long-term relationships, e.g., in the prisoner's dilemma. Although the case where each player observes her opponent's action with some observation errors (imperfect private monitoring) is difficult to analyze, a special type of an equilibrium called belief-free equilibrium is identified to make the analysis in private monitoring tractable. However, existing works using a belief-free equilibrium show that cooperative relations can be sustainable only in ideal situations. We deal with a generic problem that can model both the prisoner's dilemma and the team production problem. We examine a situation with an additional action that is dominated by another action. To our surprise, by adding this seemingly irrelevant action, players can achieve sustainable cooperative relations far beyond the ideal situations. More specifically, we identify a class of strategies called one-shot punishment strategy that can constitute a belief-free equilibrium in a wide range of parameters. Moreover, for a two-player case, the obtained welfare matches a theoretical upper bound.

[1]  Vincent Conitzer,et al.  Fast Equilibrium Computation for Infinitely Repeated Games , 2013, AAAI.

[2]  Brahim Chaib-draa,et al.  Repeated games for multiagent systems: a survey , 2013, The Knowledge Engineering Review.

[3]  Tadashi Sekiguchi Repeated Partnerships with Decreasing Returns , 2013 .

[4]  Takuo Sugaya Folk Theorem in Repeated Games with Private Monitoring , 2011 .

[5]  Moshe Tennenholtz,et al.  Learning equilibria in repeated congestion games , 2009, AAMAS.

[6]  Adam Tauman Kalai,et al.  The myth of the folk theorem , 2008, Games Econ. Behav..

[7]  Prashant Doshi,et al.  On the Difficulty of Achieving Equilibrium in Interactive POMDPs , 2006, AI&M.

[8]  Vincent Conitzer,et al.  AWESOME: A general multiagent learning algorithm that converges in self-play and learns a best response against stationary opponents , 2003, Machine Learning.

[9]  Jeffrey C. Ely,et al.  Belief-free Equilibria in Repeated Games , 2005 .

[10]  Shlomo Zilberstein,et al.  Dynamic Programming for Partially Observable Stochastic Games , 2004, AAAI.

[11]  Peter Stone,et al.  A polynomial-time nash equilibrium algorithm for repeated games , 2003, EC '03.

[12]  N. Whitsed Learning and teaching. , 2003, Health information and libraries journal.

[13]  Michele Piccione,et al.  The Repeated Prisoner's Dilemma with Imperfect Private Monitoring , 2002, J. Econ. Theory.

[14]  Jeffrey C. Ely,et al.  A Robust Folk Theorem for the Prisoner's Dilemma , 2002, J. Econ. Theory.

[15]  Tadashi Sekiguchi Efficiency in Repeated Prisoner's Dilemma with Private Monitoring , 1997 .

[16]  S. Heap,et al.  Repeated Games and Reputations , 1995 .

[17]  D. Fudenberg,et al.  Digitized by the Internet Archive in 2011 with Funding from Working Paper Department of Economics the Folk Theorem with Imperfect Public Information , 2022 .

[18]  David Pearce,et al.  Information and timing in repeated partnerships , 1991 .

[19]  Drew Fudenberg,et al.  The Folk Theorem in Repeated Games with Discounting or with Incomplete Information , 1986 .

[20]  R. Radner,et al.  An Example of a Repeated Partnership Game with Discounting and with Uniformly Inefficient Equilibria , 1986 .