Evolutionary behavior of generalized zero-determinant strategies in iterated prisoner’s dilemma

We study the competition and strategy selections between a class of generalized zero-determinant (ZD) strategies and the classic strategies of always cooperate (AllC), always defect (AllD), tit-for-tat (TFT), and win-stay-lose-shift (WSLS) strategies in an iterated prisoner’s dilemma comprehensively. Using the generalized ZD strategy, a player could get a payoff that is χ (χ>1) times that of his opponent’s, when the payoff is measured with respect to a referencing baseline parameterized by 0≤σ≤1. Varying σ gives ZD strategies of tunable generosity from the extortionate-like ZD strategy for σ≪1 to the compliance-like strategy at σ≈1. Expected payoffs when ZD strategy competes with each one of the classic strategies are presented. Strategy evolution based on adopting the strategy of a better performing neighbor is studied in a well-mixed population of finite size and a population on a square lattice. Depending on the parameters, extortion-like strategies may not be evolutionarily stable despite a positive surplus over cooperative strategies, while extortion-like strategies may dominate or coexist with other strategies that tend to defect despite a negative surplus. The dependence of the equilibrium fraction of ZD strategy players on the model parameters in a well-mixed population can be understood analytically by comparing the average payoffs to the competing strategies. On a square lattice, the success of the ZD strategy can be qualitatively understood by focusing on the relative alignments of the finite number of payoff values that the two competing strategies could attain when the spatial structure is imposed. ZD strategies with properly chosen generosity could be more successful in evolutionary competing systems.

[1]  M. Nowak,et al.  The evolution of stochastic strategies in the Prisoner's Dilemma , 1990 .

[2]  Arend Hintze,et al.  Critical Dynamics in the Evolution of Stochastic Strategies for the Iterated Prisoner's Dilemma , 2010, PLoS Comput. Biol..

[3]  M. Nowak,et al.  Evolutionary games and spatial chaos , 1992, Nature.

[4]  M. Milinski,et al.  Cooperation through image scoring in humans. , 2000, Science.

[5]  P. Grim,et al.  Spatialization and greater generosity in the stochastic Prisoner's Dilemma. , 1996, Bio Systems.

[6]  Michael Doebeli,et al.  Spatial structure often inhibits the evolution of cooperation in the snowdrift game , 2004, Nature.

[7]  Chen Xu,et al.  Networking effects on cooperation in evolutionary snowdrift game , 2006 .

[8]  Zhen Wang,et al.  If players are sparse social dilemmas are too: Importance of percolation for evolution of cooperation , 2012, Scientific Reports.

[9]  M. Nowak,et al.  A strategy of win-stay, lose-shift that outperforms tit-for-tat in the Prisoner's Dilemma game , 1993, Nature.

[10]  R. Boyd,et al.  No pure strategy is evolutionarily stable in the repeated Prisoner's Dilemma game , 1987, Nature.

[11]  P. Hui,et al.  Analyzing phase diagrams and phase transitions in networked competing populations , 2011 .

[12]  M. Nowak,et al.  Stochastic dynamics of invasion and fixation. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[13]  Dong Hao,et al.  Zero-determinant strategy: An underway revolution in game theory , 2014 .

[14]  M. Nowak Stochastic strategies in the Prisoner's Dilemma , 1990 .

[15]  Martin A Nowak,et al.  Evolution of extortion in Iterated Prisoner’s Dilemma games , 2012, Proceedings of the National Academy of Sciences.

[16]  Y. Iwasa,et al.  The evolution of cooperation in a lattice-structured population. , 1997, Journal of theoretical biology.

[17]  Zhihai Rong,et al.  Boosting cooperation by involving extortion in spatial prisoner's dilemma games. , 2014, Physical review. E, Statistical, nonlinear, and soft matter physics.

[18]  M. Doebeli,et al.  Spatial evolutionary game theory: Hawks and Doves revisited , 1996, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[19]  R. Axelrod The Emergence of Cooperation among Egoists , 1981, American Political Science Review.

[20]  M. Nowak,et al.  Tit for tat in heterogeneous populations , 1992, Nature.

[21]  Siew Kian Tey,et al.  Evolutionary snowdrift game incorporating costly punishment in structured populations , 2013 .

[22]  M. Nowak,et al.  Evolution of indirect reciprocity by image scoring , 1998, Nature.

[23]  M. Nowak,et al.  Adaptive Dynamics of Extortion and Compliance , 2013, PloS one.

[24]  F. C. Santos,et al.  Scale-free networks provide a unifying framework for the emergence of cooperation. , 2005, Physical review letters.

[25]  Guanrong Chen,et al.  Coevolution of strategy-selection time scale and cooperation in spatial prisoner's dilemma game , 2013 .

[26]  W. Hamilton,et al.  The Evolution of Cooperation , 1984 .

[27]  W. Press,et al.  Iterated Prisoner’s Dilemma contains strategies that dominate any evolutionary opponent , 2012, Proceedings of the National Academy of Sciences.

[28]  Alexander J. Stewart,et al.  From extortion to generosity, evolution in the Iterated Prisoner’s Dilemma , 2013, Proceedings of the National Academy of Sciences.

[29]  J. Hirshleifer,et al.  What Strategies Can Support the Evolutionary Emergence of Cooperation? , 1988 .

[30]  Alexander J. Stewart,et al.  Extortion and cooperation in the Prisoner’s Dilemma , 2012, Proceedings of the National Academy of Sciences.

[31]  H. Ohtsuki,et al.  A simple rule for the evolution of cooperation on graphs and social networks , 2006, Nature.

[32]  M. Perc Transition from Gaussian to Levy distributions of stochastic payoff variations in the spatial prisoner's dilemma game. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[33]  G T Vickers,et al.  The spatial struggle of tit-for-tat and defect. , 1995, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.