The Imitation Game: Learned Reciprocity in Markov games
暂无分享,去创建一个
Tom Eccles | Joel Z. Leibo | János Kramár | Edward Hughes | Steven Wheelwright | Edward Hughes | János Kramár | S. Wheelwright | Tom Eccles
[1] A. Rapoport,et al. Prisoner's Dilemma: A Study in Conflict and Co-operation , 1970 .
[2] Peter Duersch,et al. When is tit-for-tat unbeatable? , 2013, Int. J. Game Theory.
[3] K. Laland,et al. Social intelligence, innovation, and enhanced brain size in primates , 2002, Proceedings of the National Academy of Sciences of the United States of America.
[4] Robin I. M. Dunbar. Coevolution of neocortical size, group size and language in humans , 1993, Behavioral and Brain Sciences.
[5] R. Boyd. Mistakes allow evolutionary stability in the repeated prisoner's dilemma game. , 1989, Journal of theoretical biology.
[6] E. Ostrom. A Behavioral Approach to the Rational Choice Theory of Collective Action: Presidential Address, American Political Science Association, 1997 , 1998, American Political Science Review.
[7] Alexander Peysakhovich,et al. Maintaining cooperation in complex social dilemmas using deep reinforcement learning , 2017, ArXiv.
[8] Enrique Fatás Juberías,et al. Reciprocity, matching and conditional Cooperation in two public goods games , 2005 .
[9] G. Brady. Governing the Commons: The Evolution of Institutions for Collective Action , 1993 .
[10] T. L. Schwartz. The Logic of Collective Action , 1986 .
[11] H. Francis Song,et al. Relational Forward Models for Multi-Agent Learning , 2018, ICLR.
[12] Alexander Peysakhovich,et al. Consequentialist conditional cooperation in social dilemmas with imperfect information , 2017, AAAI Workshops.
[13] Nuttapong Chentanez,et al. Intrinsically Motivated Reinforcement Learning , 2004, NIPS.
[14] B. Latané. The psychology of social impact. , 1981 .
[15] H. Kelley,et al. The social psychology of groups , 1960 .
[16] R. Axelrod,et al. Evolutionary Dynamics , 2004 .