论文信息 - Optimised Reputation-Based Adaptive Punishment for Limited Observability

Optimised Reputation-Based Adaptive Punishment for Limited Observability

The use of social norms has proven to be effective in the self-governance of decentralised systems in which there is no central authority. Axelrod's seminal model of norm establishment in populations of self-interested individuals provides some insight into the mechanisms needed to support this through the use of metanorms, but is not directly applicable to real world scenarios such as online peer-to-peer communities, for example. In particular, it does not reflect different topological arrangements of interactions. While some recent efforts have sought to address these limitations, they are also limited in not considering the point-to-point interactions between agents that arise in real systems, but only interactions that are visible to an entire neighbourhood. The objective of this paper is twofold: firstly to incorporate these realistic adaptations to the original model, and secondly, to provide agents with reputation based mechanisms that allow them to dynamically optimise the intensity of punishment ensuring norm establishment in exactly these limited observation conditions.

Michael Luck | Jeroen Keppens | Daniel Villatoro | Samhar Mahmoud

[1] Nikos Nikiforakis,et al. A Comparative Statics Analysis of Punishment in Public-Good Experiments , 2005 .

[2] José Manuel Galán,et al. Appearances Can Be Deceiving: Lessons Learned Re-Implementing Axelrod's 'Evolutionary Approach to Norms' , 2005, J. Artif. Soc. Soc. Simul..

[3] Koichi Kurumatani,et al. An Investigation into the Use of Group Dynamics for Solving Social Dilemmas , 2004, MABS.

[4] Jordi Sabater-Mir,et al. Proceedings of the Twenty-Second International Joint Conference on Artificial Intelligence Dynamic Sanctioning for Robust and Cost-Efficient Norm Compliance , 2022 .

[5] Joshua M. Epstein,et al. Learning to Be Thoughtless: Social Norms and Individual Computation , 2001 .

[6] Sandip Sen,et al. Topology and Memory Effect on Convention Emergence , 2009, 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology.

[7] R. Boyd,et al. The evolution of altruistic punishment , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[8] R. Axelrod. An Evolutionary Approach to Norms , 1986, American Political Science Review.

[9] Katia P. Sycara,et al. Middle-Agents for the Internet , 1997, IJCAI.

[10] Manuela M. Veloso,et al. Rational and Convergent Learning in Stochastic Games , 2001, IJCAI.

[11] Rosaria Conte,et al. A cognitive model of punishment , 2010 .

[12] Michael Luck,et al. Efficient Norm Emergence through Experiential Dynamic Punishment , 2012, ECAI.

[13] Bastin Tony Roy Savarimuthu,et al. Social Norm Emergence in Virtual Agent Societies , 2008, DALT.

[14] Michael Luck,et al. Norm Establishment via Metanorms in Network Topologies , 2011, 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology.

[15] Attila Szolnoki,et al. Punish, but not too hard: how costly punishment spreads in the spatial public goods game , 2010, 1007.0431.

[16] Sanjiva Weerawarana,et al. Unraveling the Web services web: an introduction to SOAP, WSDL, and UDDI , 2002, IEEE Internet Computing.

[17] Michael Luck,et al. Norm emergence : overcoming hub effects in scale free networks , 2012 .

[18] Albert,et al. Emergence of scaling in random networks , 1999, Science.

[19] E. Fehr,et al. Altruistic punishment in humans , 2002, Nature.

[20] Michael Luck,et al. Overcoming Omniscience in Axelrod's Model , 2011, 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology.

[21] Peter Dayan,et al. Q-learning , 1992, Machine Learning.

[22] Albert-László Barabási,et al. Statistical mechanics of complex networks , 2001, ArXiv.

[23] Frank Dignum,et al. Ubi Lex, Ibi Poena : Designing Norm Enforcement in E-Institutions , 2006, COIN@AAMAS/ECAI.

[24] Daniel Polani,et al. Modelling the Emergence of Possession Norms using Memes , 2001, J. Artif. Soc. Soc. Simul..

[25] Michael Luck,et al. An analysis of norm emergence in Axelrod's model , 2010 .

[26] Ernst Fehr,et al. Detrimental effects of sanctions on human altruism , 2003, Nature.

[27] Mark E. J. Newman,et al. The Structure and Function of Complex Networks , 2003, SIAM Rev..

[28] Carles Sierra,et al. Distributed Norm Enforcement: Ostracism in Open Multi-Agent Systems , 2008, Computable Models of the Law, Languages, Dialogues, Games, Ontologies.

[29] Nikos Nikiforakis,et al. Punishment and Counter-punishment in Public Good Games: Can We Still Govern Ourselves? , 2005 .

[30] Carles Sierra,et al. Friends no more: norm enforcement in multiagent systems , 2007, AAMAS '07.