Experimentation and Learning in Repeated Cooperation

We study an agency model, in which the principal has only incomplete information about the agent's preferences, in a dynamic setting. Through repeated interaction with the agent, the principal learns about the agent's preferences and can thus adjust the inventive system. In a dynamic computational model, we compare different learning strategies of the principal when facing different types of agents. The results indicate that better learning of preferences can improve the situation of both parties, but the learning process is rather sensitive to random disturbances.

[1]  M. Harris,et al.  Optimal incentive contracts with imperfect information , 1979 .

[2]  Tomas Klos,et al.  Decentralized Interaction and Co-Adaptation in the Repeated Prisoner&2018;s Dilemma , 1999, Comput. Math. Organ. Theory.

[3]  David M. Messick,et al.  Frontiers in Social Dilemmas Research , 1996 .

[4]  Lawrence Davis,et al.  Genetic Algorithms and Simulated Annealing , 1987 .

[5]  K. Spremann Agent and Principal , 1987 .

[6]  V. Grossmann Is it rational to internalize the personal norm that one should reciprocate , 2002 .

[7]  Erhard Bruderer,et al.  Organizational Evolution, Learning, and Selection: A Genetic-Algorithm-Based Model , 1996 .

[8]  Joel Watson,et al.  Learning about a population of agents and the evolution of trust and cooperation , 1997 .

[9]  J. Mirrlees The Optimal Structure of Incentives and Authority Within an Organization , 1976 .

[10]  Bart Nooteboom,et al.  Agent-based computational transaction cost economics ☆ , 2001 .

[11]  Wim B. G. Liebrand,et al.  Computer Simulation of Cooperative Decision Making , 1996 .

[12]  Ramakrishnan Pakath,et al.  The Iterated Prisoner's Dilemma: early experiences with Learning Classifier System-based simple agents , 2001, Decis. Support Syst..

[13]  John H. Miller,et al.  Communication and cooperation , 1998 .

[14]  Nahoko Hayashi,et al.  Selective Play: Social Embeddedness of Social Dilemmas , 1996 .

[15]  DAVID ROSE,et al.  The principal-agent problem with adaptive players , 1996, Comput. Math. Organ. Theory.

[16]  K. Eisenhardt Agency Theory: An Assessment and Review , 1989 .

[17]  K. Spremann,et al.  Agency Theory, Information and Incentives , 1987 .

[18]  Behavioral uncertainty and investments in cooperative relationships , 2004 .

[19]  Joshua M. Epstein,et al.  Learning to Be Thoughtless: Social Norms and Individual Computation , 2001 .

[20]  Robert Axelrod,et al.  The Evolution of Strategies in the Iterated Prisoner's Dilemma , 2001 .

[21]  Kathleen M. Carley,et al.  Modeling Organizational Adaptation as a Simulated Annealing Process , 1996 .

[22]  A. Rubinstein Finite automata play the repeated prisoner's dilemma , 1986 .

[23]  Robert Hoffmann,et al.  The Ecology of Cooperation , 2001 .

[24]  Michael W. Macy,et al.  Natural Selection and Social Learning in Prisoner's Dilemma: Coadaptation with Genetic Algorithms and Artificial Neural Networks , 1996 .

[25]  Robert Hoffmann,et al.  The Independent Localisations of Interaction and Learning in the Repeated Prisoner's Dilemma , 1999 .

[26]  Christoph A. Schneeweiss,et al.  Hierarchies in Distributed Decision Making , 1999 .

[27]  Patrick D. Larkey,et al.  Subjective Probability and the Theory of Games , 1982 .

[28]  Joseph B. Kadane,et al.  The Confusion of Is and Ought in Game Theoretic Contexts , 1983 .

[29]  David Rose,et al.  The principal-agent problem with evolutionary learning , 1996, Comput. Math. Organ. Theory.

[30]  M. Macy,et al.  Stochastic Collusion and the Power Law of Learning , 2002 .

[31]  Esther Hauk,et al.  Leaving the Prison: Permitting Partner Choice and Refusal in Prisoner's Dilemma Games , 2001 .

[32]  W. Hamilton,et al.  The Evolution of Cooperation , 1984 .

[33]  M EpsteinJoshua Learning to Be Thoughtless , 2001 .

[34]  Oliver Kirchkamp,et al.  Simultaneous Evolution of Learning Rules and Strategies , 1996 .

[35]  Bruce Edmonds,et al.  Towards a Descriptive Model of Agent Strategy Search , 1999 .

[36]  L. Tesfatsion,et al.  Preferential partner selection in an evolutionary study of Prisoner's Dilemma. , 1994, Bio Systems.

[37]  John H. Miller,et al.  The coevolution of automata in the repeated Prisoner's Dilemma , 1996 .

[38]  Jean-François Laslier,et al.  A Behavioral Learning Process in Games , 2001, Games Econ. Behav..