We study the process of multiagent learning in the context of the \Santa Fe Bar Problem" (Arthur 1994). We imagine a system of bounded rational agents taking decisions within a repeated game. Each agent has a nite number of decision procedures with which to reason, and access to the entire history of outcomes of the game. We rst allow the agents to use only deterministic decision procedures. We show that an eecient solution emerges, without central coordination or explicit coordination, through the dynamics and diversity of the agents. We then introduce randomized decision procedures and demonstrate that good coordination emerges more quickly, and that the long term equilibrium is more stable and less bursty.
[1]
W. Hamilton,et al.
The evolution of cooperation.
,
1984,
Science.
[2]
Hogg,et al.
Dynamics of computational ecosystems.
,
1989,
Physical review. A, General physics.
[3]
Moshe Tennenholtz,et al.
Adaptive Load Balancing: A Study in Multi-Agent Learning
,
1994,
J. Artif. Intell. Res..
[4]
W. Arthur.
Inductive Reasoning and Bounded Rationality
,
1994
.
[5]
Bernardo A. Huberman,et al.
Resource Contention in Multiagent Systems
,
1995,
ICMAS.
[6]
Tad Hogg,et al.
Controlling chaos in distributed systems
,
1991,
IEEE Trans. Syst. Man Cybern..