The goal of a self-interested agent within a multiagent system is to maximize its utility over time. In a situation of strategic interdependence, where the actions of one agent may affect the utilities of other agents, the optimal behavior of an agent must be conditioned on the expected behaviors of the other agents in the system. Standard game theory assumes that the rationality and preferences of all the agents is common knowledge: each agent is then able to compute the set of possible equilibria, and if there is a unique equilibrium, choose a best-response to the actions that the other agents will all play. Real agents acting within a multiagent system face multiple problems: the agents may have incomplete information about the preferences and rationality of the other agents in the game, computing the equilibria can be computationally complex, and there might be many equilibria from which to choose. An alternative explanation of the emergence of a stable equilibrium is that it arises as the long-run outcome of a repeated game, in which bounded-rational agents adapt their strategies as they learn about the other agents in the system. We review some possible models of learning for games, and then show the pros and cons of using learning in a particular game, the Compensation Mechanism, a mechanism for the efficient coordination of actions within a multiagent system.
[1]
David M. Kreps,et al.
Learning Mixed Equilibria
,
1993
.
[2]
Michael P. Wellman.
A Market-Oriented Programming Environment and its Application to Distributed Multicommodity Flow Problems
,
1993,
J. Artif. Intell. Res..
[3]
R. McAfee,et al.
Auctions and Bidding
,
1986
.
[4]
Edmund H. Durfee,et al.
A Rigorous, Operational Formalization of Recursive Modeling
,
1995,
ICMAS.
[5]
G. Hardin,et al.
The Tragedy of the Commons
,
1968,
Green Planet Blues.
[6]
D. Fudenberg,et al.
The Theory of Learning in Games
,
1998
.
[7]
Sarit Kraus,et al.
An Overview of Incentive Contracting
,
1996,
Artif. Intell..
[8]
H. Varian.
A Solution to the Problem of Externalities When Agents Are Well-Informed
,
1994
.
[9]
Robert H. Crites,et al.
Multiagent reinforcement learning in the Iterated Prisoner's Dilemma.
,
1996,
Bio Systems.
[10]
Junling Hu,et al.
Self-fulfilling Bias in Multiagent Learning
,
1996
.