A hierarchy of prescriptive goals for multiagent learning
暂无分享,去创建一个
A great deal of theoretical effort in multiagent learning involves either embracing or avoiding the inherent symmetry between the problem and the solution. Regret minimization is an approach to the prescriptive, non-cooperative goal that explicitly breaks this symmetry, but, since it makes no assumptions about the adversary, it achieves only limited guarantees. In this paper, we consider a hierarchy of goals that begins with the basics of regret minimization and moves towards the utility guarantees achievable by agents that could also guarantee converging to a game-theoretic equilibrium.
[1] Dean P. Foster,et al. Regret in the On-Line Decision Problem , 1999 .
[2] Manuela Veloso,et al. Multiagent learning in the presence of agents with limitations , 2003 .
[3] R. Vohra,et al. Calibrated Learning and Correlated Equilibrium , 1996 .
[4] Amy Greenwald,et al. A General Class of No-Regret Learning Algorithms and Game-Theoretic Equilibria , 2003, COLT.