Paper Information - Conjectural Equilibrium in Multiagent Learning

Learning in a multiagent environment is complicated by the fact that as other agents learn, the environment effectively changes. Moreover, other agents' actions are often not directly observable, and the actions taken by the learning agent can strongly bias which range of behaviors is encountered. We define the concept of a conjectural equilibrium, where all agents' expectations are realized, and each agent responds optimally to its expectations. We present a generic multiagent exchange situation, in which competitive behavior constitutes a conjectural equilibrium. We then introduce an agent that executes a more sophisticated strategic learning strategy, building a model of the response of other agents. We find that the system reliably converges to a conjectural equilibrium, but that the final result achieved is highly sensitive to the agent's initial belief. In essence, the strategic learner's actions tend to fulfill its expectations. Depending on the starting point, the agent may be better or worse off than had it not attempted to learn a model of the other agents at all.

Michael P. Wellman | Junling Hu
