A Decision-Theoretic Approach to Evaluating Posterior Probabilities of Mental Models

In many multiagent domains, agents face the problem of maintaining and updating their beliefs over the possible mental models (goals, plans, activities, intentions, etc.) of other agents. Decision-theoretic agents typically represent this uncertainty as a probability distribution over the possible mental models of others, and they update their beliefs by computing a posterior probability over those models conditioned on their observations. We present a novel algorithm for performing this belief update when the mental models take the form of Partially Observable Markov Decision Problems (POMDPs). POMDPs are a common model for decision-theoretic agents, but there is no existing method for translating a POMDP, which generates deterministic behavior, into the probability distribution over actions that abductive reasoning requires. In this work, we explore alternative methods for generating a more suitable probability distribution. We use a sample multiagent scenario to demonstrate the different behaviors of these approaches and to draw conclusions about the conditions under which each succeeds.
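
To make the belief update concrete, the following is a minimal sketch, not the paper's algorithm, of a Bayesian update over candidate mental models of another agent. It assumes each model's deterministic policy has been softened into an action distribution via a Boltzmann (softmax) over per-action values; the function names, the temperature parameter, and the example numbers are all illustrative assumptions rather than anything taken from the original work.

```python
import numpy as np

def action_likelihood(action_values, temperature=1.0):
    """Soften per-action values into a probability distribution (softmax)."""
    v = np.asarray(action_values, dtype=float)
    v = v - v.max()                      # shift for numerical stability
    p = np.exp(v / temperature)
    return p / p.sum()

def update_belief(prior, models_action_values, observed_action, temperature=1.0):
    """Posterior P(model | action) proportional to P(action | model) * P(model)."""
    likelihoods = np.array([
        action_likelihood(values, temperature)[observed_action]
        for values in models_action_values
    ])
    posterior = prior * likelihoods
    return posterior / posterior.sum()

# Illustrative example: two candidate models of the other agent, three actions.
prior = np.array([0.5, 0.5])
models_action_values = [
    [1.0, 0.2, 0.1],   # model A prefers action 0
    [0.1, 0.2, 1.0],   # model B prefers action 2
]
posterior = update_belief(prior, models_action_values, observed_action=0)
print(posterior)       # belief shifts toward model A
```

Some such softening is needed because, without it, an observed action that a model's deterministic policy would never select receives zero likelihood, making the posterior update degenerate.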