Multiagent Online Planning with Nested Beliefs and Dialogue

The problem of planning with partial observability in the presence of a single agent has been addressed as a contingent or POMDP problem. Since the task is computationally hard, on-line approaches have also been developed that compute only the next action to execute rather than a full policy. In this work, we address a similar problem in a multiagent setting where agents share a common goal and plan with beliefs about the world and about the possibly nested beliefs of other agents. For this, we extend the belief tracking formulation of Kominis and Geffner to the on-line setting, where plans are required to work for the true hidden state as revealed by the observations, and develop an alternative translation into classical planning that is used within a plan-execute-observe-and-replan cycle. Planning is done from the perspective of the agents: each replanning episode has a single planning agent, and that agent can change across episodes. We present empirical results and show that interesting agent dialogues arise in this setting, where agents collaborate by requesting or volunteering information in a goal-directed manner.
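The plan-execute-observe-and-replan cycle mentioned above can be illustrated with a minimal toy sketch. It is not the translation into classical planning developed in this work. The belief is simply a set of candidate hidden states, the "planner" is a stand-in that lists sensing actions, and every name (`plan`, `observe`, `update`, `run_online`) is hypothetical. Two further assumptions are made only for illustration: the planning agent rotates round-robin across episodes, and replanning happens after every executed action.

```python
from typing import FrozenSet, Iterable, List, Tuple

State = str  # assumption: the hidden state is just the name of the room holding an object


def plan(belief: FrozenSet[State]) -> List[str]:
    """Toy stand-in for a planner: one sensing action per state still considered possible."""
    return [f"sense:{room}" for room in sorted(belief)]


def observe(action: str, true_state: State) -> bool:
    """Executing a sensing action reveals whether the object is in the sensed room."""
    room = action.split(":", 1)[1]
    return room == true_state


def update(belief: FrozenSet[State], action: str, obs: bool) -> FrozenSet[State]:
    """Filter the belief: keep only the states consistent with the observation."""
    room = action.split(":", 1)[1]
    return frozenset({room}) if obs else belief - {room}


def run_online(
    initial_belief: Iterable[State], true_state: State, agents: List[str]
) -> Tuple[FrozenSet[State], List[Tuple[str, str, bool]]]:
    """Run the cycle until the belief contains a single state."""
    belief = frozenset(initial_belief)
    trace: List[Tuple[str, str, bool]] = []
    episode = 0
    while len(belief) > 1:
        planner = agents[episode % len(agents)]  # assumption: round-robin planning agent
        action = plan(belief)[0]                 # execute only the first planned action
        obs = observe(action, true_state)        # the observation depends on the true hidden state
        belief = update(belief, action, obs)     # track the belief, then replan
        trace.append((planner, action, obs))
        episode += 1
    return belief, trace


if __name__ == "__main__":
    final_belief, trace = run_online({"a", "b", "c"}, "c", ["A", "B"])
    print(sorted(final_belief), trace)
```

The point of the sketch is only the shape of the loop: the true state is never read by the planner, and it influences the run solely through the observations that prune the belief.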

[1] van der Hoek, et al. Semantic Results for Ontic and Epistemic Change, 2008.

[2] Andreas Witzel, et al. DEL Planning and Some Tractable Cases, 2011, LORI.

[3] Bernhard Nebel, et al. Cooperative Epistemic Multi-Agent Planning for Implicit Coordination, 2017, M4M@ICLA.

[4] Hector Geffner, et al. Beliefs In Multiagent Planning: From One Agent to Many, 2015, ICAPS.

[5] Malte Helmert, et al. The Fast Downward Planning System, 2006, J. Artif. Intell. Res.

[6] Blai Bonet, et al. Flexible and Scalable Partially Observable Planning with Linear Translations, 2014, AAAI.

[7] Hector Geffner, et al. A Translation-Based Approach to Contingent Planning, 2009, IJCAI.

[8] Christian J. Muise, et al. Planning Over Multi-Agent Epistemic States: A Classical Planning Approach, 2015, AAAI.

[9] François Schwarzentruber, et al. Complexity Results in Epistemic Planning, 2015, IJCAI.

[10] Guy Shani, et al. A Multi-Path Compilation Approach to Contingent Planning, 2012, AAAI.

[11] Johan van Benthem. Logical Dynamics of Information and Interaction: Preface, 2011.

[12] Tran Cao Son, et al. An Action Language for Reasoning about Beliefs in Multi-Agent Domains, 2012.

[13] Blai Bonet, et al. Planning with Incomplete Information as Heuristic Search in Belief Space, 2000, AIPS.

[14] Thomas Bolander, et al. Undecidability in Epistemic Planning, 2013, IJCAI.

[15] Michael Brenner. Creating Dynamic Story Plots with Continual Multiagent Planning, 2010, AAAI.

[16] Guy Shani, et al. Replanning in Domains with Partial Information and Sensing Actions, 2011, IJCAI.

[17] Guy Shani, et al. Qualitative Planning under Partial Observability in Multi-Agent Domains, 2013, AAAI.

[18] François Schwarzentruber, et al. On the Impact of Modal Depth in Epistemic Planning, 2016, IJCAI.

[19] Ronald Fagin, et al. Reasoning about knowledge, 1995.

[20] Jussi Rintanen, et al. Complexity of Planning with Partial Observability, 2004, ICAPS.

[21] Martin C. Cooper, et al. A simple account of multiagent epistemic planning, 2015.