Continual processing of situated dialogue in human-robot collaborative activities

This paper presents an implemented approach of processing situated dialogue between a human and a robot. The focus is on task-oriented dialogue, set in the larger context of human-robot collaborative activity. The approach models understanding and production of dialogue to include intension (what is being talked about), intention (the goal of why something is being said), and attention (what is being focused on). These dimensions are directly construed in terms of assumptions and assertions on situated multi-agent belief models. The approach is continual in that it allows for interpretations to be dynamically retracted, revised, or deferred. This makes it possible to deal with the inherent asymmetry in how robots and humans tend to understand dialogue, and the world in which it is set. The approach has been fully implemented, and integrated into a cognitive robot. The paper discusses the implementation, and illustrates it in a collaborative learning setting.

[1]  Laura Giordano,et al.  A Modal Extension of Logic Programming: Modularity, Beliefs and Hypothetical Reasoning , 1998, J. Log. Comput..

[2]  David DeVault,et al.  Enlightened Update: A Computational Architecture for Presupposition and Other Pragmatic Phenomena , 2006 .

[3]  Danijel Skocaj,et al.  A computer vision integration model for a multi-modal cognitive system , 2009, 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[4]  Ronald Fagin,et al.  Reasoning about knowledge , 1995 .

[5]  Mark E. Stickel,et al.  A prolog-like inference system for computing minimum-cost abductive explanations in natural-language interpretation , 1991, Annals of Mathematics and Artificial Intelligence.

[6]  Matthew Stone,et al.  Coordinating Understanding and Generation in an Abductive Approach to Interpretation , 2002 .

[7]  M. Pickering,et al.  Toward a mechanistic psychology of dialogue , 2004, Behavioral and Brain Sciences.

[8]  Nick Hawes,et al.  Engineering intelligent information-processing systems with CAST , 2010, Adv. Eng. Informatics.

[9]  Bernhard Nebel,et al.  Continual planning and acting in dynamic multiagent environments , 2006 .

[10]  Milica Gasic,et al.  The Hidden Information State model: A practical framework for POMDP-based spoken dialogue management , 2010, Comput. Speech Lang..

[11]  Candace L. Sidner,et al.  Discourse structure and intention recognition. , 2000 .

[12]  Laura Giordano,et al.  A Modal Programming Language for Representing Complex Actions , 1998 .

[13]  Matthew Stone,et al.  Context in Abductive Interpretation , 2002 .

[14]  Michael Brenner,et al.  Phrasing Questions , 2009, AAAI Spring Symposium: Agents that Learn from Human Teachers.

[15]  Nick Hawes,et al.  Crossmodal content binding in information-processing architectures , 2008, 2008 3rd ACM/IEEE International Conference on Human-Robot Interaction (HRI).

[16]  Siobhan Chapman Logic and Conversation , 2005 .

[17]  Candace L. Sidner,et al.  Attention, Intentions, and the Structure of Discourse , 1986, CL.