Mediating between Qualitative and Quantitative Representations for Task-Orientated Human-Robot Interaction

In human-robot interaction (HRI) it is essential that the robot interprets and reacts to a human's utterances in a manner that reflects their intended meaning. In this paper we present a collection of novel techniques that allow a robot to interpret and execute spoken commands describing manipulation goals involving qualitative spatial constraints (e.g. "put the red ball near the blue cube"). The resulting implemented system integrates computer vision, potential field models of spatial relationships, and action planning to mediate between the continuous real world, and discrete, qualitative representations used for symbolic reasoning.

[1]  John D. Kelleher,et al.  Proximity in Context: An Empirically Grounded Computational Model of Proximity for Processing Topological Spatial Expressions , 2006, ACL.

[2]  N. Lesh,et al.  The Role of Dialog in Human Robot Interaction , 2003 .

[3]  Patrick Olivier,et al.  Quantitative perceptual representation of prepositional semantics , 2004, Artificial Intelligence Review.

[4]  Maria Fox,et al.  PDDL2.1: An Extension to PDDL for Expressing Temporal Planning Domains , 2003, J. Artif. Intell. Res..

[5]  Bernhard Nebel,et al.  The FF Planning System: Fast Plan Generation Through Heuristic Search , 2011, J. Artif. Intell. Res..

[6]  Gordon D. Logan,et al.  A computational analysis of the apprehension of spatial relations , 1996 .

[7]  Laura A. Carlson,et al.  Grounding spatial language in perception: an empirical and computational investigation. , 2001, Journal of experimental psychology. General.

[8]  James F. Allen,et al.  Toward Conversational Human-Computer Interaction , 2001, AI Mag..

[9]  Helge J. Ritter,et al.  Multi-modal human-machine communication for instructing robot grasping tasks , 2002, IEEE/RSJ International Conference on Intelligent Robots and Systems.

[10]  Deb Roy,et al.  Grounded Situation Models for Robots: Where words and percepts meet , 2006, 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[11]  John D. Kelleher,et al.  Information Fusion for Visual Reference Resolution in Dynamic Situated Dialogue , 2006, PIT.

[12]  James F. Allen,et al.  Towards Conversational Human-Computer Interaction , 2000 .

[13]  John D. Kelleher,et al.  Spatial Prepositions in Context: The Semantics of near in the Presence of Distractor Objects , 2006, ACL 2006.

[14]  Michael Brenner,et al.  Planning for Multiagent Environments From Individual Perceptions to Coordinated Execution , 2005 .