Situated Dialogue and Spatial Organization: What, Where… and Why?

The paper presents an HRI architecture for human-augmented mapping, which has been implemented and tested on an autonomous mobile robotic platform. Through interaction with a human, the robot can augment its autonomously acquired metric map with qualitative information about locations and objects in the environment. The system implements various interaction strategies observed in independently performed Wizard-of-Oz studies. The paper discusses an ontology-based approach to multi-layered conceptual spatial mapping that provides a common ground for human-robot dialogue. This is achieved by combining acquired knowledge with innate conceptual commonsense knowledge in order to infer new knowledge. The architecture bridges the gap between the rich semantic representations of the meaning expressed by verbal utterances on the one hand and the robot's internal sensor-based world representation on the other. It is thus possible to establish references to spatial areas in a situated dialogue between a human and a robot about their environment. The resulting conceptual descriptions represent qualitative knowledge about locations in the environment that can serve as a basis for achieving a notion of situational awareness.

[1]  Wolfram Burgard,et al.  People Tracking with Mobile Robots Using Sample-Based Joint Probabilistic Data Association Filters , 2003, Int. J. Robotics Res..

[2]  Michael A. Goodrich,et al.  Experiments in adjustable autonomy , 2001, 2001 IEEE International Conference on Systems, Man and Cybernetics. e-Systems and e-Man for Cybernetics in Cyberspace (Cat.No.01CH37236).

[3]  Alex Lascarides,et al.  Logics of Conversation , 2005, Studies in natural language processing.

[4]  Geert-Jan M. Kruijff,et al.  Service Robots Dealing with Indirect Speech Acts , 2006, 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[5]  Benjamin Kuipers,et al.  The Spatial Semantic Hierarchy , 2000, Artif. Intell..

[6]  Reinhard Muskens Chapter 10 – Dynamics , 1997 .

[7]  John D. Kelleher,et al.  Information Fusion for Visual Reference Resolution in Dynamic Situated Dialogue , 2006, PIT.

[8]  Michael White,et al.  Efficient Realization of Coordinate Structures in Combinatory Categorial Grammar , 2006 .

[9]  Wolfram Burgard,et al.  Supervised Learning of Places from Range Data using AdaBoost , 2005, Proceedings of the 2005 IEEE International Conference on Robotics and Automation.

[10]  John D. Kelleher,et al.  A Context-Dependent Model of Proximity in Physically Situated Environments , 2005 .

[11]  John R. Searle,et al.  Speech Acts: An Essay in the Philosophy of Language , 1970 .

[12]  Danica Kragic,et al.  Integrating SLAM and Object Detection for Service Robot Tasks , 2005 .

[13]  Henrik I. Christensen,et al.  Clarification dialogues in human-augmented mapping , 2006, HRI '06.

[14]  Albert C. Stevens,et al.  Distortions in judged spatial relations , 1978, Cognitive Psychology.

[15]  Andrew G. Clark,et al.  Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics (ACL) , 2002 .

[16]  Jason Baldridge,et al.  Coupling CCG and Hybrid Logic Dependency Semantics , 2002, ACL.

[17]  Johan Bos,et al.  Meaningful Conversation with a Mobile Robot , 2003, EACL.

[18]  Anders Green,et al.  Social and collaborative aspects of interaction with a service robot , 2003, Robotics Auton. Syst..

[19]  M. Steedman,et al.  Chapter 5 Combinatory Categorial Grammar ( Draft 4 . 0 , August 10 , 2003 ) , 2006 .

[20]  John J. Leonard,et al.  Explore and return: experimental validation of real-time concurrent mapping and localization , 2002, Proceedings 2002 IEEE International Conference on Robotics and Automation (Cat. No.02CH37292).

[21]  Thomas C. Ormerod,et al.  Understanding interfaces - a handbook of human-computer dialogue , 1994, Computers and people series.

[22]  Bernt Schiele,et al.  Integrating representative and discriminant models for object category detection , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[23]  Frederik W. Heger,et al.  RESULTS IN SLIDING AUTONOMY FOR MULTI-ROBOT SPATIAL ASSEMBLY , 2005 .

[24]  Jason Baldridge,et al.  Multi-Modal Combinatory Categorial Grammar , 2003, EACL.

[25]  David G. Lowe,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004, International Journal of Computer Vision.

[26]  Thomas J. Barter,et al.  The box , 1997, SIGGRAPH '97.

[27]  Claudia Maienborn,et al.  Position und Bewegung: Zur Semantik lokaler Verben , 1990, IWBS Report.

[28]  Illah R. Nourbakhsh,et al.  A survey of socially interactive robots , 2003, Robotics Auton. Syst..

[29]  Helmut Feldweg,et al.  GermaNet - a Lexical-Semantic Net for German , 1997 .

[30]  Thora Tenbrink,et al.  Telling Rolland Where to Go: HRI Dialogues on Route Navigation , 2009, Spatial Language and Dialogue.

[31]  T. McNamara Mental representations of spatial relations , 1986, Cognitive Psychology.

[32]  Iryna Gurevych,et al.  Less is More: Using a single knowledge representation in dialogue systems , 2003, HLT-NAACL 2003.

[33]  John D. Kelleher,et al.  Structural descriptions in human-assisted robot visual learning , 2006, HRI '06.

[34]  J. Jonides,et al.  Evidence of hierarchies in cognitive maps , 1985, Memory & cognition.

[35]  Henrik I. Christensen,et al.  Tracking for following and passing persons , 2005, 2005 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[36]  Mark G. Core,et al.  Coding Dialogs with the DAMSL Annotation Scheme , 1997 .

[37]  Claudia Casadio Recensione a: J. van Benthem, A. ter Meulen (eds.), Handbook of Logic and Language , 1997 .

[38]  Henrik I. Christensen,et al.  Acquiring a shared environment representation , 2006, HRI '06.

[39]  Miles C. Walton,et al.  Collaborative Tools for Mixed Teams of Humans and Robots , 2003 .

[40]  Adam Cheyer,et al.  The Open Agent Architecture , 1997, Autonomous Agents and Multi-Agent Systems.

[41]  Claudia Greenwood I am Sorry , 1980 .

[42]  Leila Bowman,et al.  in the office , 1961 .

[43]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[44]  Henrik I. Christensen,et al.  Vision SLAM in the Measurement Subspace , 2005, Proceedings of the 2005 IEEE International Conference on Robotics and Automation.