Intuitive linguistic Joint Object Reference in Human-Robot Interaction: Human Spatial Reference Systems and Function-Based Categorization for Symbol Grounding

The visionary goal of an easy to use service robot implies intuitive styles of interaction between humans and robots. Such natural interaction can only be achieved if means are found to bridge the gap between the forms of object perception and spatial knowledge maintained by such robots, and the forms of language, used by humans, to communicate such knowledge. Part of bridging this gap consists of allowing user and robot to establish joint reference on objects in the environment - without forcing the user to use unnatural means for object reference. We present an approach to establishing joint object reference which makes use of natural object classification and a computational model of basic intrinsic and relative reference systems. Our object recognition approach assigns natural categories (e.g. "desk", "chair", "table") to new objects based on their functional design. With basic objects within the environment classified, we can then make use of a computational reference model, to process natural projective relations (e.g. "the briefcase to the left of the chair"), allowing users to refer to objects which cannot be classified reliably by the recognition system alone.

[1]  R. Moratz,et al.  Instruction modes for joint spatial reference between naive users and a mobile robot , 2003, IEEE International Conference on Robotics, Intelligent Systems and Signal Processing, 2003. Proceedings. 2003.

[2]  Christian Freksa Spatial Cognition: An AI Perspective , 2004, ECAI.

[3]  Annette Herskovits,et al.  Language and spatial cognition , 1986 .

[4]  Stevan Harnad The Symbol Grounding Problem , 1999, ArXiv.

[5]  Reinhard Moratz,et al.  Spatial Reference in Linguistic Human-Robot Interaction: Iterative, Empirically Supported Development of a Model of Projective Relations , 2006, Spatial Cogn. Comput..

[6]  Reinhard Moratz,et al.  Automatic object recognition within an of .ce environment , 2004, First Canadian Conference on Computer and Robot Vision, 2004. Proceedings..

[7]  S. Levinson Frames of reference and Molyneux's question: Cross-linguistic evidence , 1996 .

[8]  K. Popper,et al.  Conjectures and Refutations , 1963 .

[9]  Jianwei Zhang,et al.  A two-arm situated artificial communicator for human-robot cooperative assembly , 2003, IEEE Trans. Ind. Electron..

[10]  Sebastian Thrun,et al.  Toward a Framework for Human-Robot Interaction , 2004, Hum. Comput. Interact..

[11]  James M. Keller,et al.  Generating Multi-Level Linguistic Spatial Descriptions from Range Sensor Readings Using the Histogram of Forces , 2003, Auton. Robots.

[12]  Kerstin Fischer,et al.  Cognitive Modeling of Spatial Reference for Human-Robot Interaction , 2001, Int. J. Artif. Intell. Tools.

[13]  S. Harnad Grounding Symbolic Capacity in Robotic Capacity , 1995 .