Strategies for Human-Driven Robot Comprehension of Spatial Descriptions by Older Adults in a Robot Fetch Task

This contribution presents a corpus of spatial descriptions and describes the development of a human-driven spatial language robot system for their comprehension. The domain of application is an eldercare setting in which an assistive robot is asked to "fetch" an object for an elderly resident based on a natural language spatial description given by the resident. In Part One, we describe a corpus of naturally occurring descriptions elicited from a group of older adults within a virtual 3D home that simulates the eldercare setting. We contrast descriptions elicited when participants offered descriptions to a human versus robot avatar, and under instructions to tell the addressee how to find the target versus where the target is. We summarize the key features of the spatial descriptions, including their dynamic versus static nature and the perspective adopted by the speaker. In Part Two, we discuss critical cognitive and perceptual processing capabilities necessary for the robot to establish a common ground with the human user and perform the "fetch" task. Based on the collected corpus, we focus here on resolving the perspective ambiguity and recognizing furniture items used as landmarks in the descriptions. Taken together, the work presented here offers the key building blocks of a robust system that takes as input natural spatial language descriptions and produces commands that drive the robot to successfully fetch objects within our eldercare scenario.

[1]  Dieter Fox,et al.  Following directions using statistical machine translation , 2010, HRI 2010.

[2]  Deb Roy,et al.  Interpretation of Spatial Language in a Map Navigation Task , 2007, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[3]  Jonathan T. Barron,et al.  A category-level 3-D object dataset: Putting the Kinect to work , 2011, ICCV Workshops.

[4]  B. Landau,et al.  “What” and “where” in spatial language and spatial cognition , 1993 .

[5]  D. Roy Grounding words in perception and action: computational insights , 2005, Trends in Cognitive Sciences.

[6]  Jodi Forlizzi,et al.  All robots are not created equal: the design and perception of humanoid robot heads , 2002, DIS '02.

[7]  J. Raven,et al.  Manual for Raven's progressive matrices and vocabulary scales , 1962 .

[8]  Andrea Lockerd Thomaz,et al.  Perspective Taking: An Organizing Principle for Learning in Human-Robot Interaction , 2006, AAAI.

[9]  付伶俐 打磨Using Language,倡导新理念 , 2014 .

[10]  A. D. Fisk,et al.  Is a robot an appliance, teammate, or friend? age-related differences in expectations of and attitudes towards personal home-based robots , 2008 .

[11]  Leonard Talmy,et al.  How Language Structures Space , 1983 .

[12]  Morgan Quigley,et al.  ROS: an open-source Robot Operating System , 2009, ICRA 2009.

[13]  Marjorie Skubic,et al.  Modeling spatial referencing language for human-robot interaction , 2005, ROMAN 2005. IEEE International Workshop on Robot and Human Interactive Communication, 2005..

[14]  J. Gregory Trafton,et al.  Enabling effective human-robot interaction using perspective-taking in robots , 2005, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[15]  M. F. Schober Spatial perspective-taking in conversation , 1993, Cognition.

[16]  S. Levinson Frames of reference and Molyneux's question: Cross-linguistic evidence , 1996 .

[17]  Rosemarie E. Yagoda,et al.  Location Label Speech Options Improve Robot Operator Performance , 2011 .

[18]  Daniel R. Montello,et al.  Linguistic and Nonlinguistic Turn Direction Concepts , 2007, COSIT.

[19]  H. H. Clark,et al.  Common ground at the understanding of demonstrative reference , 1983 .

[20]  S. Borson,et al.  The Mini‐Cog: a cognitive ‘vital signs’ measure for dementia screening in multi‐lingual elderly , 2000, International journal of geriatric psychiatry.

[21]  Dieter Fox,et al.  Sparse distance learning for object recognition combining RGB and depth information , 2011, 2011 IEEE International Conference on Robotics and Automation.

[22]  Brady T. West,et al.  Working Memory, Cues, and Wayfinding in Older Women , 2009 .

[23]  Benjamin Kuipers,et al.  Integrating Vision and Spatial Reasoning for Assistive Navigation , 1998, Assistive Technology and Artificial Intelligence.

[24]  Stefanie Tellex,et al.  Spatial routines for a simulated speech-controlled vehicle , 2006, HRI '06.

[25]  Matti Pietikäinen,et al.  A comparative study of texture measures with classification based on featured distributions , 1996, Pattern Recognit..

[26]  Michael Bader,et al.  The HCRC Map Task Corpus: A Natural Spoken Dialogue Corpus , 1993 .

[27]  Vladimir G. Kim,et al.  Shape-based recognition of 3D point clouds in urban environments , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[28]  Willem J. M. Levelt,et al.  Perspective taking and ellipsis in spatial descriptions , 1996 .

[29]  Marjorie Skubic,et al.  Testing an assistive fetch robot with spatial language from older and younger adults , 2013, 2013 IEEE RO-MAN.

[30]  Kerstin Fischer,et al.  Spatial Strategies in Human-Robot Communication , 2002, Künstliche Intell..

[31]  Jodie M. Plumert,et al.  The Content and Organization of Communication about Object Locations , 1995 .

[32]  Luc Steels,et al.  Language games for autonomous robots , 2001 .

[33]  Raymond J. Mooney,et al.  Learning to Interpret Natural Language Navigation Instructions from Observations , 2011, Proceedings of the AAAI Conference on Artificial Intelligence.

[34]  Barbara Tversky,et al.  Descriptions of Simple Spatial Scenes in English and Japanese , 2003, Spatial Cogn. Comput..

[35]  Ferdinando Fornara,et al.  Robots in a domestic setting: a psychological approach , 2005, Universal Access in the Information Society.

[36]  Klaus Stein,et al.  Coarse Qualitative Descriptions in Robot Navigation , 2000, Spatial Cognition.

[37]  Benjamin Kuipers,et al.  The Spatial Semantic Hierarchy , 2000, Artif. Intell..

[38]  Marjorie Skubic,et al.  Assessing the effectiveness of older adults' spatial descriptions in a fetch task , 2013, CogSci.

[39]  Jenay M. Beer,et al.  The domesticated robot: Design guidelines for assisting older adults to age in place , 2012, 2012 7th ACM/IEEE International Conference on Human-Robot Interaction (HRI).