Pointing to space: Modeling of deictic interaction referring to regions

In daily conversation, people sometimes engage in deictic interaction that refers to a region in space, such as saying "please put it over there" while pointing. How can a robot take part in such an interaction? Is it enough to simulate people's behaviors, such as utterances and pointing? We argue that it is not, and instead highlight the importance of simulating human cognition. In the first part of our study, we empirically demonstrate the importance of simulating human cognition of regions when a robot engages in deictic interaction that refers to a region in space. The experiments indicate that a robot with simulated cognition of regions improves the efficiency of its deictic interaction. In the second part, we present a method by which a robot can computationally simulate cognition of regions.
