论文信息 - Turn-taking, feedback and joint attention in situated human-robot interaction

Turn-taking, feedback and joint attention in situated human-robot interaction

Abstract In this paper, we present a study where a robot instructs a human on how to draw a route on a map. The human and robot are seated face-to-face with the map placed on the table between them. The user’s and the robot’s gaze can thus serve several simultaneous functions: as cues to joint attention, turn-taking, level of understanding and task progression. We have compared this face-to-face setting with a setting where the robot employs a random gaze behaviour, as well as a voice-only setting where the robot is hidden behind a paper board. In addition to this, we have also manipulated turn-taking cues such as completeness and filled pauses in the robot’s speech. By analysing the participants’ subjective rating, task completion, verbal responses, gaze behaviour, and drawing activity, we show that the users indeed benefit from the robot’s gaze when talking about landmarks, and that the robot’s verbal and gaze behaviour has a strong effect on the users’ turn-taking behaviour. We also present an analysis of the users’ gaze and lexical and prosodic realisation of feedback after the robot instructions, and show that these cues reveal whether the user has yet executed the previous instruction, as well as the user’s level of uncertainty.

[1] Jens Edlund,et al. The Effect of Prosodic Features on the Interpretation of Synthesised Backchannels , 2006, PIT.

[2] J. Bavelas,et al. Listener Responses as a Collaborative Process: The Role of Gaze , 2002 .

[3] H. H. Clark,et al. Understanding by addressees and overhearers , 1989, Cognitive Psychology.

[4] Julia Hirschberg,et al. Turn-taking cues in task-oriented dialogue , 2011, Comput. Speech Lang..

[5] Paul Boersma,et al. Praat, a system for doing phonetics by computer , 2002 .

[6] Gabriel Skantze,et al. Head Pose Patterns in Multiparty Human-Robot Team-Building Interactions , 2013, ICSR.

[7] Nigel Ward,et al. A study in responsiveness in spoken dialog , 2003, Int. J. Hum. Comput. Stud..

[8] Jonas Beskow,et al. Wavesurfer - an open source speech tool , 2000, INTERSPEECH.

[9] Louis-Philippe Morency,et al. A probabilistic multimodal approach for predicting listener backchannels , 2009, Autonomous Agents and Multi-Agent Systems.

[10] Anna Hjalmarsson,et al. The additive effect of turn-taking cues in human and synthetic voice , 2011, Speech Commun..

[11] J. R. Landis,et al. The measurement of observer agreement for categorical data. , 1977, Biometrics.

[12] Joakim Gustafson,et al. Walk This Way: Spatial Grounding for City Exploration , 2014, Natural Interaction with Robots, Knowbots and Smartphones, Putting Spoken Dialog Systems into Practice.

[13] Justus J. Randolph. Free-Marginal Multirater Kappa (multirater K[free]): An Alternative to Fleiss' Fixed-Marginal Multirater Kappa. , 2005 .

[14] Gabriel Skantze,et al. A General, Abstract Model of Incremental Dialogue Processing , 2009, EACL.

[15] Gabriel Skantze,et al. Exploring the effects of gaze and pauses in situated human-robot interaction , 2013, SIGDIAL Conference.

[16] Stefan Kopp,et al. Combining Incremental Language Generation and Incremental Speech Synthesis for Adaptive Information Presentation , 2012, SIGDIAL Conference.

[17] B. Velichkovsky. Communicating attention: Gaze position transfer in cooperative problem solving , 1995 .

[18] Gabriel Skantze,et al. Attention and Interaction Control in a Human-Human-Computer Dialogue Setting , 2009, SIGDIAL Conference.

[19] Candace L. Sidner,et al. Attention, Intentions, and the Structure of Discourse , 1986, CL.

[20] Joakim Gustafson,et al. Cues to perceived functions of acted and spontaneous feedback expressions , 2012 .

[21] A. Anderson,et al. The Effects of Visibility on Dialogue and Performance in a Cooperative Problem Solving Task , 1994 .

[22] Catharine Oertel,et al. Gaze direction as a Back-Channel inviting Cue in Dialogue , 2012 .

[23] E. Schegloff,et al. A simplest systematics for the organization of turn-taking for conversation , 1974 .

[24] Yukiko I. Nakano,et al. Towards a Model of Face-to-Face Grounding , 2003, ACL.