The vernissage corpus: A conversational Human-Robot-Interaction dataset

We introduce a new conversational Human-Robot-Interaction (HRI) dataset with a real-behaving robot inducing interactive behavior with and between humans. Our scenario involves a humanoid robot NAO1 explaining paintings in a room and then quizzing the participants, who are naive users. As perceiving nonverbal cues, apart from the spoken words, plays a major role in social interactions and socially-interactive robots, we have extensively annotated the dataset. It has been recorded and annotated to benchmark many relevant perceptual tasks, towards enabling a robot to converse with multiple humans, such as speaker localization and speech segmentation; tracking, pose estimation, nodding, visual focus of attention estimation in visual domain; and an audio-visual task such as addressee detection. NAO system states are also available. As compared to recordings done with a static camera, this corpus involves the head-movement of a humanoid robot (due to gaze change, nodding), posing challenges to visual processing. Also, the significant background noise present in a real HRI setting makes auditory tasks challenging.

[1]  Ben J. A. Kröse,et al.  From sensors to human spatial concepts , 2007, Robotics Auton. Syst..

[2]  Radu Horaud,et al.  The Ravel data set , 2011 .

[3]  Illah R. Nourbakhsh,et al.  A survey of socially interactive robots , 2003, Robotics Auton. Syst..

[4]  Sebastian Wrede,et al.  Attitude of German museum visitors towards an interactive art guide robot , 2011, 2011 6th ACM/IEEE International Conference on Human-Robot Interaction (HRI).

[5]  Sebastian Wrede,et al.  A Framework for the Acquisition of Multimodal Human-Robot Interaction Data Sets with a Whole-System Perspective , 2012 .

[6]  Jean-Marc Odobez,et al.  THE VERNISSAGE CORPUS: A MULTIMODAL HUMAN-ROBOT-INTERACTION DATASET , 2012 .

[7]  Katharina J. Rohlfing,et al.  Systemic Interaction Analysis (SInA) in HRI , 2009, 2009 4th ACM/IEEE International Conference on Human-Robot Interaction (HRI).

[8]  Jon Barker,et al.  The CAVA corpus: synchronised stereoscopic and binaural datasets with head movements , 2008, ICMI '08.

[9]  Yasser F. O. Mohammad,et al.  The H3R Explanation Corpus human-human and base human-robot interaction dataset , 2008, 2008 International Conference on Intelligent Sensors, Sensor Networks and Information Processing.