论文信息 - Gestures or speech? Comparing modality selection for different interaction tasks in a virtual environment

Gestures or speech? Comparing modality selection for different interaction tasks in a virtual environment

In this paper, we investigate whether users prefer speech or gesture input for four distinct interaction tasks commonly found in virtual environments: navigation, selection, dialogue, and object manipulation. For this purpose, we implemented an interactive storytelling scenario in which the users could always choose between gesture and speech commands for each interaction. Both input modalities were processed in real-time using a low-cost depth sensor and microphone. We conducted a study in order to identify the modality preferences for each task. We got strong results for the navigational task, for which gestural interaction seemed to be more suitable, and for the dialogue task which was in favour of speech. For the object manipulation and selection tasks we did not observe a clear preference for one of the modalities, but we found indications for why some participants chose speech and others preferred gestures by analysing the participants’ ratings of their experience with the interaction.

Kathrin Janowski | Kathrin Janowski

[1] Philip R. Cohen,et al. On the Relationships Among Speech, Gestures, and Object Manipulation in Virtual Environments: Initial Evidence , 2005 .

[2] Emiel Krahmer,et al. Generating Multimodal References , 2007 .

[3] Joseph J. LaViola,et al. Hands-free multi-scale navigation in virtual environments , 2001, I3D '01.

[4] Elisabeth André,et al. Full Body Gestures Enhancing a Game Book for Interactive Story Telling , 2011, ICIDS.

[5] Ionut Damian,et al. Natural interaction with culturally adaptive virtual characters , 2012, Journal on Multimodal User Interfaces.

[6] Marc Cavazza,et al. Multimodal acting in mixed reality interactive storytelling , 2004, IEEE MultiMedia.

[7] Kazushi Nishimoto,et al. Design and evaluation of gesture interface of an immersive walk-through application for exploring cyberspace , 1998, Proceedings Third IEEE International Conference on Automatic Face and Gesture Recognition.

[8] Sharon L. Oviatt,et al. Multimodal Interaction for 2D and 3D Environments , 1999, IEEE Computer Graphics and Applications.