A novel method for multi-sensory data fusion in multimodal human-computer interaction

Multimodal User Interaction (MMUI) technology aims to build natural and intuitive interfaces that allow a user to interact with a computer in a way similar to human-to-human communication, for example, through speech and gestures. As a critical component of MMUI, Multimodal Input Fusion explores ways to derive an effective combined semantic interpretation of user inputs across multiple modalities. This paper presents a novel approach to multi-sensory data fusion based on speech and manual deictic gesture inputs. The effectiveness of the technique has been validated through experiments using a traffic incident management scenario, in which an operator interacts with a map on a large display at a distance and issues multimodal commands through speech and manual gestures. A description of the proposed approach and preliminary experimental results are presented.