Paper Information - On Representing Salience and Reference in Multimodal Human-Computer Interaction

We discuss ongoing work investigating how humans interact with multimodal systems, focusing on how successful reference to objects and events is accomplished. We describe an implemented multimodal travel guide application being employed in a set of Wizard of Oz experiments from which data about user interactions is gathered. We offer a preliminary analysis of the data which suggests that, as is evident in Huls et al.’s (1995) more extensive study, the interpretation of referring expressions can be accounted for by a rather simple set of rules which do not make reference to the type of referring expression used. As this result is perhaps unexpected in light of past linguistic research on reference, we suspect that this is not a general result, but instead a product of the simplicity of the tasks around which these multimodal systems have been developed. Thus, more complex systems capable of evoking richer sets of human language and gestural communication need to be developed before conclusions can be drawn about unified representations for salience and reference in multimodal settings.

[1] Ellen F. Prince, et al. Toward a taxonomy of given-new information, 1981.

[2] Candace L. Sidner, et al. Attention, Intentions, and the Structure of Discourse, 1986, CL.

[3] Hiyan Alshawi, et al. Memory and context for language interpretation, 1987.

[4] Jeanette K. Gundel, et al. Cognitive Status and the Form of Referring Expressions in Discourse, 1993.

[5] Carla Huls, et al. Automatic Referent Resolution of Deictic and Anaphoric Expressions, 1995, CL.

[6] Adam Cheyer, et al. Multimodal Maps: An Agent-Based Approach, 1995, Multimodal Human-Computer Communication.

[7] Sharon L. Oviatt, et al. Multimodal interfaces for dynamic interactive maps, 1996, CHI.

[8] Sharon Oviatt, et al. Integration and synchronization of input modes during multimodal human-computer interaction, 1997.

[9] Adam Cheyer, et al. Speech: a privileged modality, 1997, EUROSPEECH.

[10] Antonella De Angeli, et al. Integration and synchronization of input modes during multimodal human-computer interaction, 1997, CHI.

[11] Jean-Claude Martin, et al. A Theoretical Framework for Multimodal User Studies, 1998.

[12] M. Longair. The Theoretical Framework, 1998.

[13] Jean-Claude Martin, et al. A Unified Framework for Constructing Multimodal Experiments and Applications, 1998, Cooperative Multimodal Communication.

[14] Adam Cheyer, et al. The Open Agent Architecture, 1997, Autonomous Agents and Multi-Agent Systems.