Symmetric Multimodal Interaction in a Dynamic Dialogue

Two important themes in current work on interfaces are multimodal interaction and the use of dialogue. Human multimodal dialogues are symmetric, i.e., both participants communicate multimodally. We describe a proof-of-concept system that supports symmetric multimodal communication, using speech and sketching, in the domain of simple mechanical device design. We discuss three major aspects of the communication: multimodal input processing, multimodal output generation, and creating a dynamic dialogue. While previous systems have had some of these capabilities individually, their combination appears to be unique. We provide examples from our system that illustrate a variety of user inputs and system outputs.

Author Keywords: multimodal, dynamic dialogue, sketch recognition, sketch generation, speech
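To give a concrete flavor of the input-processing aspect, the sketch below groups speech and sketch events into joint multimodal utterances by temporal proximity. It is a minimal, hypothetical illustration under assumed interfaces, not the system's actual algorithm; the class and function names (SpeechEvent, SketchEvent, fuse_by_time) and the one-second grouping gap are invented for this example.

```python
from dataclasses import dataclass, field
from typing import List, Tuple

# Hypothetical event types for the two input modalities.
@dataclass
class SpeechEvent:
    start: float              # seconds
    end: float
    transcript: str

@dataclass
class SketchEvent:
    start: float
    end: float
    strokes: List[List[Tuple[float, float]]]   # each stroke is a list of (x, y) points

@dataclass
class MultimodalUtterance:
    speech: List[SpeechEvent] = field(default_factory=list)
    sketch: List[SketchEvent] = field(default_factory=list)

def fuse_by_time(speech, sketch, gap=1.0):
    """Group speech and sketch events that overlap in time, or fall within
    `gap` seconds of one another, into joint multimodal utterances.
    A simple temporal-proximity heuristic, used only for illustration."""
    events = sorted(
        [("speech", e) for e in speech] + [("sketch", e) for e in sketch],
        key=lambda pair: pair[1].start,
    )
    utterances, current, last_end = [], MultimodalUtterance(), None
    for kind, ev in events:
        if last_end is not None and ev.start - last_end > gap:
            utterances.append(current)            # gap exceeded: start a new utterance
            current = MultimodalUtterance()
        (current.speech if kind == "speech" else current.sketch).append(ev)
        last_end = ev.end if last_end is None else max(last_end, ev.end)
    if current.speech or current.sketch:
        utterances.append(current)
    return utterances

if __name__ == "__main__":
    speech = [SpeechEvent(0.2, 1.1, "draw a wheel here"),
              SpeechEvent(4.0, 4.8, "now connect it to the spring")]
    sketch = [SketchEvent(0.9, 1.6, [[(10, 10), (12, 14), (15, 11)]]),
              SketchEvent(4.5, 5.2, [[(30, 30), (40, 30)]])]
    for i, u in enumerate(fuse_by_time(speech, sketch)):
        print(f"utterance {i}: {len(u.speech)} speech event(s), {len(u.sketch)} sketch event(s)")
```

A full system would replace this proximity heuristic with alignment informed by the recognizers for each modality; the sketch only shows the shape of the fusion step.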
