Situated interaction with a virtual human - perception, action, and cognition

In Virtual Reality environments, real humans can meet virtual humans to collaborate on tasks. The agent Max is such a virtual human, providing the human user with a face-to-face collaboration partner in the SFB 360 construction tasks. This paper describes how Max can assist by combining manipulative capabilities for assembly actions with conversational capabilities for mixed-initiative dialogue. During the interaction, Max employs speech, gaze, facial expression, and gesture, and is able to initiate assembly actions. We present the underlying model of Max's competences for managing situated interactions, and we show how the required faculties of perception, action, and cognition are realized and connected in his architecture.
