Affective multimodal interaction with a 3D agent

The exchange of affect/emotions is continuous in human-human interaction and plays a key role in decision making. We propose a similar paradigm for human-computer interaction and describe a novel method for classifying a user's input to a computer system according to some very basic emotional attitudes. The paradigm is based on multiple communication channels including not only traditional media (eye tracker, gesture recognition, conventional "speech to text" recognition) but also signals communicated through extra-linguistic (or "non-textual") features of the oral mode (prosodic analysis). A strategy for fusion of concordant input is described, as we believe that affect is communicated through many channels simultaneously. Our experimental environment centres on an autonomous simple-minded dog-like 3D agent called Bouncy. Our preliminary evaluation of the agent paradigm furnishes good grounds for believing that the proposed fusion strategy can also be used to reveal very complex attitudes like irony and certain kinds of humour realised as a contradiction between input channels (e.g. between what the user says and how he says it).
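
To make the fusion idea concrete, the following is a minimal sketch of how concordant per-channel affect readings might be combined and a cross-channel contradiction flagged. The abstract does not specify the actual fusion rule, label set, or channel names, so the confidence-weighted vote, the three coarse labels, and the channel identifiers used here are all illustrative assumptions rather than the method described in the paper.

    from collections import Counter
    from dataclasses import dataclass

    # Hypothetical coarse affect labels; the paper's actual label set is not
    # given in this abstract.
    POSITIVE, NEGATIVE, NEUTRAL = "positive", "negative", "neutral"

    @dataclass
    class ChannelReading:
        channel: str       # e.g. "speech-text", "prosody", "gesture", "gaze"
        label: str         # coarse affect label inferred for this channel
        confidence: float  # classifier confidence in [0, 1]

    def fuse(readings: list[ChannelReading]) -> tuple[str, bool]:
        """Fuse concordant channel readings into one affect label.

        Returns (label, contradiction_flag). The flag is raised when the
        verbal content (speech-to-text) and the prosodic channel disagree,
        which the abstract suggests may signal irony or humour.
        """
        # One plausible fusion rule: a confidence-weighted vote across channels.
        votes: Counter[str] = Counter()
        for r in readings:
            votes[r.label] += r.confidence
        fused_label = votes.most_common(1)[0][0]

        # Contradiction check: what the user says vs. how they say it.
        by_channel = {r.channel: r.label for r in readings}
        contradiction = (
            "speech-text" in by_channel
            and "prosody" in by_channel
            and by_channel["speech-text"] != by_channel["prosody"]
        )
        return fused_label, contradiction

    if __name__ == "__main__":
        readings = [
            ChannelReading("speech-text", POSITIVE, 0.9),  # "What a great day..."
            ChannelReading("prosody", NEGATIVE, 0.8),      # ...said in a flat tone
            ChannelReading("gesture", NEGATIVE, 0.6),
        ]
        label, irony = fuse(readings)
        print(label, "possible irony" if irony else "concordant")

On the sample input, the verbal channel reports a positive attitude while prosody and gesture report a negative one, so the vote settles on "negative" and the contradiction flag is raised, mirroring the kind of says/how-it-is-said mismatch the abstract associates with irony.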