Inside out - Acoustic and visual aspects of verbal and non-verbal communication : Keynote Paper

In face-to-face communication, both visual and auditory information play an obvious and significant role. In this presentation we will discuss work done, primarily at KTH, that aims at analyzing and modelling verbal and non-verbal communication from a multi-modal perspective. In our studies, it appears that both segmental and prosodic phenomena are strongly affected by the communicative context of speech interaction. One platform for modelling audiovisual speech communication is the embodied conversational agent (ECA). We will describe how ECAs have been used in our research, including examples of applications and a series of experiments for studying multimodal aspects of speech communication.
