Difference in visual information between face to face and telephone dialogues

We analyzed conversations between a pair of subjects, under two conditions. One is face to face conversation that has visual contact, and the other is conversation through telephone lines that has not. From the recorded videotape we extracted the subject's actions especially focusing on the head movements. By comparing the dialogues under two conditions, it seems that there are two types of head movements, one is intended to give a response to his partner and the other is to send some signal. We analyze how the visual information contributes in spoken dialogue perception, and the possibility of adopting it in a multi-modal human interface.

[1]  Katsuhiko Shirai Modeling of spoken dialogue with and without visual information , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[2]  Keiko Watanuki,et al.  Some signals of emotional arousal: analysis of conversations using a multimodal interaction database , 1995, EUROSPEECH.

[3]  Katsuhiko Shirai,et al.  Analysis of head movements and its role in spoken dialogue , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.