Effects of delayed presentation of self-embodied avatar motion with network delay

A large network delay is likely to obstruct human interaction in telecommunication systems such as telephony or video conferencing systems. In spite of the extensive investigations that have been carried out on network delays of voice and image data, there have been few studies regarding support for embodied communication under the conditions of network delay. To maintain smooth human interaction, it is important that the various ways in which delay is manifested are understood. We have already developed an embodied virtual communication system that uses an avatar called “VirtualActor,” in which speakers who are remotely located from one another can share embodied interaction in the same virtual space. Responses to a questionnaire that was used in a communication experiment confirmed that a fixed 500-ms network delay has no effect on interactions via VirtualActors. In this paper, we propose a method of presenting a speaker's voice and an avatar's motion feedback in the case of a 1.5-s network delay using VirtualActors. We perform two communication experiments under different conditions of network delay. The aim of the first experiment is to examine the effect of a random time delay on the conversation. The second experiment is conducted under the conditions of a free-form conversation that takes place in 5 scenarios—1 real-time scenario without a network delay and 4 scenarios with network delay that involve a combination of a delay in the talker's voice and in his/her avatar's motion feedback. The subjects consisted of a total of 30 students who worked in 15 pairs and who were familiar with each other. A sensory evaluation shows the effects upon communication of delays in the avatar's motion feedback, from the viewpoint of supporting the interaction.

[1]  Tomio Watanabe,et al.  An Embodied Virtual Wave Communication System , 2000 .

[2]  Belén Carro,et al.  Multimedia Conference Services , 2002 .

[3]  Dae Hee Youn,et al.  An Acoustic Echo Cancellation Based on the Adaptive Lattice-Transversal Joint (LTJ) Filter Structure , 1998 .

[4]  Nobuhiko Kitawaki,et al.  Effects of transmission delay in audiovisual communication , 1994 .

[5]  G. Recanzone Auditory influences on visual temporal rate perception. , 2003, Journal of neurophysiology.

[6]  Karen Ruhleder,et al.  Co-Constructing Non-Mutual Realities: Delay-Generated Trouble in Distributed Interaction , 2004, Computer Supported Cooperative Work (CSCW).

[7]  Carl Gutwin,et al.  Revealing delay in collaborative environments , 2004, CHI.

[8]  Eberhard Hänsler,et al.  Hands-free telephones - joint control of echo cancellation and postfiltering , 2000, Signal Process..

[9]  Steve Benford,et al.  Coping with inconsistency due to network delays in collaborative virtual environments , 1999, VRST '99.

[10]  Tomio Watanabe,et al.  Embodied Interaction Analysis in which the Head Motion of Listener's VirtualActor is Inconsistently Stopped , 2002 .

[11]  Akira Ito,et al.  Human goal attribution toward behavior of artifacts , 2008, RO-MAN 2008 - The 17th IEEE International Symposium on Robot and Human Interactive Communication.

[12]  Steve Benford,et al.  A QoS architecture for collaborative virtual environments , 1999, MULTIMEDIA '99.

[13]  Gunnar Karlsson,et al.  Asynchronous transfer of video , 1996, IEEE Commun. Mag..

[14]  Kajal T. Claypool,et al.  Latency and player actions in online games , 2006, CACM.

[15]  Kyoung Shin Park,et al.  Effects of network characteristics on human performance in a collaborative virtual environment , 1999, Proceedings IEEE Virtual Reality (Cat. No. 99CB36316).