EyeGaze: enabling eye contact over video

Traditional video communication systems offer a very limited experience of eye contact due to the offset between cameras and the screen. In response, we present EyeGaze, which uses multiple Kinect cameras to generate a 3D model of the user, and then renders a virtual camera angle giving the user an experience of eye contact. As a novel approach, we use concepts from KinectFusion, such as a volumetric voxel data representation and GPU accelerated ray tracing for viewpoint rendering. This achieves detail from a noisy source, and allows the real-time video output to be a composite of old and new data. We frame our work in literature on eye contact and previous approaches to supporting it over video. We then describe EyeGaze, and an empirical study comparing it with communication face-to-face or over traditional video. The study shows that while face-to-face is still superior, EyeGaze has added value over traditional video in terms of eye contact, involvement, turn-taking and co-presence.

[1]  Peter-Pike J. Sloan,et al.  Interactive ray tracing for isosurface rendering , 1998, Proceedings Visualization '98 (Cat. No.98CB36276).

[2]  Rocco J. Perla,et al.  Ten Common Misunderstandings, Misconceptions, Persistent Myths and Urban Legends about Likert Scales and Likert Response Formats and their Antidotes , 2007 .

[3]  Marc Levoy,et al.  A volumetric method for building complex models from range images , 1996, SIGGRAPH.

[4]  C. Kleinke Gaze and eye contact: a research review. , 1986, Psychological bulletin.

[5]  Kenton O'Hara,et al.  BISi: a blended interaction space , 2011, CHI EA '11.

[6]  M. Foddy,et al.  Patterns of Gaze in Cooperative and Competitive Negotiation , 1978 .

[7]  Byung-Uk Lee,et al.  Viewpoint Interpolation Using an Ellipsoid Head Model for Video Teleconferencing , 2005, ISVC.

[8]  Mel Slater,et al.  The impact of eye gaze on communication using humanoid avatars , 2001, CHI.

[9]  A. Mehrabian,et al.  Nonverval concomitants of perceived and intended persuasiveness. , 1969, Journal of personality and social psychology.

[10]  S. Duncan,et al.  On signalling that it's your turn to speak☆ , 1974 .

[11]  R. Stokes Human Factors and Appearance Design Considerations of the Mod II PICTUREPHONE® Station Set , 1969 .

[12]  Kentaro Toyama,et al.  Gaze Awareness for Video-Conferencing: A Software Approach , 2000, IEEE Multim..

[13]  M. Patterson A sequential functional model of nonverbal exchange. , 1982 .

[14]  John F. Canny,et al.  MultiView: spatially faithful group video conferencing , 2005, CHI.

[15]  Henry Fuchs,et al.  Encumbrance-free telepresence system with real-time 3D capture and display using commodity depth cameras , 2011, 2011 10th IEEE International Symposium on Mixed and Augmented Reality.

[16]  A. Kendon Some functions of gaze-direction in social interaction. , 1967, Acta psychologica.

[17]  The functions of Looking: Effects of friendship on gaze , 1979 .

[18]  Markus H. Gross,et al.  Gaze Correction for Home Video Conferencing , 2012 .

[19]  Albert Mehrabian,et al.  Nonverbal betrayal of feeling. , 1971 .

[20]  John L. Arnott,et al.  Using spatial cues to improve videoconferencing , 1992, CHI.

[21]  Brian Sutton-Smith,et al.  Effects of Age, Sex, and Task on Visual Behavior during Dyadic Interaction. , 1973 .

[22]  J. Lochman,et al.  Nonverbal communication of couples in conflict. , 1981 .

[23]  Ruigang Yang,et al.  Eye gaze correction with stereovision for video-teleconferencing , 2002, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[24]  M. Argyle,et al.  The meaning of five patterns of gaze , 1974 .

[25]  Ingemar J. Cox,et al.  Teleconferencing eye contract using a virtual camera , 1993, INTERCHI Adjunct Proceedings.

[26]  R. Exline,et al.  Visual Behavior as an Aspect of Power Role Relationships , 1975 .

[27]  Andrew W. Fitzgibbon,et al.  KinectFusion: real-time 3D reconstruction and interaction using a moving depth camera , 2011, UIST.

[28]  Ruigang Yang,et al.  Eye contact in video conference via fusion of time-of-flight depth sensor and stereo , 2011 .

[29]  Kenton O'Hara,et al.  Blended interaction spaces for distributed team collaboration , 2011, TCHI.