Using talking heads for real-time virtual videophone in wireless networks

Low bandwidth makes real-time video over wireless and mobile networks expensive and limits transmission length. An architecture that replaces the video image of a person's face with a text-based facial vector description, transmitted over the network in tandem with a voice stream, requires minimal bandwidth and so enables real-time videophony over various wireless networks.
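To make the bandwidth argument concrete, the following is a minimal sketch of how a text-based facial vector could be serialized per frame and how its bit rate could be estimated. It is not the article's actual wire format: the function names, the `seq:v1,v2,...` line layout, the 8-bit quantization, and the assumption that each parameter lies in [0, 1] are all hypothetical choices made for illustration.

```python
from typing import List, Tuple


def encode_frame(seq: int, params: List[float]) -> str:
    """Serialize one frame of facial parameters as a short text line.

    Hypothetical format: "<sequence number>:<v1>,<v2>,...". Each parameter
    is assumed to lie in [0, 1] and is quantized to an integer in 0..255.
    """
    quantized = [max(0, min(255, round(p * 255))) for p in params]
    return f"{seq}:" + ",".join(str(q) for q in quantized)


def decode_frame(line: str) -> Tuple[int, List[float]]:
    """Parse a line produced by encode_frame back into (seq, parameters)."""
    seq_text, body = line.split(":", 1)
    values = [int(token) / 255 for token in body.split(",")]
    return int(seq_text), values


def bitrate_bps(line: str, frames_per_second: int) -> int:
    """Estimate bits per second when one such line is sent per frame.

    One extra byte per frame is counted for a newline delimiter.
    """
    bytes_per_frame = len(line.encode("ascii")) + 1
    return bytes_per_frame * 8 * frames_per_second


if __name__ == "__main__":
    # Illustrative values only: 12 parameters sent at 15 frames per second.
    line = encode_frame(42, [0.5] * 12)
    print(line)
    print(bitrate_bps(line, 15), "bits per second for the facial stream")
```

Under these assumptions the facial stream is a few dozen bytes per frame, and the receiver would use the decoded parameters to animate a locally rendered talking head while the voice stream is carried separately.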
