Compact real-time modeling of seated humans by video sprite sequence quantization

We propose an image-based method for real-time modeling of seated humans using upper-body video sprites, suitable for applications such as teleconferencing and distance learning. A database of representative video sprite sequences is acquired in advance and uploaded to each remote rendering site before the session. At run time, for each input sprite, a closely matching sprite is located in the database and only the index of the matching sprite is sent to the rendering site, which drastically reduces the data rate. Unlike other data compression methods, our method takes advantage of the limited number of significant body positions a participant assumes during a session. Exploiting redundancy between frames with distant time stamps enables aggressive compression while preserving high visual and semantic fidelity.
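The run-time step described in the abstract (find a close match in the database, transmit only its index) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the abstract does not say how sprites are represented or compared, so this sketch treats a sprite as a flat list of grayscale pixel values and uses a brute-force sum-of-squared-differences search. The names `Sprite`, `ssd`, `encode`, and `decode` are hypothetical and do not come from the paper.

```python
from typing import List, Sequence

# Hypothetical representation: a sprite flattened to grayscale pixel values.
Sprite = Sequence[int]


def ssd(a: Sprite, b: Sprite) -> int:
    """Sum of squared differences between two equally sized sprites.

    Assumed metric; the paper's actual matching criterion is not given here.
    """
    if len(a) != len(b):
        raise ValueError("sprites must have the same number of pixels")
    return sum((x - y) ** 2 for x, y in zip(a, b))


def encode(database: List[Sprite], sprite: Sprite) -> int:
    """Sender side: return the index of the closest database sprite."""
    if not database:
        raise ValueError("database is empty")
    return min(range(len(database)), key=lambda i: ssd(database[i], sprite))


def decode(database: List[Sprite], index: int) -> Sprite:
    """Rendering side: look the sprite up in the pre-uploaded database copy."""
    return database[index]


if __name__ == "__main__":
    # Toy database of three 4-pixel "sprites", shared by sender and renderer.
    database = [[0, 0, 0, 0], [10, 10, 10, 10], [200, 200, 200, 200]]
    index = encode(database, [12, 9, 11, 10])  # only this integer is transmitted
    print(index, decode(database, index))
```

In this sketch the sender transmits a single integer per frame instead of pixel data. A linear scan is used only for clarity; a real-time system would presumably need a faster search structure, which is outside what the abstract specifies.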
