Rate-Distortion Model Based Bit Allocation for 3-D Facial Compression Using Geometry Video

With the extensive applications of 3-D multimedia technology, 3-D content compression has been an important issue, which ensures its smooth transmission on the network with constrained bandwidth. In this letter, we propose a new compression framework for dynamic 3-D facial expressions. Taking advantage of the near-isometric property of human facial expressions, we parameterize the dynamic 3-D faces into an expression-invariant canonical domain, which naturally generates 2-D geometry videos and allows us to apply the well-studied video compression techniques. Due to the difference from natural videos, each dimension (i.e., X, Y and Z, respectively) of the geometry video is regarded as a video sequence and encoded separately. Meanwhile, a model-based joint bit allocation scheme is designed to allocate reasonable bitrate to each dimension by detailed analysis of rate-distortion model for geometry videos, to obtain optimal results under given target bitrate. Experimental results show that up to 25% improvement in terms of bitrate reduction can be achieved, compared to existing algorithms.

[1]  Ajay Luthra,et al.  Overview of the H.264/AVC video coding standard , 2003, IEEE Trans. Circuits Syst. Video Technol..

[2]  Ashraf A. Kassim,et al.  Registration and partitioning-based compression of 3-D dynamic data , 2003, IEEE Trans. Circuits Syst. Video Technol..

[3]  Steven J. Gortler,et al.  Geometry images , 2002, SIGGRAPH.

[4]  Libor V ´ ÿ Geometry-Driven Local Neighbourhood Based Predictors for Dynamic Mesh Compression , 2010 .

[5]  Jörn Ostermann,et al.  SPC: Fast and Efficient Scalable Predictive Coding of Animated Meshes , 2010, Comput. Graph. Forum.

[6]  Kiyoharu Aizawa,et al.  Time-Varying Mesh Compression Using an Extended Block Matching Algorithm , 2007, IEEE Transactions on Circuits and Systems for Video Technology.

[7]  Xiaoming Chen,et al.  Modeling and Compressing 3-D Facial Expressions Using Geometry Videos , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[8]  Xiaoming Chen,et al.  Modeling 3D facial expressions using geometry videos , 2010, ACM Multimedia.

[9]  Wen Gao,et al.  Rate-distortion analysis for H.264/AVC video coding and its application to rate control , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[10]  Kiyoharu Aizawa,et al.  Patch-based compression for Time-Varying Meshes , 2010, 2010 IEEE International Conference on Image Processing.

[11]  Ventseslav Sainov,et al.  3-D Time-Varying Scene Capture Technologies—A Survey , 2007, IEEE Transactions on Circuits and Systems for Video Technology.

[12]  Toby Berger,et al.  Rate distortion theory : a mathematical basis for data compression , 1971 .

[13]  Tihao Chiang,et al.  Scalable rate control for MPEG-4 video , 2000, IEEE Trans. Circuits Syst. Video Technol..

[14]  Peter Eisert,et al.  Rate-distortion-optimized predictive compression of dynamic 3D mesh sequences , 2006, Signal Process. Image Commun..

[15]  Gary J. Sullivan,et al.  Rate-constrained coder control and comparison of video coding standards , 2003, IEEE Trans. Circuits Syst. Video Technol..

[16]  J. C. De Martin,et al.  PSNR control for GOP-level constant quality in H.264 video coding , 2005 .

[17]  Zhengguo Li,et al.  A Novel Rate Control Scheme for Low Delay Video Communication of H.264/AVC Standard , 2007, IEEE Transactions on Circuits and Systems for Video Technology.

[18]  Song Zhang,et al.  High-Resolution, Real-time 3D Shape Acquisition , 2004, 2004 Conference on Computer Vision and Pattern Recognition Workshop.