论文信息 - Predictive coding of depth images across multiple views

Predictive coding of depth images across multiple views

A 3D video stream is typically obtained from a set of synchronized cameras, which are simultaneously capturing the same scene (multiview video). This technology enables applications such as free-viewpoint video which allows the viewer to select his preferred viewpoint, or 3D TV where the depth of the scene can be perceived using a special display. Because the user-selected view does not always correspond to a camera position, it may be necessary to synthesize a virtual camera view. To synthesize such a virtual view, we have adopted a depth image-based rendering technique that employs one depth map for each camera. Consequently, a remote rendering of the 3D video requires a compression technique for texture and depth data. This paper presents a predictivecoding algorithm for the compression of depth images across multiple views. The presented algorithm provides (a) an improved coding efficiency for depth images over block-based motion-compensation encoders (H.264), and (b), a random access to different views for fast rendering. The proposed depth-prediction technique works by synthesizing/computing the depth of 3D points based on the reference depth image. The attractiveness of the depth-prediction algorithm is that the prediction of depth data avoids an independent transmission of depth for each view, while simplifying the view interpolation by synthesizing depth images for arbitrary view points. We present experimental results for several multiview depth sequences, that result in a quality improvement of up to 1.8 dB as compared to H.264 compression.

Peter H. N. de With | Yannick Morvan | Dirk Farin

[1] Richard Szeliski,et al. High-quality video view interpolation using a layered representation , 2004, SIGGRAPH 2004.

[2] George Wolberg,et al. Digital image warping , 1990 .

[3] Harry Shum,et al. Review of image-based rendering techniques , 2000, Visual Communications and Image Processing.

[4] André Kaup,et al. H.264/AVC-compatible coding of dynamic light fields using transposed picture ordering , 2005, 2005 13th European Signal Processing Conference.

[5] Yannick Morvan,et al. Design considerations for view interpolation in a 3D video coding framework , 2006 .

[6] Jr. Leonard McMillan,et al. An Image-Based Approach to Three-Dimensional Computer Graphics , 1997 .

[7] Anthony Vetro,et al. View Synthesis for Multiview Video Compression , 2006 .

[8] Bernhard P. Wrobel,et al. Multiple View Geometry in Computer Vision , 2001 .

[9] Manuel Menezes de Oliveira Neto,et al. Relief texture mapping , 2000, SIGGRAPH.

[10] Steven M. Seitz,et al. View morphing , 1996, SIGGRAPH.

[11] Marcus A. Magnor,et al. Data compression for light-field rendering , 2000, IEEE Trans. Circuits Syst. Video Technol..

[12] Aljoscha Smolic,et al. Efficient Compression of Multi-View Video Exploiting Inter-View Dependencies Based on H.264/MPEG4-AVC , 2006, 2006 IEEE International Conference on Multimedia and Expo.

[13] Y. Morvan,et al. View interpolation along a chain of weakly calibrated cameras , 2008 .