Focus on visual rendering quality through content-based depth map coding

Multi-view video plus depth (MVD) data is a set of multiple sequences capturing the same scene at different viewpoints, with their associated per-pixel depth value. Overcoming this large amount of data requires an effective coding framework. Yet, a simple but essential question refers to the means assessing the proposed coding methods. While the challenge in compression is the optimization of the rate-distortion ratio, a widely used objective metric to evaluate the distortion is the Peak-Signal-to-Noise-Ratio (PSNR), because of its simplicity and mathematically easiness to deal with such purposes. This paper points out the problem of reliability, concerning this metric, when estimating 3D video codec performances. We investigated the visual performances of two methods, namely H.264/MVC and Locally Adaptive Resolution (LAR) method, by encoding depth maps and reconstructing existing views from those degraded depth images. The experiments revealed that lower coding efficiency, in terms of PSNR, does not imply a lower rendering visual quality and that LAR method preserves the depth map properties correctly.

[1]  Ajay Luthra,et al.  Overview of the H.264/AVC video coding standard , 2003, SPIE Optics + Photonics.

[2]  Olivier Déforges,et al.  Color LAR Codec: A Color Image Representation and Compression Scheme Based on Local Resolution Adjustment and Self-Extracting Region Representation , 2007, IEEE Transactions on Circuits and Systems for Video Technology.

[3]  Klaus Hopf,et al.  Key technologies for an advanced 3D TV system , 2004, SPIE Optics East.

[4]  Ajay Luthra,et al.  The H.264/AVC Advanced Video Coding standard: overview and introduction to the fidelity range extensions , 2004, SPIE Optics + Photonics.

[5]  Anthony Vetro,et al.  View Synthesis for Multiview Video Compression , 2006 .

[6]  Christoph Fehn,et al.  Depth-image-based rendering (DIBR), compression, and transmission for a new approach on 3D-TV , 2004, IS&T/SPIE Electronic Imaging.

[7]  Klaus Diepold,et al.  Depth map compression via compressed sensing , 2009, 2009 16th IEEE International Conference on Image Processing (ICIP).

[8]  Itu-T and Iso Iec Jtc Advanced video coding for generic audiovisual services , 2010 .

[9]  Nuno M. M. Rodrigues,et al.  Compressing depth maps using multiscale recurrent pattern image coding , 2010 .

[10]  Gary J. Sullivan,et al.  Rate-constrained coder control and comparison of video coding standards , 2003, IEEE Trans. Circuits Syst. Video Technol..

[11]  Olivier Déforges,et al.  Interleaved S+P pyramidal decomposition with refined prediction model , 2005, IEEE International Conference on Image Processing 2005.

[12]  Ajay Luthra,et al.  Overview of the H.264/AVC video coding standard , 2003, IEEE Trans. Circuits Syst. Video Technol..