Graph-based representation for multiview image coding

In this paper, we propose a new representation for multiview image sets. Our approach relies on graphs to describe geometry information in a compact and controllable way. The links of the graph connect pixels in different images and describe the proximity between pixels in the 3D space. These connections are dependent on the geometry of the scene and provide the right amount of information that is necessary for coding and reconstructing multiple views. This multiview image representation is very compact and adapts the transmitted geometry information as a function of the complexity of the prediction performed at the decoder side. To achieve this, our GBR adapts the accuracy of the geometry representation, in contrast with depth coding, which directly compresses with losses the original geometry signal. We present the principles of this graph-based representation (GBR) and we build a complete prototype coding scheme for multiview images. Experimental results demonstrate the potential of this new representation as compared to a depth-based approach. GBR can achieve a gain of 2 dB in reconstructed quality over depth-based schemes operating at similar rates.

[1]  Sehoon Yea,et al.  View synthesis prediction for multiview video coding , 2009, Signal Process. Image Commun..

[2]  Yannick Morvan,et al.  Joint depth/texture bit-allocation for multi-view video compression , 2010 .

[3]  Josep R. Casas,et al.  Multi-View Video Representation Based on Fast Monte Carlo Surface Reconstruction , 2013, IEEE Transactions on Image Processing.

[4]  Heiko Schwarz,et al.  3D High-Efficiency Video Coding for Multi-View Video and Depth Data , 2013, IEEE Transactions on Image Processing.

[5]  Thomas Maugey,et al.  Rate-distortion analysis of multiview coding in a DIBR framework , 2012, Ann. des Télécommunications.

[6]  Harry Shum,et al.  Plenoptic sampling , 2000, SIGGRAPH.

[7]  Antonio Ortega,et al.  Depth map distortion analysis for view rendering and depth coding , 2009, 2009 16th IEEE International Conference on Image Processing (ICIP).

[8]  Thomas Maugey,et al.  Multiview image coding using graph-based approach , 2013, IVMSP 2013.

[9]  Aljoscha Smolic,et al.  Multi-View Video Plus Depth Representation and Coding , 2007, 2007 IEEE International Conference on Image Processing.

[10]  Vladan Velisavljevic,et al.  Multiview Image Coding Using Depth Layers and an Optimized Bit Allocation , 2012, IEEE Transactions on Image Processing.

[11]  Antonio Ortega,et al.  Depth map coding using graph based transform and transform domain sparsification , 2011, 2011 IEEE 13th International Workshop on Multimedia Signal Processing.

[12]  Thomas Maugey,et al.  Graph-based representation and coding of multiview geometry , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[13]  Dong Tian,et al.  New Depth Coding Techniques With Utilization of Corresponding Video , 2011, IEEE Transactions on Broadcasting.

[14]  Yusheng Ji,et al.  Expansion hole filling in depth-image-based rendering using graph-based interpolation , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[15]  Thomas Maugey,et al.  Extended Layered Depth Image Representation in Multiview Navigation , 2014, IEEE Signal Processing Letters.

[16]  Antonio Ortega,et al.  On Dependent Bit Allocation for Multiview Image Coding With Depth-Image-Based Rendering , 2011, IEEE Transactions on Image Processing.

[17]  S. B. Kang,et al.  Survey of image-based representations and compression techniques , 2003, IEEE Trans. Circuits Syst. Video Technol..

[18]  Gene Cheung,et al.  Arithmetic edge coding for arbitrarily shaped sub-block motion prediction in depth video compression , 2012, 2012 19th IEEE International Conference on Image Processing.

[19]  Yo-Sung Ho,et al.  Mesh-Based Depth Coding for 3D Video using Hierarchical Decomposition of Depth Maps , 2007, 2007 IEEE International Conference on Image Processing.

[20]  Qionghai Dai,et al.  Free Viewpoint Video Coding With Rate-Distortion Analysis , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[21]  S. Foix,et al.  Lock-in Time-of-Flight (ToF) Cameras: A Survey , 2011, IEEE Sensors Journal.

[22]  Dong Tian,et al.  View synthesis techniques for 3D video , 2009, Optical Engineering + Applications.

[23]  Thomas Wiegand,et al.  3-D Video Representation Using Depth Maps , 2011, Proceedings of the IEEE.