Texture side information generation for distributed coding of video-plus-depth

We consider distributed video coding in a monoview video-plus-depth scenario, aiming at coding textures jointly with their corresponding depth stream. Distributed Video Coding (DVC) is a video coding paradigm in which the complexity is shifted from the encoder to the decoder. The Side Information (SI) generation is an important element of the decoder, since the SI is the estimation of the to-be-decoded frame. Depth maps enable the calculation of the distance of an object from the camera. The motion between depth frames and their corresponding texture frames (luminance and chrominance components) is strongly correlated, so the additional depth information may be used to generate more accurate SI for the texture stream, increasing the efficiency of the system. In this paper we propose various methods for accurate texture SI generation, comparing them with other state-of-the-art solutions. The proposed system achieves gains on the reference decoder up to 1.49 dB.

[1]  Mads Nielsen,et al.  Multi-hypothesis transform domain Wyner-Ziv video coding including optical flow , 2011, 2011 IEEE 13th International Workshop on Multimedia Signal Processing.

[2]  Frédéric Dufaux,et al.  Wyner-ziv coding for depth maps in multiview video-plus-depth , 2011, 2011 18th IEEE International Conference on Image Processing.

[3]  Zhengyou Zhang,et al.  Low-complexity, near-lossless coding of depth maps from kinect-like depth cameras , 2011, 2011 IEEE 13th International Workshop on Multimedia Signal Processing.

[4]  Joachim Weickert,et al.  Motion Compensated Frame Interpolation with a Symmetric Optical Flow Constraint , 2012, ISVC.

[5]  Xin Huang,et al.  Cross-band noise model refinement for transform domain Wyner-Ziv video coding , 2012, Signal Process. Image Commun..

[6]  Richard Szeliski,et al.  High-quality video view interpolation using a layered representation , 2004, SIGGRAPH 2004.

[7]  X. Artigas,et al.  Side Information Generation for Multiview Distributed Video Coding Using a Fusion Approach , 2006, Proceedings of the 7th Nordic Signal Processing Symposium - NORSIG 2006.

[8]  Xin Huang,et al.  Side Information and Noise Learning for Distributed Video Coding Using Optical Flow and Clustering , 2012, IEEE Transactions on Image Processing.

[9]  Marco Dalai,et al.  The DISCOVER codec: Architecture, Techniques and Evaluation , 2007, PCS 2007.

[10]  Bernd Girod,et al.  Rate-adaptive codes for distributed source coding , 2006, Signal Process..

[11]  Kai-Kuang Ma,et al.  Adaptive rood pattern search for fast block-matching motion estimation , 2002, IEEE Trans. Image Process..

[12]  Jacob Søgaard,et al.  Exploiting the error-correcting capabilities of low-density parity check codes in distributed video coding using optical flow , 2012, Other Conferences.

[13]  D. Cremers,et al.  Duality TV-L1 flow with fundamental matrix prior , 2008, 2008 23rd International Conference Image and Vision Computing New Zealand.

[14]  Aaron D. Wyner,et al.  The rate-distortion function for source coding with side information at the decoder , 1976, IEEE Trans. Inf. Theory.

[15]  Bernd Girod,et al.  Distributed Video Coding , 2005, Proceedings of the IEEE.

[16]  Marco Zamarin,et al.  Distributed multi-hypothesis coding of depth maps using texture motion information and optical flow , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[17]  Horst Bischof,et al.  A Duality Based Approach for Realtime TV-L1 Optical Flow , 2007, DAGM-Symposium.

[18]  Jack K. Wolf,et al.  Noiseless coding of correlated information sources , 1973, IEEE Trans. Inf. Theory.

[19]  Heiko Schwarz,et al.  Motion vector inheritance for high efficiency 3D video plus depth coding , 2012, 2012 Picture Coding Symposium.

[20]  G. Bjontegaard,et al.  Calculation of Average PSNR Differences between RD-curves , 2001 .