Reconstruction for Distributed Video Coding: A Context-Adaptive Markov Random Field Approach

Existing reconstruction methods in distributed video coding (DVC) follow two major approaches: maximum probability reconstruction and minimum mean square error (MMSE) reconstruction. Both assume that the nodes (pixels in pixel-domain DVC, coefficients in transform-domain DVC) are i.i.d., and reconstruct the value of each node independently by exploiting only the statistical correlation between the source and the side information. Such models produce a considerable number of artifacts in decoded Wyner-Ziv (WZ) frames and degrade objective performance. In this paper, we propose a context-adaptive Markov random field (MRF) reconstruction algorithm that exploits both the statistical correlation and the spatio-temporal consistency by modeling a generic DVC architecture as an MRF and solving the inference problem by finding its MRF-based maximum a posteriori (MAP) estimate. The energy function of the MRF model consists of two terms: a data term measuring the statistical correlation, and a geometric regularity term enforcing local spatio-temporal structure consistency, which is modeled by optical flow estimation with regard to the critical parameters under a wide variety of DVC scenarios. In case the derived local structure is unreliable, a confidence parameter is introduced to prevent inappropriate penalization. To find the reconstructed patch assignment with the largest expected probability in the context-adaptive MRF, the energy minimization for the MRF-based MAP estimate of the WZ frames is solved by global optimization and greedy strategies. Extensive experiments validate better subjective and objective performance than the existing maximum probability and MMSE reconstruction with the i.i.d. model.
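The two-term energy and the greedy minimization described above can be illustrated with a minimal sketch. It is not the algorithm of the paper: it assumes an absolute-difference data term against the side information, a plain 4-neighbour spatial smoothness term in place of the optical-flow-based structure term, an edge weight formed as the product of two per-pixel confidences, and iterated conditional modes (ICM) as the greedy strategy. The names `mrf_energy`, `icm_reconstruct`, `alpha`, and `lam` are hypothetical, and each pixel is restricted to an assumed decoded quantization bin `[bin_lo, bin_hi]`.

```python
import numpy as np


def mrf_energy(x, side_info, confidence, alpha=1.0, lam=0.5):
    """Toy MRF energy: data term plus confidence-weighted smoothness term."""
    # Data term (assumed form): absolute deviation from the side information.
    data = alpha * np.abs(x - side_info).sum()
    # Smoothness term over horizontal and vertical edges, each counted once.
    # The edge weight is the product of the two endpoint confidences (assumption).
    w_h = confidence[:, :-1] * confidence[:, 1:]
    w_v = confidence[:-1, :] * confidence[1:, :]
    smooth = (w_h * np.abs(x[:, :-1] - x[:, 1:])).sum()
    smooth += (w_v * np.abs(x[:-1, :] - x[1:, :])).sum()
    return float(data + lam * smooth)


def icm_reconstruct(side_info, bin_lo, bin_hi, confidence,
                    alpha=1.0, lam=0.5, n_iters=5):
    """Greedy (ICM) minimization of mrf_energy with per-pixel bin constraints."""
    h, w = side_info.shape
    # Initialize with the side information clipped into each decoded bin.
    x = np.clip(side_info, bin_lo, bin_hi).astype(np.int64)
    for _ in range(n_iters):
        changed = False
        for r in range(h):
            for c in range(w):
                # Candidate values: every integer inside this pixel's bin.
                cand = np.arange(int(bin_lo[r, c]), int(bin_hi[r, c]) + 1)
                energy = alpha * np.abs(cand - side_info[r, c]).astype(float)
                for dr, dc in ((-1, 0), (1, 0), (0, -1), (0, 1)):
                    rr, cc = r + dr, c + dc
                    if 0 <= rr < h and 0 <= cc < w:
                        # Low confidence on either endpoint weakens the penalty.
                        weight = confidence[r, c] * confidence[rr, cc]
                        energy += lam * weight * np.abs(cand - x[rr, cc])
                best = cand[np.argmin(energy)]
                if best != x[r, c]:
                    x[r, c] = best
                    changed = True
        if not changed:  # local minimum reached
            break
    return x
```

Because the edge weight is symmetric, each ICM update cannot increase the total energy, so the loop stops at a local minimum. Setting a pixel's confidence to zero removes the smoothness penalty on its edges, which mirrors the role the abstract assigns to the confidence parameter when the local structure is unreliable.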
