Paper information - Reconstruction for Distributed Video Coding: A Context-Adaptive Markov Random Field Approach

Existing reconstruction in distributed video coding (DVC) follows two major approaches: maximum probability reconstruction and minimum mean square error (MMSE) reconstruction. Both assume that the nodes (pixels in pixel-domain DVC, coefficients in transform-domain DVC) are i.i.d., and reconstruct the value of each node independently by exploiting only the statistical correlation between the source and the side information. Such models produce a considerable number of artifacts in decoded Wyner-Ziv (WZ) frames and degrade objective performance. In this paper, we propose a context-adaptive Markov random field (MRF) reconstruction algorithm that exploits both the statistical correlation and the spatio-temporal consistency by modeling the MRF corresponding to a generic DVC architecture, and solves the inference problem by finding the MRF-based maximum a posteriori (MAP) estimate. The energy function of the MRF model consists of two terms: a data term measuring the statistical correlation, and a geometric regularity term enforcing local spatio-temporal structure consistency, which is modeled by optical flow estimation with regard to the critical parameters under a wide variety of DVC scenarios. In case the derived local structure is unreliable, a confidence parameter is introduced to prevent inappropriate penalization. To find the reconstructed patch assignment with the largest expected probability in the context-adaptive MRF, the energy minimization for the MRF-based MAP estimate of the WZ frames is solved by global optimization and greedy strategies. Extensive experiments validate better subjective and objective performance than the existing maximum probability and MMSE reconstruction under the i.i.d. model.
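The two-term energy and its greedy minimization can be illustrated with a minimal sketch. This is not the authors' implementation: the squared-difference data term, the absolute-difference regularity term over 4-connected neighbours, the per-edge weight taken as the smaller of the two node confidences, the integer candidate set inside each decoded quantization bin, and all function and parameter names are assumptions made for illustration. The greedy solver used here is iterated conditional modes (ICM), a standard greedy technique for MRF energies, and the optical-flow-based structure model is omitted entirely.

```python
NEIGHBOURS = ((-1, 0), (1, 0), (0, -1), (0, 1))  # 4-connected grid (assumption)


def local_energy(labels, y, x, value, side_info, confidence, lam):
    """Energy contributed by assigning `value` to node (y, x)."""
    h, w = len(labels), len(labels[0])
    # Data term: stand-in for the source / side-information correlation model.
    data = (value - side_info[y][x]) ** 2
    # Regularity term: disagreement with neighbours, scaled by confidence.
    reg = 0.0
    for dy, dx in NEIGHBOURS:
        ny, nx = y + dy, x + dx
        if 0 <= ny < h and 0 <= nx < w:
            weight = min(confidence[y][x], confidence[ny][nx])
            reg += weight * abs(value - labels[ny][nx])
    return data + lam * reg


def total_energy(labels, side_info, confidence, lam):
    """Global energy: all data terms plus each edge counted once."""
    h, w = len(labels), len(labels[0])
    energy = 0.0
    for y in range(h):
        for x in range(w):
            energy += (labels[y][x] - side_info[y][x]) ** 2
            for dy, dx in ((1, 0), (0, 1)):  # down and right edges only
                ny, nx = y + dy, x + dx
                if ny < h and nx < w:
                    weight = min(confidence[y][x], confidence[ny][nx])
                    energy += lam * weight * abs(labels[y][x] - labels[ny][nx])
    return energy


def icm_reconstruct(side_info, bins, confidence, lam=1.0, max_sweeps=10):
    """Greedy (ICM) minimization; bins[y][x] = (lo, hi) inclusive integer range."""
    h, w = len(side_info), len(side_info[0])
    # Independent initialization: clamp the side information into each bin.
    labels = [
        [min(max(side_info[y][x], bins[y][x][0]), bins[y][x][1]) for x in range(w)]
        for y in range(h)
    ]
    for _ in range(max_sweeps):
        changed = False
        for y in range(h):
            for x in range(w):
                lo, hi = bins[y][x]
                best = min(
                    range(lo, hi + 1),
                    key=lambda v: local_energy(
                        labels, y, x, v, side_info, confidence, lam
                    ),
                )
                if best != labels[y][x]:
                    labels[y][x] = best
                    changed = True
        if not changed:  # no node moved in a full sweep: local minimum reached
            break
    return labels
```

Calling `icm_reconstruct(side_info, bins, confidence, lam=2.0)` on a small grid whose side information contains an isolated outlier pulls that node toward its neighbours, while setting the node's confidence to 0 disables the regularity penalty there and leaves it at its data-term optimum. Taking the minimum of the two node confidences keeps every edge weight symmetric, so each per-node update cannot increase `total_energy`.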
