Subgraphs Matching-Based Side Information Generation for Distributed Multiview Video Coding

We adopt constrained relaxation for distributed multiview video coding (DMVC). The proposed framework integrates graph-based segmentation and matching to generate inter-view correlated side information without knowledge of the camera parameters; it is inspired by subgraph semantics and the sparse decomposition of high-dimensional scale-invariant feature data. The sparse data serve as a hypothesis space in which the matching of inter-view side information with compact syndromes is optimized, drawing on the inferred relaxed coset. Plausible filling-in from a priori feature constraints between neighboring views can reinforce the compensation used in inter-view side-information generation for joint multiview decoding. Graph-based representations of the multiview images act as the constrained relaxation: they assist inter-view correlation matching for the subgraph semantics of the original Wyner-Ziv image, using graph-based image segmentation together with the scale-invariant feature detector MSER (maximally stable extremal regions) and the descriptor SIFT (scale-invariant feature transform). To obtain distinctive feature matches with a more stable approximation, linear (PCA-SIFT) and nonlinear (locally linear embedding) projections are adopted to reduce the dimension of the SIFT descriptors, and a TPS (thin plate spline) warping model is used to capture a more accurate inter-view motion model. Experimental results validate the high estimation precision and the rate-distortion improvements.
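The descriptor-reduction step can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes synthetic 128-dimensional vectors standing in for real SIFT descriptors, a plain PCA fitted by SVD in place of a trained PCA-SIFT basis, and a nearest-neighbour ratio test as the matching rule (the abstract does not specify one). The names `pca_fit`, `pca_project`, and `match_ratio_test` are hypothetical.

```python
import numpy as np

def pca_fit(descriptors, n_components):
    """Fit PCA on the rows of `descriptors`; return (mean, basis)."""
    mean = descriptors.mean(axis=0)
    centered = descriptors - mean
    # Rows of vt are principal directions, ordered by decreasing variance.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return mean, vt[:n_components]

def pca_project(descriptors, mean, basis):
    """Project descriptors onto the low-dimensional PCA basis."""
    return (descriptors - mean) @ basis.T

def match_ratio_test(a, b, ratio=0.8):
    """Match each row of `a` to its nearest row of `b`, keeping a match only
    if the best distance is below `ratio` times the second-best distance."""
    dists = np.linalg.norm(a[:, None, :] - b[None, :, :], axis=2)
    order = np.argsort(dists, axis=1)
    best, second = order[:, 0], order[:, 1]
    rows = np.arange(a.shape[0])
    keep = dists[rows, best] < ratio * dists[rows, second]
    return [(int(i), int(best[i])) for i in rows[keep]]

# Synthetic stand-in for descriptors from two neighboring views (assumption):
# view_b holds the same features as view_a, shuffled and slightly perturbed.
rng = np.random.default_rng(0)
view_a = rng.random((50, 128))
perm = rng.permutation(50)
view_b = view_a[perm] + 0.01 * rng.standard_normal((50, 128))

# Fit one shared basis on both views, then match in the reduced space.
mean, basis = pca_fit(np.vstack([view_a, view_b]), n_components=20)
proj_a = pca_project(view_a, mean, basis)
proj_b = pca_project(view_b, mean, basis)
matches = match_ratio_test(proj_a, proj_b)
```

Both views are projected with the same basis so that distances in the reduced space remain comparable. In a full pipeline the surviving matches would serve as control points for a warping model such as TPS, which is outside the scope of this sketch.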
