Multiview video plus depth transmission via virtual-view-assisted complementary down/upsampling

Multiview video plus depth is a popular 3D video format which can provide viewers a vivid 3D feeling. However, its requirements in terms of computational complexity and transmission bandwidth are more than that of conventional 2D video. To mitigate these limitations, some works have proposed to reduce the amount of transmitted data by adopting different resolutions for different views, and consequently, the transmitted video is called mixed resolution video. In order to further reduce the transmitted data and maintain good quality at the decoder side; in this paper, we propose a down/upsampling algorithm for 3D multiview video which systematically takes into account the video encoder and decoder. At the encoder side, the rows of the two adjacent views are downsampled following an interlacing and complementary fashion, whereas, at the decoder side, the discarded pixels are recovered by fusing the virtual view pixels with the directional interpolated pixels from the complementary downsampled views. Moreover, the patterns of the texture surrounding the discarded pixels are used to aid the data fusion, so as to enhance edges recovery. Meanwhile, with the assistance of virtual views, at the decoder side, the proposed approach can effectively recover the discarded high-frequency details. The experimental results demonstrate the superior performance of the proposed framework.

[1]  Qionghai Dai,et al.  Stereo Interleaving Video Coding With Content Adaptive Image Subsampling , 2013, IEEE Transactions on Circuits and Systems for Video Technology.

[2]  Anthony Vetro Frame compatible formats for 3D video distribution , 2010, 2010 IEEE International Conference on Image Processing.

[3]  G. W. STEWARTt ON THE EARLY HISTORY OF THE SINGULAR VALUE DECOMPOSITION * , 2022 .

[4]  Zhi-Gang Zheng,et al.  A region based stereo matching algorithm using cooperative optimization , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[5]  Yongdong Zhang,et al.  Parallel deblocking filter for HEVC on many-core processor , 2014 .

[6]  Jui-Chiu Chiang,et al.  Frame-compatible asymmetric stereo video coding considering human perception , 2014, 2014 IEEE Global Conference on Signal and Information Processing (GlobalSIP).

[7]  Liang Li,et al.  Efficient parallel HEVC intra-prediction on many-core processor , 2014 .

[8]  Gary J. Sullivan,et al.  Overview of the Stereo and Multiview Video Coding Extensions of the H.264/MPEG-4 AVC Standard , 2011, Proceedings of the IEEE.

[9]  Xiangjun Zhang,et al.  Low Bit-Rate Image Compression via Adaptive Down-Sampling and Constrained Least Squares Upconversion , 2009, IEEE Transactions on Image Processing.

[10]  Miska M. Hannuksela,et al.  Subjective study on compressed asymmetric stereoscopic video , 2010, 2010 IEEE International Conference on Image Processing.

[11]  Keith J. Hanna,et al.  Hybrid stereo camera: an IBR approach for synthesis of very high resolution stereoscopic image sequences , 2001, SIGGRAPH.

[12]  Markus Flierl,et al.  Motion and Disparity Compensated Coding for Multiview Video , 2007, IEEE Transactions on Circuits and Systems for Video Technology.

[13]  Yao Zhao,et al.  Virtual-View-Assisted Video Super-Resolution and Enhancement , 2016, IEEE Transactions on Circuits and Systems for Video Technology.

[14]  Weisi Lin,et al.  Adaptive downsampling/upsampling for better video compression at low bit rate , 2008, 2008 IEEE International Symposium on Circuits and Systems.

[15]  Yongdong Zhang,et al.  A Highly Parallel Framework for HEVC Coding Unit Partitioning Tree Decision on Many-core Processors , 2014, IEEE Signal Processing Letters.

[16]  C. Fehn A 3D-TV system based on video plus depth information , 2003, The Thrity-Seventh Asilomar Conference on Signals, Systems & Computers, 2003.

[17]  Christoph Fehn,et al.  Depth-image-based rendering (DIBR), compression, and transmission for a new approach on 3D-TV , 2004, IS&T/SPIE Electronic Imaging.

[18]  Guangming Shi,et al.  Image compression with downsampling and overlapped transform at low bit rates , 2009, 2009 16th IEEE International Conference on Image Processing (ICIP).

[19]  Ricardo L. de Queiroz,et al.  Video compression complexity reduction with adaptive down-sampling , 2011, 2011 18th IEEE International Conference on Image Processing.

[20]  Zheng Zhi A Region Based Stereo Matching Algorithm Using Cooperative Optimization , 2009 .

[21]  Camilo C. Dorea,et al.  Super Resolution for Multiview Images Using Depth Information , 2010, IEEE Transactions on Circuits and Systems for Video Technology.

[22]  Qionghai Dai,et al.  Opposite parity packing arrangement for stereoscopic video coding , 2011 .

[23]  Zhiyong Gao,et al.  Principal Components Analysis-Based Edge-Directed Image Interpolation , 2012, 2012 IEEE International Conference on Multimedia and Expo.

[24]  Aljoscha Smolic,et al.  Efficient Prediction Structures for Multiview Video Coding , 2007, IEEE Transactions on Circuits and Systems for Video Technology.

[25]  Heng Tao Shen,et al.  Principal Component Analysis , 2009, Encyclopedia of Biometrics.

[26]  Yongdong Zhang,et al.  Efficient Parallel Framework for HEVC Motion Estimation on Many-Core Processors , 2014, IEEE Transactions on Circuits and Systems for Video Technology.