Multisensor Collaboration Network for Video Compression Based on Wavelet Decomposition

Multi-sensor data has significant advantages over single sensor data due to the unique property of each sensor. We discover that multi-sensor collaboration can save bits and achieve stable compression performance according to quantization parameter (QP). In this paper, we propose a multi-sensor collaboration network for video compression based on wavelet decomposition, called MSCN. We introduce MSCN into 3D video coding based on color and depth sensors. The images acquired by a color sensor represent color and texture of the scene, while the images obtained by a depth sensor represent 3D geometric shape of the scene objects. Two sensor data are complementary, and color images help to reconstruct their corresponding depth images. First, we perform uniform sampling on the input depth video. Then, we compress the color and downsampled depth videos using 3D-HEVC codec. Finally, we reconstruct the depth video from the decoded color and depth videos by color guided depth super-resolution (SR). Experimental results show that MSCN achieves average BD-rate reductions of {−9.3%, −65.6%, −66.3%} and {−6.2%, −67.7%, and −69.2%} on 3D-HEVC test datasets for sampling factors 1, 2 and 4 in Random Access (RA) and All Intra (AI) configurations, respectively. Moreover, they verify that multi-sensor collaboration remarkably saves bits in video compression.

[1]  Kebin Jia,et al.  Fast Depth Intra Coding Based on Depth Edge Classification Network in 3D-HEVC , 2022, IEEE Transactions on Broadcasting.

[2]  Liquan Shen,et al.  Patch-Wise Spatial-Temporal Quality Enhancement for HEVC Compressed Video , 2021, IEEE Transactions on Image Processing.

[3]  Ling Shao,et al.  Multi-Stage Progressive Image Restoration , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[4]  Gary J. Sullivan,et al.  Developments in International Video Coding Standardization After AVC, With an Overview of Versatile Video Coding (VVC) , 2021, Proceedings of the IEEE.

[5]  Yui-Lam Chan,et al.  Efficient Depth Intra Frame Coding in 3D-HEVC by Corner Points , 2020, IEEE Transactions on Image Processing.

[6]  Jian Jin,et al.  Pixel-Level View Synthesis Distortion Estimation for 3D Video Coding , 2020, IEEE Transactions on Circuits and Systems for Video Technology.

[7]  Gustavo Sanchez,et al.  Fast 3D-HEVC Depth Map Encoding Using Machine Learning , 2020, IEEE Transactions on Circuits and Systems for Video Technology.

[8]  Kebin Jia,et al.  Edge-Detection Based Fast Intra-Mode Selection for Depth Map Coding in 3D-HEVC , 2019, 2019 IEEE Visual Communications and Image Processing (VCIP).

[9]  Dong Liu,et al.  Convolutional Neural Network-Based Block Up-Sampling for HEVC , 2019, IEEE Transactions on Circuits and Systems for Video Technology.

[10]  Xinbo Gao,et al.  Lightweight Image Super-Resolution with Information Multi-distillation Network , 2019, ACM Multimedia.

[11]  Zoe Liu,et al.  Learning-Based Multi-Frame Video Quality Enhancement , 2019, 2019 IEEE International Conference on Image Processing (ICIP).

[12]  Chen Change Loy,et al.  EDVR: Video Restoration With Enhanced Deformable Convolutional Networks , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[13]  Kwanghoon Sohn,et al.  A Large RGB-D Dataset for Semi-supervised Monocular Depth Estimation , 2019, ArXiv.

[14]  Mariana Afonso,et al.  Video Compression Based on Spatio-Temporal Resolution Adaptation , 2019, IEEE Transactions on Circuits and Systems for Video Technology.

[15]  Dong Liu,et al.  Convolutional Neural Network-Based Residue Super-Resolution for Video Coding , 2018, 2018 IEEE Visual Communications and Image Processing (VCIP).

[16]  Chia-Chen Lin,et al.  Acceleration of Depth Intra Coding for 3D-HEVC by Efficient Early Termination Algorithm , 2018, 2018 IEEE Asia Pacific Conference on Circuits and Systems (APCCAS).

[17]  Xinfeng Zhang,et al.  A Dual-Network Based Super-Resolution for Compressed High Definition Video , 2018, PCM.

[18]  Shiqi Wang,et al.  Convolutional Neural Network-Based Synthesized View Quality Enhancement for 3D Video Coding , 2018, IEEE Transactions on Image Processing.

[19]  Qiang Wu,et al.  Minimum Spanning Forest With Embedded Edge Inconsistency Measurement Model for Guided Depth Map Enhancement , 2018, IEEE Transactions on Image Processing.

[20]  Jing Zhang,et al.  Image guided depth enhancement via deep fusion and local linear regularizaron , 2017, 2017 IEEE International Conference on Image Processing (ICIP).

[21]  Yao Zhao,et al.  Convolutional neural network-based depth image artifact removal , 2017, 2017 IEEE International Conference on Image Processing (ICIP).

[22]  Wuzhen Shi,et al.  An End-to-End Compression Framework Based on Convolutional Neural Networks , 2017, 2017 Data Compression Conference (DCC).

[23]  Dong Liu,et al.  Convolutional Neural Network-Based Block Up-Sampling for Intra Frame Coding , 2017, IEEE Transactions on Circuits and Systems for Video Technology.

[24]  Xiaoou Tang,et al.  Learning a Deep Convolutional Network for Image Super-Resolution , 2014, ECCV.

[25]  Lai-Man Po,et al.  Adaptive block truncation filter for MVC depth image enhancement , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[26]  Kwanghoon Sohn,et al.  Reliability-Based Multiview Depth Enhancement Considering Interview Coherence , 2014, IEEE Transactions on Circuits and Systems for Video Technology.

[27]  Heiko Schwarz,et al.  3D High-Efficiency Video Coding for Multi-View Video and Depth Data , 2013, IEEE Transactions on Image Processing.

[28]  Gary J. Sullivan,et al.  Overview of the High Efficiency Video Coding (HEVC) Standard , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[29]  Ju Liu,et al.  Coding Distortion Elimination of Virtual View Synthesis for 3D Video System: Theoretical Analyses and Implementation , 2012, IEEE Transactions on Broadcasting.

[30]  Christoph Fehn,et al.  Depth-image-based rendering (DIBR), compression, and transmission for a new approach on 3D-TV , 2004, IS&T/SPIE Electronic Imaging.

[31]  Ajay Luthra,et al.  Overview of the H.264/AVC video coding standard , 2003, SPIE Optics + Photonics.

[32]  Mao Ye,et al.  Coarse-to-Fine Spatio-Temporal Information Fusion for Compressed Video Quality Enhancement , 2022, IEEE Signal Processing Letters.

[33]  Nataliya Shapovalova,et al.  SUPERVEGAN: Super Resolution Video Enhancement GAN for Perceptually Improving Low Bitrate Streams , 2021, IEEE Access.

[34]  Qiong Liu,et al.  Adaptive Multi-Modality Residual Network for Compression Distorted Multi-View Depth Video Enhancement , 2020, IEEE Access.

[35]  Ying Chen,et al.  Overview of the Multiview and 3D Extensions of High Efficiency Video Coding , 2016, IEEE Transactions on Circuits and Systems for Video Technology.

[36]  Chun-Su Park,et al.  Edge-Based Intramode Selection for Depth-Map Coding in 3D-HEVC , 2015, IEEE Transactions on Image Processing.

[37]  C. Poynton MPEG-2 Video Compression , 2006, Encyclopedia of Multimedia.