Convolutional Neural Network Based Up-Sampling for Depth Video Intra Coding

Depth video contains depth and disparity information of a scene, which is critical for 3D video systems. In this paper, a convolutional neural network (CNN) based block upsampling method is proposed to improve the efficiency of depth video intra coding. For each largest coding tree in a depth map, it is down-sampled before sent into encoder and recovered into the original size in an intelligent way after low-resolution coding. A novel texture-assisted CNN (TACNN) is presented to handle the depth block up-sampling. The network is made up of several residual coding units and the features of texture block are extracted to assist the reconstruction of the corresponding depth block. Experimental results show that the proposed method achieves competitive rate-distortion performance compared with the state-of-the-art approaches.

[1]  Wen-Huang Cheng,et al.  Enhanced Intra Prediction with Recurrent Neural Network in Video Coding , 2018, 2018 Data Compression Conference.

[2]  Li Yu,et al.  Simplified Depth Intra Coding Based on Texture Feature and Spatial Correlation in 3D-HEVC , 2018, 2018 Data Compression Conference.

[3]  Bruno Zatt,et al.  Complexity reduction for 3D-HEVC depth maps intra-frame prediction using simplified edge detector algorithm , 2014, 2014 IEEE International Conference on Image Processing (ICIP).

[4]  Shuai Li,et al.  Depth Coding Based on Depth-Texture Motion and Structure Similarities , 2015, IEEE Transactions on Circuits and Systems for Video Technology.

[5]  Dong Liu,et al.  Neural network-based arithmetic coding of intra prediction modes in HEVC , 2017, 2017 IEEE Visual Communications and Image Processing (VCIP).

[6]  Bin Li,et al.  Fully Connected Network-Based Intra Prediction for Image Coding , 2018, IEEE Transactions on Image Processing.

[7]  Thomas Wiegand,et al.  3-D Video Representation Using Depth Maps , 2011, Proceedings of the IEEE.

[8]  Kyoung Mu Lee,et al.  Enhanced Deep Residual Networks for Single Image Super-Resolution , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[9]  Detlev Marpe,et al.  Depth Intra Coding for 3D Video Based on Geometric Primitives , 2016, IEEE Transactions on Circuits and Systems for Video Technology.

[10]  Dong Liu,et al.  Convolutional Neural Network-Based Block Up-Sampling for Intra Frame Coding , 2017, IEEE Transactions on Circuits and Systems for Video Technology.

[11]  Jianjun Lei,et al.  Region Adaptive R- $\lambda$ Model-Based Rate Control for Depth Maps Coding , 2018, IEEE Transactions on Circuits and Systems for Video Technology.

[12]  Yo-Sung Ho,et al.  Depth Video Coding Using Adaptive Geometry Based Intra Prediction for 3-D Video Systems , 2012, IEEE Transactions on Multimedia.

[13]  Tao Zhang,et al.  Convolutional Neural Networks Based Intra Prediction for HEVC , 2017, 2017 Data Compression Conference (DCC).