论文信息 - Non-Local ConvLSTM for Video Compression Artifact Reduction

Non-Local ConvLSTM for Video Compression Artifact Reduction

Video compression artifact reduction aims to recover high-quality videos from low-quality compressed videos. Most existing approaches use a single neighboring frame or a pair of neighboring frames (preceding and/or following the target frame) for this task. Furthermore, as frames of high quality overall may contain low-quality patches, and high-quality patches may exist in frames of low quality overall, current methods focusing on nearby peak-quality frames (PQFs) may miss high-quality details in low-quality frames. To remedy these shortcomings, in this paper we propose a novel end-to-end deep neural network called non-local ConvLSTM (NL-ConvLSTM in short) that exploits multiple consecutive frames. An approximate non-local strategy is introduced in NL-ConvLSTM to capture global motion patterns and trace the spatiotemporal dependency in a video sequence. This approximate strategy makes the non-local module work in a fast and low space-cost way. Our method uses the preceding and following frames of the target frame to generate a residual, from which a higher quality frame is reconstructed. Experiments on two datasets show that NL-ConvLSTM outperforms the existing methods.

[1] Aggelos K. Katsaggelos,et al. Video Super-Resolution With Convolutional Neural Networks , 2016, IEEE Transactions on Computational Imaging.

[2] Jiajun Wu,et al. Video Enhancement with Task-Oriented Flow , 2018, International Journal of Computer Vision.

[3] Michael S. Bernstein,et al. ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[4] Lei Zhang,et al. Beyond a Gaussian Denoiser: Residual Learning of Deep CNN for Image Denoising , 2016, IEEE Transactions on Image Processing.

[5] Bo Yan,et al. An efficient deep convolutional neural networks model for compressed image deblocking , 2017, 2017 IEEE International Conference on Multimedia and Expo (ICME).

[6] Alberto Del Bimbo,et al. Deep Generative Adversarial Compression Artifact Removal , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[7] Zulin Wang,et al. Multi-frame Quality Enhancement for Compressed Video , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[8] Renjie Liao,et al. Detail-Revealing Deep Video Super-Resolution , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[9] Jean-Michel Morel,et al. A non-local algorithm for image denoising , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[10] Thomas S. Huang,et al. Non-Local Recurrent Network for Image Restoration , 2018, NeurIPS.

[11] Matthew A. Brown,et al. Frame-Recurrent Video Super-Resolution , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[12] Michael K. Ng,et al. Reducing Artifacts in JPEG Decompression Via a Learned Dictionary , 2014, IEEE Transactions on Signal Processing.

[13] Alessandro Foi,et al. Image Denoising by Sparse 3-D Transform-Domain Collaborative Filtering , 2007, IEEE Transactions on Image Processing.

[14] Matthias Zwicker,et al. Deep Mean-Shift Priors for Image Restoration , 2017, NIPS.

[15] Kai Zeng,et al. Characterizing perceptual artifacts in compressed video streams , 2014, Electronic Imaging.

[16] Gary J. Sullivan,et al. Overview of the High Efficiency Video Coding (HEVC) Standard , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[17] Xianming Liu,et al. Robust Video Super-Resolution with Learned Temporal Dynamics , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[18] Tingting Wang,et al. A Novel Deep Learning-Based Method of Improving Coding Efficiency from the Decoder-End for HEVC , 2017, 2017 Data Compression Conference (DCC).

[19] Avideh Zakhor,et al. An optimization approach for removing blocking effects in transform coding , 1995, IEEE Trans. Circuits Syst. Video Technol..

[20] Avideh Zakhor. Iterative procedures for reduction of blocking effects in transform image coding , 1992, IEEE Trans. Circuits Syst. Video Technol..

[21] Mohammad Amin Sadeghi,et al. BlockCNN: A Deep Network for Artifact Removal and Image Compression , 2018, CVPR Workshops.

[22] Dit-Yan Yeung,et al. Deep Learning for Precipitation Nowcasting: A Benchmark and A New Model , 2017, NIPS.

[23] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[24] Dong Liu,et al. A Convolutional Neural Network Approach for Post-Processing in HEVC Intra Coding , 2016, MMM.

[25] F. Bossen,et al. Common test conditions and software reference configurations , 2010 .

[26] Xiaoou Tang,et al. Compression Artifacts Reduction by a Deep Convolutional Network , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[27] Chao Yang,et al. Quality Enhancement for Intra Frame Coding Via Cnns: An Adversarial Approach , 2018, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[28] Hongyang Chao,et al. One-To-Many Network for Visually Pleasing Compression Artifacts Reduction , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[29] Christian Ledig,et al. Real-Time Video Super-Resolution with Spatio-Temporal Networks and Motion Compensation , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[30] Wen Gao,et al. Compression Artifact Reduction by Overlapped-Block Transform Coefficient Estimation With Block Similarity , 2013, IEEE Transactions on Image Processing.

[31] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.

[32] Xianming Liu,et al. Data-driven sparsity-based restoration of JPEG-compressed images in dual transform-pixel domain , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[33] Truong Q. Nguyen,et al. DPW-SDNet: Dual Pixel-Wavelet Domain Deep CNNs for Soft Decoding of JPEG-Compressed Images , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[34] Kai Cui,et al. Decoder Side Image Quality Enhancement exploiting Inter-channel Correlation in a 3-stage CNN: Submission to CLIC 2018 , 2018, CVPR Workshops.

[35] Jian Yang,et al. MemNet: A Persistent Memory Network for Image Restoration , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[36] Wangmeng Zuo,et al. Learning Deep CNN Denoiser Prior for Image Restoration , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[37] Bhaskar Ramamurthi,et al. Nonlinear space-variant postprocessing of block coded images , 1986, IEEE Trans. Acoust. Speech Signal Process..

[38] Pavel Zemcík,et al. Compression Artifacts Removal Using Convolutional Neural Networks , 2016, J. WSCG.

[39] Seoung Wug Oh,et al. Deep Video Super-Resolution Network Using Dynamic Upsampling Filters Without Explicit Motion Compensation , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[40] Nojun Kwak,et al. Image Restoration by Estimating Frequency Distribution of Local Patches , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[41] Hongyang Chao,et al. Building Dual-Domain Representations for Compression Artifacts Reduction , 2016, ECCV.

[42] Bernhard Schölkopf,et al. Spatio-Temporal Transformer Network for Video Restoration , 2018, ECCV.

[43] Abhinav Gupta,et al. Non-local Neural Networks , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[44] Dit-Yan Yeung,et al. Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting , 2015, NIPS.

[45] Zulin Wang,et al. Enhancing Quality for HEVC Compressed Videos , 2017, IEEE Transactions on Circuits and Systems for Video Technology.

[46] Yu-Bin Yang,et al. Image Restoration Using Very Deep Convolutional Encoder-Decoder Networks with Symmetric Skip Connections , 2016, NIPS.

[47] Dong Xu,et al. Deep Kalman Filtering Network for Video Compression Artifact Reduction , 2018, ECCV.

[48] Zulin Wang,et al. Decoder-side HEVC quality enhancement with scalable convolutional neural network , 2017, 2017 IEEE International Conference on Multimedia and Expo (ICME).

[49] Chun-Liang Li,et al. One Network to Solve Them All — Solving Linear Inverse Problems Using Deep Projection Models , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[50] Hong Yan,et al. An efficient wavelet-based deblocking algorithm for highly compressed images , 2001, IEEE Trans. Circuits Syst. Video Technol..

[51] Tie Liu,et al. MFQE 2.0: A New Approach for Multi-Frame Quality Enhancement on Compressed Video , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.